## Stata: Set Up

### Stata Windows

Introduction to Stata windows: restoring closed windows, personalizing window sizes, font sizes, and saving preferences.

Stata has four main windows:

- Variables - Lists variables and their labels. Double-click a variable in this window to paste it into the Command window.
- Command - Type commands here.
- Results - Displays the output generated by commands.
- Review - Keeps a history of all commands executed. Click a past command to paste it into the Command window.

### Value Labels

Exploring data layout, variable types, and use of variable and value labels

- Variable labels give a fuller description of variables
- Text variables are stored as strings; numeric variables are stored as byte, int, long, float, or double
- To show text for a coded numeric variable, create a value label that maps each numeric value to text, then apply that value label to the variable
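
The labelling steps above can be sketched as follows. This is a minimal illustration that assumes the `auto` example dataset shipped with Stata; the variable `highprice`, the label name `pricelbl`, and the 6000 cutoff are made up for the example.

```stata
* Load the example dataset that ships with Stata (an assumption for this sketch)
sysuse auto, clear

* Inspect storage types, variable labels, and attached value labels
describe

* Give a variable a fuller description
label variable mpg "Fuel economy (miles per gallon)"

* Create a 0/1 numeric variable (hypothetical cutoff of 6000)
generate byte highprice = price > 6000 if !missing(price)

* Define a value label and apply it to the variable
label define pricelbl 0 "Low price" 1 "High price"
label values highprice pricelbl

* The tabulation now shows the text labels instead of 0 and 1
tabulate highprice
```

The underlying values remain numeric, so the variable can still be used in calculations; `tabulate highprice, nolabel` shows the raw codes.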

### Dummy Variables from Categorical Variables

Each category gets its own dummy variable, which takes the value 1 if the observation belongs to that category and 0 otherwise
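
One common way to build such dummies is the `generate()` option of `tabulate`. The sketch below assumes the `auto` example dataset and uses the hypothetical stub name `rep_`.

```stata
* Load the example dataset that ships with Stata (an assumption for this sketch)
sysuse auto, clear

* Create one 0/1 dummy per category of rep78: rep_1, rep_2, ..., rep_5
tabulate rep78, generate(rep_)

* Compare the original variable with its dummies for the first few cars
list rep78 rep_1-rep_5 in 1/5
```

Observations with a missing value of `rep78` receive missing values in every dummy rather than 0.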

## Stata: Data Management & Analysis

### Tables and Cross Tabulation

One-way and two-way tables, incorporating missing data, and including relative frequencies

- Tabulations are best suited to categorical variables
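
A short sketch of the tables described above, assuming the `auto` example dataset shipped with Stata:

```stata
* Load the example dataset that ships with Stata (an assumption for this sketch)
sysuse auto, clear

* One-way table of a categorical variable
tabulate rep78

* Same table, with missing values shown as their own category
tabulate rep78, missing

* Two-way table with row and column relative frequencies
tabulate rep78 foreign, missing row column
```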

### t-test for Independent Samples

Two-group mean comparison test

- Plotting graphs gives an idea of distributions of data
- The ttest command reports results for both one-tailed and two-tailed tests
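
As an illustration, a two-group comparison might look like the sketch below. It assumes the `auto` example dataset, with `foreign` as the grouping variable.

```stata
* Load the example dataset that ships with Stata (an assumption for this sketch)
sysuse auto, clear

* Plot the distribution of mpg separately for each group
histogram mpg, by(foreign)

* Two-group mean comparison test, assuming equal variances
ttest mpg, by(foreign)

* Same test without assuming equal variances
ttest mpg, by(foreign) unequal
```

The bottom of the `ttest` output lists p-values for three alternatives: the difference is below zero, different from zero, and above zero.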

## Stata: Graphs

### Using Graphs

Using graphs to understand your data through histograms, scatterplots, and bar graphs

- Use graphs to compare statistics across categories
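
The three graph types mentioned above can be sketched as follows, assuming the `auto` example dataset shipped with Stata:

```stata
* Load the example dataset that ships with Stata (an assumption for this sketch)
sysuse auto, clear

* Histogram of a continuous variable, with counts on the y-axis
histogram mpg, frequency

* Scatterplot of mpg (y-axis) against weight (x-axis)
scatter mpg weight

* Bar graph comparing the mean of mpg across categories of foreign
graph bar (mean) mpg, over(foreign)
```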

### Customizing Graphs

Using the Graph Editor to customize labels, colour schemes, design, etc.

- Saving graphs and pasting to Word documents
- Generate separate graphs by category
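
Both points can also be handled from the command line. The sketch below assumes the `auto` example dataset; the file names `mpg_weight.gph` and `mpg_weight.png` are hypothetical.

```stata
* Load the example dataset that ships with Stata (an assumption for this sketch)
sysuse auto, clear

* Draw one scatterplot panel per category of foreign
scatter mpg weight, by(foreign)

* Save the graph in Stata's own format so it can be reopened and edited
graph save mpg_weight.gph, replace

* Export an image file that can be inserted into a Word document
graph export mpg_weight.png, replace
```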

### Standardizing Graphs

Record and save customizations to generate a standard format for all your graphs
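
A recording made in the Graph Editor can be replayed on later graphs with `graph play`. The sketch below assumes the `auto` example dataset and a recording that was previously saved under the hypothetical name `mystyle`; the `graph play` line fails if no such recording exists.

```stata
* Load the example dataset that ships with Stata (an assumption for this sketch)
sysuse auto, clear

* Draw a graph, then replay the saved Graph Editor recording on it
* ("mystyle" is a hypothetical recording name)
scatter mpg weight
graph play mystyle

* Alternatively, pick a built-in scheme as the default look for later graphs
set scheme s1color
```

Adding the `permanently` option to `set scheme` keeps the chosen scheme across Stata sessions.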

## Stata: Regression Analysis

### OLS: Ordinary Least Squares Regression

Review of Ordinary Least Squares (OLS) simple linear regression and the constant coefficient

- Stata estimates the constant term as the coefficient on an implicit regressor that equals 1 for every observation
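
The role of the constant can be checked directly. This sketch assumes the `auto` example dataset; the variable name `one` is made up for the example.

```stata
* Load the example dataset that ships with Stata (an assumption for this sketch)
sysuse auto, clear

* Simple linear regression; Stata adds the constant (_cons) automatically
regress price mpg

* Build the regressor x = 1 by hand and suppress the automatic constant
generate one = 1
regress price mpg one, noconstant
```

The coefficient on `one` in the second regression matches `_cons` in the first. The reported R-squared differs because Stata computes it differently when `noconstant` is specified.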

### Multicollinearity & Reference Groups in an OLS Regression

Regressing with dummy variables and choosing a reference group

- Generate dummies to capture different effects of each category
- Omit one reference group (or the constant) to avoid perfect multicollinearity
- Coefficients measure effects relative to reference group
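
A minimal sketch of these points, assuming the `auto` example dataset and Stata's factor-variable notation; the stub name `rep_` is hypothetical.

```stata
* Load the example dataset that ships with Stata (an assumption for this sketch)
sysuse auto, clear

* i.rep78 creates the category dummies and omits the lowest category as the base
regress price mpg i.rep78

* ib3.rep78 makes category 3 the reference group instead
regress price mpg ib3.rep78

* Including every dummy alongside the constant is perfectly collinear,
* so Stata drops one of them and notes the omission in the output
tabulate rep78, generate(rep_)
regress price mpg rep_1-rep_5
```

Changing the reference group changes the individual dummy coefficients, since each is measured relative to the omitted category, but not the overall fit of the model.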