More about Final Project

Econ 29000, Principles of Statistics

 Kevin R Foster, CCNY

Spring 2011

 

 

Default Guidelines for Final Project

You don't have to follow this format – use your own if you have a better idea.  If you don't, though, this will serve as a basis.  Also please ensure that your project does not go through and give bullet points in response to each question!  You should write a narrative that gracefully includes the answers to these questions.  The quality of the writing is a large factor determining the grade you get.  There are writing tutors available – use them!

 

Introduction

A concise description of the project: include the dataset used, the key interesting results (don't reproduce everything), and why those results are interesting.  Should be about a page so every word must count!

 

Literature Review

Describe the papers you've read that also look at this topic.  Explain the differences among the results found in different previous studies.  You can point out challenges that remain (even if your project doesn't solve them all).  Do different authors come to different conclusions?  Why might this be?  Are their regressions valid (e.g. do they take adequate account of endogeneity issues)?  These should be academic papers – serious studies not newspaper accounts. You can cite a newspaper to indicate why the result is interesting (e.g. to show that policymakers or the public cares about knowing the real answer, or to give some background on why you're interested in it) but you can't end there.

 

Means (simple graphs, correlations, differences of means)

First carefully note the dataset you're using, both the original source and any subsequent restrictions (e.g. if you're only looking at children or only those who are working or whatever).  Present a table where each important variable in your regression has its mean and standard deviation as well as any other relevant summary statistics (min/max, median, whatever).  Verify that the units all make sense.

 

This is a good place for simple graphs of the sort that we talked about.  Does a two-dimensional scatterplot show your regression results?  Why or why not?  This is also a good place to discuss functional forms: does the graph show that squared or cubic terms could be useful (or logarithms)?  What about subgroups?  Medians?  (Look over past homework assignments for examples.)

 

Simple Regressions

Present a few different models in easy-to-read tables.  Don't just cut-and-paste the SPSS output!  That is unacceptable.

 

Complicated Regressions

Present some more regressions (again, in easy-to-read tables).  Show your main conclusion then do some robustness checks (i.e. what if the sample were limited to only males or females or only those of certain ages or whatever is relevant).  Go back to the homework assignments from class and do just those sorts of regressions; for example if you have age plus its square and cube, do the results (the coefficients on the variables of interest) change when you put in 5-year age dummies?

 

Explain Results

Clearly state what you have found and why it is interesting.  Do your results confirm what other researchers have found?  Or do they contradict earlier research?  Why might this be?

 

Hand in: Paper, dataset, SPSS output

 

Don't Plagiarize!  Remind yourself of the rules for academic honesty (many many previous references are available).  The consequences for violations are substantial – up to expulsion.

 


 

Example

 

Using 2010 CPS data, restrict to only fulltime workers with a non-zero wage.  Run two sets of regressions to explain earnings: with earnings (annual wage and salary) as the dependent variable; with log of earnings as the dependent.

 

The first set of basic explanatory variables is hypothesized to be factors such as age, sex, education, race/ethnicity, marital status, veteran status, and if a union member.

 

Wage/Salary (annual)

 $             49,773.79

Age

41.88

Female

44.5%

White

79.7%

African-American

11.8%

Asian-American

5.8%

Native American/ Indian/ Alaskan/ Inuit/ Hawaiian

2.8%

Hispanic

16.1%

Mexican

9.8%

Puerto Rican

1.4%

Cuban

0.6%

Immigrant

17.5%

1 or more Parents were immigrants

23.8%

Education: no high school

8.6%

Education: High School Diploma

28.9%

Education: Some College (incl no degree or Assoc degree)

27.9%

Education: Some College but no degree

17.5%

Education: Associate in vocational

5.0%

Education: Associate in academic

5.4%

Education: 4-yr degree

22.5%

Education: Advanced Degree

12.1%

Married

62.0%

Divorced or Widowed or Separated

14.8%

Unmarried

23.2%

Union member

2.2%

Veteran (any)

7.4%

 

The regression estimates are made with three basic specifications: Spec 1 has just the listed variables; Spec 2 included dummies for industry, occupation, and state of residence; Spec 3 has dummy interactions for female*age, African-American*age, female*African-American*age, Hispanic*age, female*Hispanic*age, and female*education.

 

Spec 1

Spec 2

Spec 3

estimated value

estimated value

estimated value

intercept

-$28,685.56

*

$13,744.52

*

-$10,978.43

*

1954.106

3025.180

3685.959

Age

$2,517.92

*

$2,012.04

*

$3,052.09

*

93.814

88.514

133.158

Age-squared

-$23.60

*

-$18.55

*

-$29.40

*

1.055

.994

1.504

Female

-$17,380.74

*

-$14,587.20

*

$26,912.27

*

360.019

393.294

4202.955

African American

-$6,136.77

*

-$5,315.62

*

$17,924.27

*

552.138

545.564

7559.610

Asian

-$783.89

-$3,140.09

*

-$3,196.33

*

861.879

851.007

849.324

Native American Indian or Alaskan or Hawaiian

-$4,615.72

*

-$3,077.92

*

-$3,030.05

*

1054.697

1025.422

1022.749

Hispanic

-$5,176.56

*

-$4,433.05

*

$32,492.36

*

596.068

588.188

5715.141

Immigrant

-$7,377.88

*

-$4,669.63

*

-$4,080.20

*

776.395

731.493

733.482

1 or more parents were immigrants

$4,513.48

*

$1,231.87

$892.78

718.087

677.532

677.771

Education: High School Diploma

$7,658.27

*

$3,819.68

*

$4,208.53

*

701.918

667.305

826.691

Education: Some College but no degree

$15,430.94

*

$7,791.73

*

$9,434.14

*

756.430

734.022

900.898

Education: Associate in vocational

$15,719.42

*

$8,376.06

*

$9,873.19

*

1003.190

966.454

1098.448

Education: Associate in academic

$19,907.99

*

$9,660.31

*

$11,310.63

*

978.304

948.764

1091.644

Education: 4-yr degree

$35,565.50

*

$20,756.84

*

$24,651.87

*

738.325

761.377

949.760

Education: Advanced Degree

$63,729.94

*

$40,911.95

*

$46,708.57

*

815.818

896.308

1109.431

Married

$8,100.77

*

$7,074.38

*

$6,912.90

*

486.083

459.856

459.565

Divorced or Widowed or Separated

$1,646.98

*

$1,893.12

*

$1,881.97

*

633.993

595.046

594.911

Union member

-$3,992.75

*

$2,282.96

*

$2,372.64

*

1169.615

1108.181

1105.552

Veteran (any)

-$1,186.63

-$884.41

-$905.22

687.786

648.453

659.002

R-squared

0.213

0.315

0.319

 

Discussion....