Possible Solutions for
Homework 11 Econ 29000 Kevin R Foster, CCNY |
|
This exercise will use the version of the CPS dataset
online, CPS_hw11, which is a version of the 2010 March CPS data from the US
Bureau of Labor Statistics. (It is a
zipped file so first download it and then unzip before you open it with SPSS.) It collects a variety of information, which
you are welcome to look through as you consider final project topics.
Use "Data\Select Cases" to choose only people
with Total Wage and Salary greater than zero.
1.
Please list the names of the
people in your study group.
2.
Use SPSS to run a simple linear
regression (choose Analyze\Regression\Linear) with the dependent variable as Total
Wages and Salary and the independent variable Age.
a.
What is the estimated value for
β0? Is it statistically
significantly different from zero?
b.
What is the estimated value for
β1? Is it statistically
significantly different from zero?
The SPSS output is:
Coefficientsa |
||||||
Model |
Unstandardized Coefficients |
Standardized
Coefficients |
t |
Sig. |
||
B |
Std. Error |
Beta |
||||
1 |
(Constant) |
12698.652 |
489.894 |
|
25.921 |
.000 |
Demographics, Age |
716.714 |
11.411 |
.196 |
62.809 |
.000 |
|
a. Dependent Variable: Total wage and
salary earnings amount - Person |
The β0 (beta-zero or beta naught) coefficient estimate is $12,698.65 – this implies that people are predicted to start from just under $13,000 per year when born and then add wages as they age. It is statistically significantly different from zero: the standard error is just 490, so the coefficient estimate is almost 26 times larger than the standard error. The p-value is less than 0.000 (certainly less than 0.05).
The β1 (beta-one) coefficient estimate is $717 – implying that people get an average of $717 more in wages & salary for each year that they get older. This is also statistically significant; its standard error is just 11; the coefficient estimate is almost 63 times larger. The p-value is nearly zero, certainly less than 0.05.
c.
What is the predicted value for
someone who is 25 years old? 45?
The predicted wage is $12,699 + 717*Age so someone who is 25 is predicted to earn $30,607; someone 45 years old would be predicted to earn $44,951.
d.
If you had not selected only
people with non-zero wages, how would that change the estimates? (You can undo the "select cases"
and see for yourself.)
e.
How different would the
estimates be, if you had selected people with non-zero wages and also who were
"prime-age" so 25-55 years old?
f.
Again use "select
cases" to do a regression on just prime-age women. Then do a separate regression for prime-age
men. What are the differences?
These are shown in the table below.
wage > 0 |
all |
prime age |
prime age men |
prime age
women |
|
β0 |
12699 |
6675 |
16404 |
13015 |
19631 |
β1 |
717 |
376 |
752 |
1068 |
428 |
Clearly adding in all of the zero wages drags down both coefficient estimates. People at "prime age" have a similar slope but a higher level of wages (suggestive of curvature); men start lower than women but accrue increases at a faster rate.
3.
Next estimate a linear
regression with the same dependent variable, but now add the "female"
dummy variable and education variables (you can choose which ones). Interpret the regression coefficients: which
are statistically significant? (You can
choose whether to do prime-age or all ages, but you'll need both men and
women.)
These estimates are:
wage>0 |
prime age |
|
constant |
7594 |
2809 |
Age |
495 |
723 |
Female |
-18598 |
-21275 |
just HS |
11018 |
10350 |
some college
w/o degree |
15630 |
17961 |
Assoc in
vocational |
20702 |
20050 |
Assoc in
academic |
23491 |
23088 |
4-yr degree |
38798 |
39500 |
Adv degree |
65299 |
65435 |
Women are predicted to do even worse if I just focus on prime-age; the educational premiums are quite close for both groups however.