Lecture Notes 15 Econ
29000, Principles of Statistics Kevin R Foster, CCNY Fall
2011 |
|
When you have a 0/1 variable as your dependent "y" variable...
Use SPSS "Binary Logistic"
Binary Dependent Variable Models
Since the slope, , the change in probability per change in X-variable,
is always changing, the simple coefficients of the linear model cannot be
interpreted as the slope, as we did in the OLS model. (Just like when we added a squared term, the
interpretation of the slope got more complicated.)
Return to the picture to make this much clearer:
The slope at X1 is rather low; the slope at X2 is much steeper.
The effect of the coefficients now interacts with all of the other variables in the model: for example the effect of a person's gender on their probability of having health insurance will depend on other factors like their age and educational level. Women are generally less likely to have their own insurance than men, but how much less? Among young people with very low education, neither men nor women are very likely to be insured; among older people with very high education both are very likely insured. The biggest difference is toward the middle.
For example, very simple logit estimations on the CPS 2008 dataset gives the following coefficient estimates (I am suppressing notation on significance since it is not important here):
|
Logit |
|
female |
-0.428 |
|
afam |
0.220 |
|
asian |
0.252 |
|
Amindian |
0.012 |
|
Hispanic |
-0.028 |
|
ed_hs |
0.987 |
|
ed_smcol |
1.180 |
|
ed_coll |
1.652 |
|
ed_adv |
1.927 |
|
marrd |
0.492 |
|
divwidsp |
0.875 |
|
union |
1.336 |
|
veteran |
0.088 |
|
immig |
-0.277 |
|
imm2gen |
-0.067 |
|
Intercept |
-1.303 |
|
The probability of having health insurance varies for different socioeconomic groups. We can interpret the signs in a straightforward way: the negative coefficients on the "female" variable indicate that women are less likely to have health insurance. Surprisingly, African-Americans are more likely, along with Asians and Native Americans (although the last is not significant). Hispanics are less likely although this is also not significant.
But how large are these differences? For example, how much less likely to have health care are immigrants? It depends on the other variables. Intuitively, if a person is male, highly-educated, married, and unionized then he's probably insured (being an immigrant would them only slightly less so). So the change in probability associated with immigrant status would be low. At the opposite end, a woman without even a high school diploma, who is single, might already be unlikely to be insured. Immigrant status hardly changes this. Only in the middle will there be a big effect.
We can calculate it straightforwardly, though.
Consider, say, a non-immigrant
woman with an advanced degree, whose predicted probability of having health
insurance is =
=
Summing the 3 relevant
coefficients (the intercept, female, and an advanced degree) gives a logit probability of . For an
otherwise-identical immigrant woman (also with an advanced degree) the
probability is 0.4796, so the change in probability is about 7%.
Compare the change in probabilities for a married male with an advanced degree who is a union member, who is either an immigrant or not. Now the probability of having insurance is, by the logit, 0.9206 for the non-immigrant and 0.8979 for the immigrant, a change of just 2.3%. From the probit the estimated probabilities are 0.9298 for the non-immigrant and 0.9045 for the immigrant, a change of 2.5%. This is because a married male with an advanced degree who is a union member is already highly likely to have health insurance, so the difference of being an immigrant or not makes only a small change compared with the previous example of a female with a high education (but unmarried and not in a union).