Econ B2000, MA Econometrics
Kevin R Foster, the Colin Powell School at the City College of New York, CUNY
October 22, 2019
The questions are worth 120 points. You have 120 minutes to do the exam, one point per minute.
All answers should submitted electronically. Please submit all relevant computer files. Please no “pages” files, save as pdf or rtf. I prefer .Rmd files. No need to put your name, just last digits of ID to identify yourself, so grading is blind. You may refer to your books, notes, calculator, computer, or astrology table. The exam is “open book.” However, you must not refer to anyone else, either in person or electronically, during the exam time. For instance, since these exam questions are newly created, posting questions or copying answers on online homework helping sites or forums (such as Chegg, Yahoo answers or others) is a violation. You must do all work on your own. Cheating is harshly penalized. Please silence all electronic noisemakers such as mobile phones. Good luck. Stay cool.
Estimate | Std. Error | |
---|---|---|
(Intercept) | 0.93155 | 0.23881 |
GP.Calc | 0.11545 | 0.03034 |
GP.Eco101 | 0.19374 | 0.06487 |
GP.Eco102 | 0.32428 | 0.05108 |
GP.Eco103 | -0.01309 | 0.05428 |
(20 points) In the PUMS NY data that we’ve been using in class, those with TRANWORK == 70 are working from home. Compare that group of people with those commuting on the subway (there’s a dummy Commute_subway or use TRANWORK == 33). What are the educational attainments in each group? Given that someone works from home, what is the likelihood that the person has at least a 4-year degree? Given that someone is female, what is the likelihood that she works at home? Given that someone is male, what is the likelihood that he works at home? Create a confidence interval for the difference and provide a p-value.
(40 points) With the same data, create a different regression where TRANTIME is the dependent variable (you can drop the ones with zero values). What is the effect (if any) of educational qualification on commuting time? What about other demographic effects such as age and race/ethnicity? Explain confidence intervals and p-values for each important coefficient (where your explanation determines important). Create a joint hypothesis test to find a p-value for whether all of the education dummies are all equal to zero. Calculate predicted values for some people and assess if these seem plausible. Perhaps a graph of data and predicted values.
(20 points) Create a k-nn model to try to predict how a person commutes to work. (Worry a bit about the zero values of TRANWORK since that typically means they’re not working.) How useful is this model at predicting? What are some of the important variables in this prediction?