Эконометрика — Совбак ВШЭ и РЭШ, 2021 final
Question 1
Part I: The trend toward later retirement — 20 points
The labor-force participation rate, LFPR, is the fraction of people who are employed or looking for work. From the mid-1990s through approximately 2009, the LFPR of Americans aged 55 and over, , rose from roughly 30% to more than 40%. Since 2010 it has plateaued.
The four-quarter change is
The part examines whether the slowdown was caused by the 2008-2009 recession using unemployment, real disposable personal-income growth, and housing starts.
Table 1. Variable definitions and summary statistics
Unit: U.S. quarterly data, 1995Q1-2015Q3, .
| Variable | Definition | Mean | Std. dev. |
|---|---|---|---|
| LFPR for U.S. residents aged 55 and over | 36.2 | 3.8 | |
| 0.12 | 0.22 | ||
| 0.47 | 0.48 | ||
| Unemployment Rate | Unemployed as percentage of labor force | 6.0 | 1.7 |
| Real Income | After-tax personal income, billions of 2010 dollars | 9,989 | 1,616 |
| Quarterly log change | 0.0069 | 0.0099 | |
| Housing Starts | New home construction started in the quarter, millions | 1.334 | 0.471 |
| First difference | -0.0038 | 0.084 |
Table 2. Forecasting regressions for
All regressions use quarterly data from 1995Q1 through 2007Q4. Standard errors are in parentheses. HR denotes heteroskedasticity-robust; HAC denotes heteroskedasticity- and autocorrelation-consistent.
| Regressor/statistic | (1) | (2) | (3) | (4) | (5) |
|---|---|---|---|---|---|
| Standard errors/tests | HR | HAC | HAC | HAC | HAC |
| 0.138<br>(0.245) | 0.138<br>(0.211) | -0.059<br>(0.149) | 0.169<br>(0.244) | 0.024<br>(0.313) | |
| 0.380<br>(0.289) | 0.380<br>(0.234) | 0.216<br>(0.148) | 0.409<br>(0.245) | 0.212<br>(0.367) | |
| 0.430+<br>(0.255) | 0.430*<br>(0.166) | 0.269*<br>(0.135) | 0.458**<br>(0.154) | 0.270<br>(0.224) | |
| Unemployment Rate | — | — | 0.756**<br>(0.217) | — | — |
| Unemployment Rate | — | — | -0.236<br>(0.332) | — | — |
| Unemployment Rate | — | — | -0.675**<br>(0.215) | — | — |
| — | — | — | -3.08<br>(3.16) | — | |
| — | — | — | -0.205<br>(6.79) | — | |
| — | — | — | 7.65<br>(9.64) | — | |
| — | — | — | — | 1.01**<br>(0.30) | |
| — | — | — | — | -0.89<br>(0.61) | |
| — | — | — | — | 0.51<br>(0.38) | |
| Constant | 0.497**<br>(0.088) | 0.497**<br>(0.112) | 1.447**<br>(0.412) | 0.444<br>(0.291) | -0.460<br>(0.712) |
| BIC | -1.641 | -1.641 | -2.039 | -1.433 | -1.537 |
| Adjusted | 0.021 | 0.021 | 0.441 | 0.002 | 0.077 |
| RMSFE, 2008Q1-2015Q3 | 0.558 | 0.558 | 0.685 | 0.554 | 0.381 |
| Observations | 52 | 52 | 52 | 52 | 52 |
| : three lagged LFPR changes | 1.51, | 4.55, | 3.33, | 6.40, | 2.33, |
| : three unemployment terms | — | — | 11.76, | — | — |
| : three income-growth terms | — | — | — | 1.20, | — |
| : three housing-start terms | — | — | — | — | 4.21, |
, , and denote significance at 10%, 5%, and 1%.
1 — 4 points
Regressions (1) and (2) differ only in how standard errors are computed. Which method is preferred here, or does it not matter in theory? Explain.
2
You must forecast the change in LFPR55 from 2015Q3 to 2016Q3 using one of regressions (2)-(5).
- (4 points) Which specification would you choose, and why?
- (4 points) Give the standard error of the forecast. You need not compute the forecast itself.
3 — 8 points
Assess the hypothesis that the post-2010 plateau in LFPR55 was caused by the recession and slow recovery. Absent the recession, older workers might have continued postponing retirement; because of job losses and difficulty finding work, many may instead have retired earlier.
Using Figures 1-6 and Table 2, write one or two paragraphs evaluating whether the evidence supports, contradicts, or is insufficient to assess this hypothesis.
Question 2
Part II: Bank lending and small-business profitability — 21 points
The Bank of Russia wants to estimate the effect of bank lending on small-business profitability. It has data on one hundred businesses over five years, described in the source as 50,000 observations:
- : profitability;
- : total bank loans received;
- : total government loans received;
- : region dummies, ;
- : line-of-business dummies, ;
- : unemployment;
- : distance to the nearest bank branch;
- : number of business closures in the city;
- : number of business openings in the city.
Business and year dummies and a time trend can also be created. The baseline regression is
For each concern, explain briefly how you would address it.
- (3 points) Only businesses surviving all five years are included.
- (3 points) Larger bank loans are usually granted to businesses more likely to be profitable.
- (3 points) A loan may have a positive long-run effect but initially be a burden because repayments begin before benefits are realized.
- (3 points) Government-loan recipients may use bank loans more effectively because the government-loan program includes business education.
- (3 points) Only businesses that received loans are in the data set.
- (3 points) Doubling loan size more than doubles the expected benefit.
- (3 points) Unobserved, time-invariant business characteristics, such as management quality, may affect both loan access and profitability.
Question 3
Part III: True, false, or uncertain — 24 points
For each statement, answer true, false, or uncertain, with an explanation.
(a) (4 points) In a probit or logit model, marginal effects on the probability that the dependent variable equals one are functions of . Therefore, it is difficult to give alone a useful interpretation.
(b) (4 points) You want to estimate the supply equation for snow boots
where is quantity sold, is price, and is transportation cost from a central production facility. Annual snowfall is proposed as an instrument for . The fact that snowfall is correlated with proves that the instrument is invalid.
(c) (4 points) In a panel model where regressors are correlated with individual fixed effects, the fixed effects should be omitted to avoid multicollinearity.
(d) (4 points) The same instrument can be used for two different endogenous regressors in one regression.
(e) (4 points) OLS with cross-sectional data is bad because heteroskedasticity biases both slope estimates and standard errors.
(f) (4 points) It is possible to construct standard errors robust to unknown heteroskedasticity, but not standard errors robust to both heteroskedasticity and autocorrelation.
Question 4
Part IV: Miscellaneous problems — 35 points
1. Omitted variables and instruments — 15 points
Suppose
and the main parameter of interest is . Assume and are uncorrelated and, in fact,
- (5 points) Without data on , can you consistently estimate ? Explain.
- (5 points) Suppose is endogenous and remains unobserved. A proposed instrument is a noisy linear combination of and , for example Discuss whether is a suitable instrument and the properties of the IV estimator.
- (5 points) If is observed, is there a way to obtain a consistent estimate of ? Give a specific procedure or explain why not.
2. Cigarette-demand IV estimates — 20 points
The observations are the 48 continental U.S. states. Let denote cigarette sales, a state price index, and average income.
Using log state sales tax as an instrument for log price gives
with standard errors
under the coefficients on and .
- (2 points) Construct a 95% confidence interval for the price elasticity.
- (2 points) Test the significance of the income coefficient.
Adding log state cigarette tax as a second instrument gives
with standard errors
- (1 point) Construct a 95% confidence interval for the price elasticity.
- (5 points) Compare the coefficients across the two specifications to test instrument validity. You may use the fact that the variance of their difference can be estimated as the difference of their estimated variances, with the second specification having the lower variance.
Using a second year of data, first differences, and the differences of the same two instruments gives
with standard errors
- (5 points) Write a model in which this differenced IV estimator is consistent but the previous level estimators are not.
- (5 points) Regressing on a constant, , and the two instruments gives an statistic of 88.6, with 48 observations. Is it significant? Which of the two key IV conditions does this test confirm?