Эконометрика — МИЭФ, 2025 midterm 1
Question 1
Multiple-choice test
Which of the following is a difference between least absolute deviations, LAD, and ordinary least squares, OLS?
-
OLS and LAD are equally sensitive to outlying observations.
-
OLS is more computationally intensive than LAD.
-
OLS is more sensitive to outlying observations than LAD.
-
OLS is justified for very large samples, while LAD is justified for smaller samples.
-
OLS estimates the conditional median of the dependent variable, while LAD estimates the conditional mean.
Question 2
Multiple-choice test
Which of the following regression models is nonlinear in parameters?
-
.
-
.
-
.
-
.
-
All regressions 1)-4) are nonlinear in parameters.
Question 3
Multiple-choice test
A linear transformation of the explanatory variable in the model
does not generally change:
-
The estimate of the intercept.
-
The estimate of the slope coefficient.
-
The standard error of the slope coefficient estimate.
-
The determination coefficient of the regression.
-
The standard error of the intercept estimate.
Question 4
Multiple-choice test
A student investigates factors affecting GDP growth for 200 countries in 2023. GDP is first measured in constant 2021 US dollars at purchasing-power parity. He then measures GDP growth using national currencies at constant prices. All regressors remain the same, and the GDP growth rate remains the dependent variable. What can be said about the two regressions?
-
All coefficients are the same.
-
Only the intercept changes; all slope coefficients remain the same.
-
All coefficients are generally different.
-
Coefficients may differ, but is the same.
-
Coefficients may differ, but is the same.
Question 5
Multiple-choice test
A student estimates the production function
where is the output growth rate, is the capital growth rate, and is the labour growth rate. She then estimates
Which statement is correct?
-
Model (1) is a restricted version of (2).
-
Model (2) is a restricted version of (1).
-
Both statements 1) and 2) are incorrect.
-
Models (1) and (2) are equivalent.
-
There is perfect multicollinearity in model (2).
Question 6
Multiple-choice test
Suppose the following model is estimated by OLS and the Gauss-Markov conditions are satisfied:
Then:
-
You can obtain an unbiased estimate of .
-
You cannot obtain an unbiased estimate of , but can obtain a consistent estimate.
-
You cannot obtain either an unbiased or a biased-but-consistent estimate of .
-
You cannot obtain any estimate of because of perfect multicollinearity.
-
All the above statements are incorrect.
Question 7
Multiple-choice test
In the regression model
where satisfies the Gauss-Markov conditions and is normally distributed, contains random measurement errors that are independent, normally distributed, homoscedastic, not autocorrelated, and have zero expected values. Suppose and the mean of is negative. For large samples:
-
The estimate of is biased upwards.
-
The estimate of is biased downwards.
-
The estimate of is unbiased.
-
The estimate of may be biased upwards or downwards.
-
The estimate of is biased upwards.
Question 8
Multiple-choice test
If OLS is used in a simple regression model with heteroscedasticity, the population variance of the slope estimator is
Under homoscedasticity,
Let , where are unknown nonnegative weights and . Then:
-
Expression (1) is always greater than (2).
-
Expression (1) is always less than (2).
-
Expression (1) is greater than or equal to (2).
-
Expression (1) is less than or equal to (2).
-
Expression (1) can be greater than, less than, or equal to (2), depending on the relationship between and .
Question 9
Multiple-choice test
Which of the following is a difference between the White test and the Breusch-Pagan test?
-
The White test detects heteroscedasticity of unknown form, while the Breusch-Pagan test detects heteroscedasticity of a specified form.
-
The Breusch-Pagan test detects heteroscedasticity of unknown form, while the White test detects heteroscedasticity of a specified form.
-
The number of regressors used in the White test is larger than the number used in the Breusch-Pagan test.
-
The number of regressors used in the Breusch-Pagan test is larger than the number used in the White test.
-
None of the above.
Question 10
Multiple-choice test
The Durbin-Wu-Hausman test can be used to detect:
I. Measurement-error bias.
II. Simultaneous-equations bias.
III. Endogeneity of explanatory variables.
-
I, II, and III.
-
II and III only.
-
I and III only.
-
I and II only.
-
II only.
Question 11
Multiple-choice test
The economic model is
\begin{aligned} y_1&=\alpha+\tau y_2+\pi x_1+\phi x_2+\varepsilon x_3+u_1, \tag{1}\\ y_2&=\beta+\mu y_1+\gamma x_1+u_2. \tag{2} \end{aligned}Here and are endogenous variables; , , and are exogenous variables; and , satisfy the Gauss-Markov conditions. Indicate the correct statement:
-
You may apply TSLS to (1), but not to (2).
-
You may apply TSLS to (2), but not to (1).
-
You may apply TSLS to both (1) and (2).
-
You may not apply TSLS to either (1) or (2).
-
TSLS is not needed because OLS provides consistent estimates in both equations.
Question 12
Multiple-choice test
For a simultaneous equations model with 9 equations, 9 endogenous variables, and 8 exogenous variables, which statement is true for any equation?
-
An equation is likely to be underidentified if 10 variables are missing from it.
-
An equation is likely to be exactly identified if 9 variables are missing from it.
-
An equation is likely to be overidentified if 9 variables are missing from it.
-
An equation is likely to be exactly identified if 10 variables are missing from it.
-
An equation is likely to be underidentified if 11 variables are missing from it.
Part 2. Free Response Questions — 2 hours.
Section A. Answer all questions from this section (original Questions 1-2).
Question 13
Written Question 1 — 25 marks
An ICEF student preparing her diploma examines the dependence of painting prices , in thousands of dollars, on the painting's age , measured in decades from the current year to the year in which the work was created, and canvas size , measured in square feet. Data on 19 paintings sold at one auction are used to estimate:
with standard errors
(a) (13 marks)
- Interpret the coefficients of equation (1).
- Evaluate the significance of the individual coefficients and of equation (1) as a whole.
- is insignificant, and the student considers excluding it. What are the likely consequences, given that the correlation between and is ?
The supervisor recommends adding provenance information. Let if the painting has documented provenance and otherwise. The student estimates
with standard errors
- Why are the intercepts different in equations (1) and (2)?
- What does the coefficient on mean? Is it significant?
- What assumptions about the coefficients on and are implicit in equation (2)?
(b) (12 marks) The supervisor also recommends adding and :
with standard errors
- What is the meaning of this recommendation? How do the marginal effects differ for paintings with and without provenance?
- Is provenance statistically significant in equation (3)?
- The alternative approach is a Chow test. Suppose equation (1), estimated separately for paintings without provenance, gives , while for paintings with provenance it gives . Explain the Chow test and compare its result with the previous test.
Question 14
Written Question 2 — 25 marks
A student studies selling prices of 400 houses in the Moscow region, measured in thousands of roubles. The data include property size in square metres, number of bedrooms, and a dummy for air conditioning.
The estimated equation is
Conventional standard errors are
and heteroscedasticity-robust standard errors are
(a) (13 marks)
- What is the economic meaning of the coefficient on ?
- Interpret the coefficient on .
- Interpret the coefficient on .
- What is heteroscedasticity? Why is it likely in this sample? What are its consequences for econometric estimation?
- What are heteroscedasticity-consistent standard errors and what are they used for? Comment on the differences between the two sets of standard errors. Are the coefficient estimates significant?
(b) (12 marks) The student observes that the variance of increases with and assumes
in addition to the Gauss-Markov assumptions and normality.
- To test for heteroscedasticity, she orders the 400 observations by size. Estimating equation (1) on the 150 observations with the smallest gives . Estimating it on the 100 observations with the largest gives . The unequal sample sizes are used because there are fewer high-priced properties. Carry out the appropriate test, state the hypotheses, calculate the statistic, and draw a conclusion. What are the consequences of violating the equal-subsample-size rule?
- A friend recommends White's test. For the auxiliary equation with cross terms, the student obtains . Complete the test, including its distribution, degrees of freedom, critical value, and conclusion.
Section B. Answer one question from this section (original Question 3 or Question 4).
Question 15
Written Question 3 — 25 marks
Consider two regression models without an intercept:
where
and and are non-stochastic.
(a) (10 marks) Let (1) be the true model, but estimate (2) by OLS:
- Show that
is biased.
- Which factors determine the direction of the bias?
- Under what conditions is there no bias?
(b) (8 marks) Consider
and
with the same disturbance assumptions, non-stochastic , and .
- Let (3) be true and estimate (4) by OLS. Rewrite (3) as , where . Using part (a), find the omitted-variable bias. How does affect its direction, and together with which other factors?
- Now let (4) be true and estimate (3) by OLS. Briefly describe the consequences for the properties of .
(c) (7 marks) Obtain the expression for
by directly comparing
from regression (4) with
from regression (3).
Question 16
Written Question 4 — 25 marks
Consider the following simultaneous-equations model of equilibrium wheat consumption:
The errors and are i.i.d. with zero means and constant variances. Here:
- is per-capita wheat consumption, in kilograms;
- is the domestic price per kilogram;
- is per-capita income;
- is sunshine hours or solar radiation;
- is the number of severe flooding events during the last wheat-growing season.
The variables , , and are exogenous; and are endogenous. The model is estimated using cross-sectional data.
(a) (10 marks)
- Explain endogenous and exogenous variables.
- Using the market-clearing condition , derive reduced-form equations for and .
- Briefly show why estimating equations (1) and (2) by OLS is problematic. A derivation of the large-sample bias is not required.
(b) (8 marks)
- What does it mean for an econometric equation to be identified or not identified? The order condition is not required for this definition.
- Determine whether equations (1) and (2) are exactly identified, underidentified, or overidentified using the order condition.
- A seminar participant proposes adding
so the supply equation becomes
What is the rationale for this proposal? What signs should be expected for the coefficients on and ?
- How does adding change identification of equations (1) and (2*)?
(c) (7 marks) Which method can be used to obtain consistent estimates of equation (1)? Briefly describe how to apply it.