Эконометрика — МИЭФ, 2023 final
Question 1
Multiple-choice test
For the model
where are non-stochastic and the Model A assumptions are satisfied, the following three estimators of are proposed:
The following is correct for these estimators:
-
All the estimators , , and are unbiased.
-
All the estimators , , and are biased.
-
The estimator is unbiased, while and are biased.
-
The estimators and are unbiased, while is biased.
-
The estimators and are unbiased, while is biased.
Question 2
Multiple-choice test
Which of the following correctly identifies an advantage of using adjusted over ?
-
Adjusted corrects the bias in .
-
Adjusted is easier to calculate than .
-
The penalty of adding new independent variables is better understood through adjusted than .
-
Adjusted can be calculated for models having logarithmic functions, while cannot be calculated for such models.
-
None of the above is correct.
Question 3
Multiple-choice test
A student estimated by OLS the production function
where is the output growth rate, is the capital growth rate, and is the labour growth rate. Then he decided to estimate by OLS the function
Which statement of the following ones is correct?
-
.
-
.
-
.
-
.
-
.
Question 4
Multiple-choice test
If you have estimated the parameters of the following model using OLS directly, with the Gauss-Markov conditions satisfied,
then:
-
You can get an unbiased estimate of .
-
You cannot get an unbiased estimate of , but can get a consistent estimate of it.
-
You cannot get an unbiased, or biased but consistent, estimate of .
-
You cannot get any estimate of .
-
All the above statements are incorrect.
Question 5
Multiple-choice test
Which of the following correctly defines the statistic for testing linear restrictions if represents the coefficient of determination from the restricted model, represents the coefficient of determination from the unrestricted model, and is the number of restrictions imposed?
- None of the above.
Question 6
Multiple-choice test
The following double-logarithmic model is estimated:
The interpretation of the coefficient is the following:
-
If increases by one unit, then increases approximately by percent.
-
If increases by one unit, then increases approximately by percent.
-
If increases by one percent, then increases approximately by percent.
-
If increases by one percent, then increases approximately by percent.
-
If increases by one percent, then increases approximately by units.
Question 7
Multiple-choice test
An econometric model is described by the following three equations:
\begin{aligned} y_1&=\alpha+\beta y_3+\gamma x_1+\sigma x_3+\pi x_4+u_1, \tag{1}\\ y_2&=\delta+\varepsilon y_1+\lambda x_2+u_2, \tag{2}\\ y_3&=\mu+\theta y_1+\omega y_2+\rho x_3+\chi x_4+u_3. \tag{3} \end{aligned}Here , , and are endogenous variables; , , , and are exogenous variables; and , , and are disturbance terms, independent and satisfying the Gauss-Markov conditions. Choose the correct statement:
-
Equation (2) is exactly identified.
-
Equation (1) is overidentified.
-
Equation (3) is underidentified.
-
Equation (1) is exactly identified.
-
Equation (2) is underidentified.
Question 8
Multiple-choice test
The model with the dependent variable , monthly pension, as a function of work experience and average earnings is being considered:
The value of pension is restricted by the values and from the top and from the bottom, but there are no actual observations in the sample with or . The student decided to estimate a Tobit model with the truncated sample, with all observations on the upper or lower bounds excluded. Please indicate the correct statement among the following ones:
-
The estimated coefficients are biased but consistent.
-
The estimated coefficients are biased and inconsistent.
-
The estimated coefficients are unbiased.
-
For the truncated sample, OLS estimation would provide unbiased estimates.
-
None of the above.
Question 9
Multiple-choice test
The following model of determination of the size of dividends is considered:
where is the desirable size of the dividends, is the current profits, is the actual size of the dividends, and . The following statement is correct. The model is:
-
The adaptive expectations model and can be consistently estimated in the form of the Koyck distribution model.
-
The partial adjustment model and can be consistently estimated in the form of the ADL(1,0) model.
-
The partial adjustment model and can be consistently estimated in the form of the ADL(0,1) model.
-
The error correction model and can be consistently estimated in the form of the ADL(1,1) model.
-
The error correction model and can be consistently estimated in the form of the ADL(1,0) model.
Question 10
Multiple-choice test
Refer to the following model:
Here represents:
-
The short-run change in given a temporary increase in .
-
The short-run change in given a permanent increase in .
-
The long-run change in given a permanent increase in .
-
The long-run change in given a temporary increase in .
-
None of the above.
Question 11
Multiple-choice test
Indicate the incorrect statement among the following ones:
-
If is a random walk with drift, the series of first differences , where is white noise, is stationary.
-
The time trend is a non-stationary series.
-
The MA(1) process is stationary.
-
The AR(1) process , with , is asymptotically stationary.
-
The stationarity of an ARMA process is determined by its MA part.
Question 12
Multiple-choice test
In the model based on panel data
random effect estimation is based on the following assumptions:
I. There are no variables that are fixed for each individual.
II. There is some unobserved heterogeneity in the model.
III. Each of the unobserved variables is treated as being drawn randomly from a given distribution.
IV. The variables are correlated with some of the variables.
V. The variables are distributed independently of all of the variables.
-
I, III and IV only.
-
II, III and V only.
-
II and III only.
-
I, III and V only.
-
III and IV only.
Question 13
Written part, Section A — original Question 1 — 25 marks
Part 2. Written examination. One session, 2 hours without break.
SECTION A. Answer all questions 1-2 from this section.
Working on her coursework, a student of ICEF interviewed ICEF graduates of different graduation years working in Russia. She is interested in studying their current earning, , in thousands of rubles per month. Explanatory variables are , age of respondent in years, age squared , and also some dummy variables: , equal to 1 for those graduates who have received a master's degree abroad and 0 otherwise; , no further education, equal to 1 for those graduates who received a master's degree neither abroad nor in the country; and , equal to 1 for male and 0 for female. She posted a questionnaire on the Internet and received answers from 41 graduates of different years of graduation from ICEF. Here are the results of estimation of two regressions using different sets of variables. Standard errors are in brackets.
Standard errors:
Standard errors:
(a) (12 marks)
-
How many groups of dummy variables does equation (2) contain? How many categories of education level of ICEF graduates do dummy variables and describe? What is the reference category in each of the groups of dummy variables?
-
Help the student estimate the expected earnings of ICEF graduates of different categories presented in equations (1) and (2) for a person 25 years old. Why are the coefficients of equations (1) and (2) different, and what is the difference in the meaning of the estimates obtained from equations (1) and (2)?
-
Are the coefficients of the variables , , and significant? Are they jointly significant?
(b) (13 marks) The student found that the variables , , and do not correlate with and with .
-
Can the effects of age be considered independently of the values of other variables?
-
What is the meaning of the coefficients of and in equation (2)? Evaluate the marginal effect of age for , , and and discuss the results. Is the influence of age on earnings significant?
-
What would be the consequences for evaluating equation (2) if the variable was excluded from it? Explain based on your knowledge of econometric theory.
-
Fearing the presence of heteroscedasticity, the student runs the Breusch-Pagan test for equation (2), obtaining the value of the statistic . For equation (1), she performs a White test with cross-terms included, obtaining . Help the student complete the tests for heteroscedasticity and draw conclusions. Explain your answer.
Question 14
Written part, Section A — original Question 2 — 25 marks
The student decided to investigate the factors that affect expenditures on air travel in the United States. To do this, she uses data from the 25 years, 1994-2018, prior to the outbreak of the Covid-19 pandemic, on total expenditure , total income , and the air travel relative price index , all taken in logarithms. She first builds the following regressions using OLS and Cochrane-Orcutt (C.O.) methods. Standard errors are in parentheses.
Standard errors:
Standard errors:
Standard errors:
Standard errors:
(a) (13 marks)
-
Why can one suggest the presence of autocorrelation in some of the equations listed above? Why is this question important when evaluating regression equations? Help the student explore this question using the Durbin-Watson test.
-
For what purpose does the student, along with equation (1), also calculate equations (2), (3), and (4)? Explain your opinion.
-
Has she been able to achieve her goals? Is there any reason to believe that there is no autocorrelation in equation (4), or should the student be advised to take an additional test? Which one?
A student's friend advised her to use a lagged variable as the best and simple tool to make the statistic acceptable. The corresponding equation is
Standard errors:
- Do you agree with the advice of the student's friend? Help her to test equation (5) for autocorrelation.
(b) (12 marks) The supervisor advised the student to consider the more general model ADL(1,1),
and conduct a Common Factor test for this model. The corresponding estimated models are as follows.
Unrestricted model
Standard errors:
Restricted model
Standard errors shown in the source:
- Demonstrate how to obtain the restricted specification of the ADL(1,1) model from the multiple regression model
with the autocorrelated disturbance term
-
Help the student to run the Common Factor test, stating the restrictions and making a conclusion.
-
Under what conditions is the Common Factor test valid? Explain how the student can test that these conditions are met.
Question 15
Written part, Section B — original Question 3 — 25 marks
SECTION B. Answer only ONE question from this section: Question 3 OR Question 4.
(a) (10 marks)
-
Explain what is meant by a stationary time series and a non-stationary time series. How to understand if a time series is stationary?
-
What is detrending of a time series? What is differencing of a time series? Explain what you understand by difference-stationary and trend-stationary time series. What is the difference in the impact of random shocks on difference-stationary and trend-stationary time series?
-
Demonstrate that the time trend
is a trend-stationary time series. There is no need to prove that this series is non-stationary. We assume
- Demonstrate that the random walk
is a difference-stationary time series. There is no need to prove that this series is non-stationary. Use the same assumptions about .
(b) (7 marks) Consider the following non-stationary process:
where is i.i.d. .
-
Explain the source or sources of non-stationarity of . Indicate at what values of the parameters process (1) turns out to be difference stationary and at what values it is trend stationary.
-
Investigate the implications of detrending process (1) under the assumption .
-
Investigate the consequences of applying the differencing transformation to process (1) under the assumption .
(c) (8 marks)
- Show that you can rewrite model (1) as
Clearly indicate the one-to-one relation between and .
- How can equation (2) be used to test time series (1) for stationarity?
Question 16
Written part, Section B — original Question 4 — 25 marks
The researcher investigates the effect of having vocational training available in high school on the probability of currently living in poverty for the population of men who grew up with a disadvantaged background. Let be a dummy variable equal to one if a man is currently living below the poverty line and zero otherwise. The variable is age, and is total years of schooling. Let be an indicator equal to unity if a man's high school offered vocational training. Using a random sample of 850 men, the researcher obtains
where
is the logit function.
(a) (10 marks)
-
Why is model (1) estimated by maximum likelihood and not OLS? Explain the meaning of the maximum likelihood method. What properties do estimates obtained by the maximum likelihood method have?
-
Discuss the benefits and drawbacks of using the logit regression model when trying to explain a binary variable .
-
Equation (1) contains information only about the estimated coefficients of the model. What additional information is needed to be able to judge the statistical quality of econometric model (1)? What tests can be carried out for this purpose?
(b) (7 marks)
- Use the direct comparison of two probabilities of living in poverty calculated by the logit function to evaluate the effect of having vocational training available in high school for a 40-year-old man with 12 years of education. Give details and interpret the results.
(c) (8 marks)
-
Now do the same estimation of the marginal effect of vocational education as in (b) using derivatives.
-
What percentage is the calculated marginal effect of the maximum possible?