AN AVERAGING ESTIMATOR FOR TWO-STEP M-ESTIMATION IN SEMIPARAMETRIC MODELS

Ruoyao Shi

doi:10.1017/S0266466622000548

Published online by Cambridge University Press: 07 November 2022

Ruoyao Shi*
Affiliation: University of California, Riverside
*Address correspondence to Ruoyao Shi, Department of Economics, University of California, Riverside, Riverside, CA, USA; e-mail: ruoyao.shi@ucr.edu.

Abstract

In a two-step extremum estimation (M-estimation) framework with a finite-dimensional parameter of interest and a potentially infinite-dimensional first-step nuisance parameter, this paper proposes an averaging estimator that combines a semiparametric estimator based on a nonparametric first step and a parametric estimator which imposes parametric restrictions on the first step. The averaging weight is an easy-to-compute sample analog of an infeasible optimal weight that minimizes the asymptotic quadratic risk. Under Stein-type conditions, the asymptotic lower bound of the truncated quadratic risk difference between the averaging estimator and the semiparametric estimator is strictly less than zero for a class of data generating processes that includes both correct specification and varied degrees of misspecification of the parametric restrictions, and the asymptotic upper bound is weakly less than zero. The averaging estimator, along with an easy-to-implement inference method, is demonstrated in an example.
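
To make the role of the averaging weight concrete, here is a minimal sketch under stated assumptions; it is not the paper's formula, which the abstract does not give. Write the averaging estimator as theta_w = (1 - w) * theta_sp + w * theta_p and let d = theta_p - theta_sp. The quadratic risk E[(theta_w - theta)' U (theta_w - theta)] is a quadratic function of w and is minimized at w* = -E[d' U (theta_sp - theta)] / E[d' U d]. If the semiparametric estimator is approximately unbiased, the numerator is approximately tr(U (V_sp - C)) / n, where V_sp is the asymptotic variance of the semiparametric estimator and C is its asymptotic covariance with the parametric estimator. One simple sample analog replaces E[d' U d] with d' U d. The function names, the argument names (`v_sp`, `c_sp_p`, `upsilon`), and the clipping to [0, 1] are all assumptions made for illustration.

```python
from typing import List, Sequence

Matrix = List[List[float]]


def trace_product(a: Matrix, b: Matrix) -> float:
    """Return tr(A B) for two square matrices of equal size."""
    k = len(a)
    return sum(a[i][j] * b[j][i] for i in range(k) for j in range(k))


def quad_form(x: Sequence[float], w: Matrix) -> float:
    """Return the quadratic form x' W x."""
    k = len(x)
    return sum(x[i] * w[i][j] * x[j] for i in range(k) for j in range(k))


def plug_in_weight(
    theta_sp: Sequence[float],
    theta_p: Sequence[float],
    v_sp: Matrix,
    c_sp_p: Matrix,
    n: int,
    upsilon: Matrix,
) -> float:
    """Hypothetical plug-in weight placed on the parametric estimator.

    v_sp    : estimated asymptotic variance of sqrt(n) * (theta_sp - theta)
    c_sp_p  : estimated asymptotic covariance between the two sqrt(n)-scaled
              estimators (rows index theta_sp, columns index theta_p)
    upsilon : symmetric positive semidefinite matrix defining the quadratic loss
    """
    d = [p - s for p, s in zip(theta_p, theta_sp)]
    k = len(d)
    v_minus_c = [[v_sp[i][j] - c_sp_p[i][j] for j in range(k)] for i in range(k)]
    numerator = trace_product(upsilon, v_minus_c)
    denominator = n * quad_form(d, upsilon)
    if denominator == 0.0:
        # The two estimators coincide, so any weight gives the same average.
        return 1.0
    # Clipping to [0, 1] is an illustrative choice, not taken from the paper.
    return min(1.0, max(0.0, numerator / denominator))


def averaged_estimate(
    theta_sp: Sequence[float], theta_p: Sequence[float], weight: float
) -> List[float]:
    """Return (1 - weight) * theta_sp + weight * theta_p, element by element."""
    return [(1.0 - weight) * s + weight * p for s, p in zip(theta_sp, theta_p)]
```

In this sketch the weight shrinks toward zero when the two estimates are far apart relative to the sampling noise (a large n * d' U d), which points to misspecified parametric restrictions, and it moves toward one when the estimates nearly agree.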

Type: ARTICLES
Information: Econometric Theory, Volume 40, Issue 3, June 2024, pp. 652–687

DOI: https://doi.org/10.1017/S0266466622000548
Copyright: © The Author(s), 2022. Published by Cambridge University Press

Footnotes

The comments from the Editor (Peter C.B. Phillips), the Associate Editor (Patrik Guggenberger), and two anonymous referees were vastly helpful in improving this paper. The author also thanks Colin Cameron, Xu Cheng, Denis Chetverikov, Yanqin Fan, Jinyong Hahn, Bo Honoré, Toru Kitagawa, Zhipeng Liao, Hyungsik Roger Moon, Whitney Newey, Geert Ridder, Aman Ullah, Haiqing Xu, and the participants at various seminars and conferences for helpful comments. This project was generously supported by UC Riverside Regents’ Faculty Fellowship 2019–2020. Zhuozhen Zhao provided great research assistance. All remaining errors are the author’s.

