Hostname: page-component-7c8c6479df-24hb2 Total loading time: 0 Render date: 2024-03-28T08:53:53.124Z Has data issue: false hasContentIssue false

Using auxiliary information in statistical function estimation

Published online by Cambridge University Press:  16 December 2005

Sergey Tarima
Affiliation:
Division of Biostatistics, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, Wisconsin, 53226, USA; starima@hpi.mcw.edu
Dmitri Pavlov
Affiliation:
Clinical Biostatistics, Pfizer Inc., 50 Pequot Avenue, New London, Connecticut, 06320, USA; dmitri.pavlov@pfizer.com
Get access

Abstract

In many practical situations sample sizes are not sufficiently large and estimators based on such samples may not be satisfactory in terms of their variances. At the same time it is not unusual that some auxiliary information about the parameters of interest is available. This paper considers a method of using auxiliary information for improving properties of the estimators based on a current sample only. In particular, it is assumed that the information is available as a number of estimates based on samples obtained from some other mutually independent data sources. This method uses the fact that there is a correlation effect between estimators based on the current sample and auxiliary information from other sources. If variance covariance matrices of vectors of estimators used in the estimating procedure are known, this method produces more efficient estimates in terms of their variances compared to the estimates based on the current sample only. If these variance-covariance matrices are not known, their consistent estimates can be used as well such that the large sample properties of the method remain unchangeable. This approach allows to improve statistical properties of many standard estimators such as an empirical cumulative distribution function, empirical characteristic function, and Nelson-Aalen cumulative hazard estimator.

Type
Research Article
Copyright
© EDP Sciences, SMAI, 2006

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Chambers, R.L. and Dunstan, R., Estimating distribution functions from survey data. Biometrika 73 (1986) 597604. CrossRef
Y.G. Dmitriev and Y.C. Ustinov, Statistical estimation of probability distribution with auxiliary information [in Russian]. Tomsk State University, Tomsk (1988).
T.R. Fleming and D.P. Harrington, Counting processes and survival analysis. Wiley (1991).
Gal'chenko, M.V. and Gurevich, V.A., Minimum-contrast estimation taking into account additional information. J. Soviet Math. 53 (1991) 547551. CrossRef
D. Holt and D. Elliot, Methods of weighting for unit non-response. The Statistician, Special Issue: Survey Design, Methodology and Analysis 40 (1991) 333–342.
Haberman, S.J., Adjustment by minimum discriminant information. Ann. Statist. 12 (1984) 121140. CrossRef
Kuk, A.Y.C. and Mak, T.K., Median estimation in the presence of auxiliary information. J. R. Statist. Soc. B 51 (1989) 261269.
G. Kulldorff, Contribution to the theory of estimation from grouped and partially grouped samples. Almqvist & Wiksell, Stockholm (1961).
R.J.A. Little and D.B. Rubin, Statistical analysis with missing data. Wiley (2002).
A.B. Owen, Empirical likelihood. Chapman and Hall (2001).
V.N. Pugachev, Mixed methods of determining probabilistic characteristics [in Russian]. Soviet Radio, Moscow (1973).
Rao, J.N.K., Kovar, J.G. and Mantel, H.J., On estimating distribution functions and quantiles from survey data using auxiliary information. Biometrika 77 (1990) 365375. CrossRef
Zhang, B., Confidence intervals for a distribution function in the presence of auxiliary information. Comput. Statist. Data Anal. 21 (1996) 327342. CrossRef