Convergence Properties in Certain Occupancy Problems Including the Karlin-Rouault Law

Estáte V. Khmaladze

doi:10.1239/jap/1324046021


Published online by Cambridge University Press: 14 July 2016

Affiliation: Estáte V. Khmaladze, Victoria University of Wellington

Postal address: School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, PO Box 600, Wellington, 2052, New Zealand. Email address: estate.khmaladze@msor.vuw.ac.nz


Abstract


Let $x$ denote a vector of length $q$ consisting of 0s and 1s. It can be interpreted as an ‘opinion’ comprised of a particular set of responses to a questionnaire consisting of $q$ questions, each having $\{0, 1\}$-valued answers. Suppose that the questionnaire is answered by $n$ individuals, thus providing $n$ ‘opinions’. Probabilities of the answer 1 to each question can be, basically, arbitrary and different for different questions. Out of the $2^q$ different opinions, what number, $\mu_n$, would one expect to see in the sample? How many of these opinions, $\mu_n(k)$, will occur exactly $k$ times? In this paper we give an asymptotic expression for $\mu_n / 2^q$ and the limit for the ratios $\mu_n(k)/\mu_n$, when the number of questions $q$ increases along with the sample size $n$ so that $n = \lambda 2^q$, where $\lambda$ is a constant. Let $p(x)$ denote the probability of opinion $x$. The key step in proving the asymptotic results as indicated is the asymptotic analysis of the joint behaviour of the intensities $np(x)$. For example, one of our results states that, under certain natural conditions, for any $z > 0$, $\sum \mathbf{1}\{np(x) > z\} = d_n z^{-u}$, $d_n = o(2^q)$.
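The quantities named in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions and is not taken from the paper: answers to different questions are assumed independent, the answer probabilities in `probs` are arbitrary illustrative values, and the function name `occupancy_counts` is hypothetical. It draws $n = \lambda 2^q$ opinions and reports $\mu_n$, $\mu_n / 2^q$ and the first few ratios $\mu_n(k)/\mu_n$.

```python
import random
from collections import Counter


def occupancy_counts(q, lam, probs, seed=0):
    """Simulate n = round(lam * 2**q) respondents (hypothetical helper).

    Assumption: answers to the q questions are independent, and
    question j is answered 1 with probability probs[j].
    """
    rng = random.Random(seed)
    n = round(lam * 2 ** q)
    # Each opinion is a tuple of q answers in {0, 1}.
    sample = Counter(
        tuple(int(rng.random() < p) for p in probs) for _ in range(n)
    )
    mu_n = len(sample)                 # number of distinct opinions observed
    mu_n_k = Counter(sample.values())  # mu_n_k[k]: opinions seen exactly k times
    return n, mu_n, mu_n_k


q = 10
# Illustrative (assumed) answer probabilities, different for each question.
probs = [0.2 + 0.6 * j / (q - 1) for j in range(q)]
n, mu_n, mu_n_k = occupancy_counts(q, lam=1.0, probs=probs)

print("n =", n, " mu_n =", mu_n, " mu_n / 2^q =", mu_n / 2 ** q)
print({k: mu_n_k[k] / mu_n for k in sorted(mu_n_k)[:5]})
```

Increasing `q` with `lam` held fixed mimics the regime $n = \lambda 2^q$ described above, although a finite simulation can only suggest the limiting behaviour.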

Keywords

Number of unique outcomes; sparse tables; Karlin-Rouault law; Zipf's law; Good-Turing index; large deviations; contiguity

MSC classification

Secondary: 62D05: Sampling theory, sample surveys; 62E20: Asymptotic distribution theory; 60E05: Distributions; 60F10: Large deviations

Type: Research Papers
Information: Journal of Applied Probability, Volume 48, Issue 4, December 2011, pp. 1095-1113

DOI: https://doi.org/10.1239/jap/1324046021
Copyright: Copyright © Applied Probability Trust 2011

