Hostname: page-component-76fb5796d-wq484 Total loading time: 0 Render date: 2024-04-26T17:38:56.860Z Has data issue: false hasContentIssue false

Asymptotic Properties of Multicolor Randomly Reinforced Pólya Urns

Published online by Cambridge University Press:  22 February 2016

Li-Xin Zhang*
Affiliation:
Zhejiang University
Feifang Hu*
Affiliation:
University of Virginia
Siu Hung Cheung*
Affiliation:
The Chinese University of Hong Kong
Wai Sum Chan*
Affiliation:
The Chinese University of Hong Kong
*
Postal address: Department of Mathematics, Zhejiang University, Yuquan Campus, Hangzhou 310027, PR China. Email address: stazlx@zju.edu.cn
∗∗ Postal address: Department of Statistics, George Washington University, Washington, DC 20052, USA. Email address: feifang@gwu.edu
∗∗∗ Postal address: Department of Statistics, The Chinese University of Hong Kong, Shatin, Hong Kong, PR China. Email address: shcheung@cuhk.edu.hk
∗∗∗∗ Postal address: Department of Finance, The Chinese University of Hong Kong, Shatin, Hong Kong, PR China. Email address: chanws@cuhk.edu.hk
Rights & Permissions [Opens in a new window]

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

The generalized Pólya urn has been extensively studied and is widely applied in many disciplines. An important application of urn models is in the development of randomized treatment allocation schemes in clinical studies. The randomly reinforced urn was recently proposed, but, although the model has some intuitively desirable properties, it lacks theoretical justification. In this paper we obtain important asymptotic properties for multicolor reinforced urn models. We derive results for the rate of convergence of the number of patients assigned to each treatment under a set of minimum required conditions and provide the distributions of the limits. Furthermore, we study the asymptotic behavior for the nonhomogeneous case.

Type
General Applied Probability
Copyright
© Applied Probability Trust 

References

Aletti, G., Ghiglietti, A. and Paganoni, A. M. (2013). Randomly reinforced urn designs with prespecified allocations. J. Appl. Prob. 50, 486498.Google Scholar
Aletti, G., May, C. and Secchi, P. (2007). On the distribution of the limit proportion for a two-color, randomly reinforced urn with equal reinforcement distributions. Adv. Appl. Prob. 39, 690707.Google Scholar
Aletti, G., May, C. and Secchi, P. (2009). A central limit theorem, and related results, for a two-color randomly reinforced urn. Adv. Appl. Prob. 41, 829844.CrossRefGoogle Scholar
Aletti, G., May, C. and Secchi, P. (2012). A functional equation whose unknown is P([0,1]) valued. J. Theoret. Prob. 25, 12071232.Google Scholar
Athreya, K. B. and Ney, P. E. (1972). Branching Processes. Springer, New York.Google Scholar
Bai, Z. D. and Hu, F. (1999). Asymptotic theorems for urn models with nonhomogeneous generating matrices. Stoch. Process. Appl. 80, 87101.Google Scholar
Bai, Z.-D. and Hu, F. (2005). Asymptotics in randomized urn models. Ann. Appl. Prob. 15, 914940.Google Scholar
Bai, Z. D., Hu, F. and Rosenberger, W. F. (2002). Asymptotic properties of adaptive designs for clinical trials with delayed response. Ann. Statist. 30, 122139.Google Scholar
Beggs, A. W. (2005). On the convergence of reinforcement learning. J. Econom. Theory 122, 136.CrossRefGoogle Scholar
Berti, P., Crimaldi, I., Pratelli, L. and Rigo, P. (2010). Central limit theorems for multicolor urns with dominated colors. Stoch. Process. Appl. 120, 14731491.Google Scholar
Berti, P., Crimaldi, I., Pratelli, L. and Rigo, P. (2011). A central limit theorem and its applications to multicolor randomly reinforced urns. J. Appl. Prob. 48, 527546.Google Scholar
Chauvin, B., Pouyanne, N. and Sahnoun, R. (2011). Limit distributions for large Pólya urns. Ann. Appl. Prob. 21, 132.Google Scholar
Durham, S. D. and Yu, K. F. (1990). Randomized play-the leader rules for sequential sampling from two populations. Prob. Eng. Inf. Sci. 4, 355367.Google Scholar
Durham, S. D., Flournoy, N. and Li, W. (1998). A sequential design for maximizing the probability of a favourable response. Canad. J. Statist. 26, 479495.Google Scholar
Eggenberger, F. and Pälya, G. (1923). äber die Statistik verketteter Vorgänge. Z. Angew. Math. Mech. 3, 279289.CrossRefGoogle Scholar
Erev, I. and Roth, A. E. (1998). Predicting how people play games: reinforcement learning in experimental games with unique, mixed strategy equilibria. Amer. Econom. Rev. 88, 848881.Google Scholar
Ghiglietti, A. and Paganoni, A. M. (2013). Randomly reinforced urn designs whose allocation proportions converge to arbitrary prespecifed values. In mODa 10–Advances in Model-Oriented Design and Analysis, Springer, Cham, pp. 99106.Google Scholar
Hall, P. and Heyde, C. C. (1980). Martingale Limit Theory and Its Application. Academic Press, New York.Google Scholar
Hu, F. and Rosenberger, W. F. (2006). The Theory of Response-Adaptive Randomization in Clinical Trials. John Wiley, Hoboken, NJ.Google Scholar
Hu, F. and Zhang, L.-X. (2004). Asymptotic properties of doubly adaptive biased coin designs for multi-treatment clinical trials. Ann. Statist. 32, 268301.CrossRefGoogle Scholar
Janson, S. (2004). Functional limit theorems for multitype branching processes and generalized Pólya urns. Stoch. Process. Appl. 110, 177245.Google Scholar
Janson, S. (2006). Limit theorems for triangular urn schemes. Prob. Theory Relat. Fields 134, 417452.Google Scholar
Li, W., Durham, S. D. and Flournoy, N. (1996). Randomized Pälya urn designs. In Proceedings of the Biometric Section, American Statistical Association, Alexandria, VA, pp. 166170.Google Scholar
Martin, C. F. and Ho, Y. C. (2002). Value of information in the Polya urn process. Inf. Sci. 147, 6590.Google Scholar
May, C. and Flournoy, N. (2009). Asymptotics in response-adaptive designs generated by a two-color, randomly reinforced urn. Ann. Statist. 37, 10581078.Google Scholar
Melfi, V. F. and Page, C. (2000). Estimation after adaptive allocation. J. Statist. Planning Inference 87, 353363.Google Scholar
Muliere, P., Paganoni, A. M. and Secchi, P. (2006). A randomly reinforced urn. J. Statist. Planning Inference 136, 18531874.Google Scholar
Muliere, P., Paganoni, A. M. and Secchi, P. (2006). Randomly reinforced urns for clinical trials with continuous responses. In SIS Proc. XLIII Scientific Meeting, Cleup, Padova, pp. 403414.Google Scholar
Paganoni, A. and Secchi, P. (2007). A numerical study for comparing two response-adaptive designs for continuous treatment effects. Statist. Meth. Appl. 16, 321346.Google Scholar
Pólya, G. (1930). Sur quelques points de la théorie des probabilités. Ann. Inst. H. Poincaré 1, 117161.Google Scholar
Rosenberger, W. F. (2002). Randomized urn models and sequential design. Sequential Anal. 21, 141.Google Scholar
Rosenberger, W. F. and Lachin, J. M. (2002). Randomization in Clinical Trials. Theory and Practice. John Wiley, New York.Google Scholar
Wei, L. J. and Durham, S. (1978). The randomized play-the-winner rule in medical trials. J. Amer. Statist. Assoc. 73, 840843.Google Scholar
Zhang, L. X., Hu, F. and Cheung, S. H. (2006). Asymptotic theorems of sequential estimation-adjusted urn models. Ann. Appl. Prob. 16, 340369.Google Scholar
Zhang, L. X., Hu, F., Cheung., S. H. and Chan, W. S. (2011). Immigrated urn models—theoretical properties and applications. Ann. Statist. 39, 643671.Google Scholar