1.Barron, A.R. (1985). The strong ergodic theorem for densities: Generalized Shannon–McMillan–Breiman theorem. Annals of Probability 13: 1292–1303.
2.Bowerman, B., David, H.T. & Isaacson, D. (1977). The convergence of Cesaro averages for certain nonstationary Markov chains. Stochastic Processes and their Applications 5: 221–230.
3.Breiman, L. (1957). The individual ergodic theorem of information theory. Annals of Mathematical Statistics 28: 809–811.
4.Chung, K.L. (1961). A note on the ergodic theorem of information theory. Annals of Mathematical Statistics 32: 612–614.
5.Kieffer, J.C. (1974). A simple proof of the Moy–Perez generalization of the Shannon–McMillan theorem. Pacific Journal of Mathematics 51: 203–206.
6.McMillan, B. (1953). The basic theorems of information theory. Annals of Mathematical Statistics 24: 196–219.
7.Shannon, C. (1948). A mathematical theory of communication. Bell System Technical Journal 27: 379–423.
8.Yang, W.G. (1998). The asymptotic equipartition property for a nonhomogeneous Markov information source. Probability in the Engineering and Informational Sciences 12: 509–518.
9.Yang, W.G. (2002). Convergence in the Cesàro sense and strong law of large numbers for nonhomogeneous Markov chains. Linear Algebra and its Applications 354: 275–286.
10.Yang, W.G. (2009). Strong law of large numbers for countable nonhomogeneous Markov chains. Linear Algebra and its Applications 430: 3008–3018.
11.Dietz, Z. & Sethuraman, S. (2005). Large deviations for a class of nonhomogeneous Markov chains. Annals of Applied Probability 15: 421–486.