Skip to main content Accessibility help


  • Tsz Chai Fung (a1), Andrei L. Badescu (a2) and X. Sheldon Lin (a3)


This paper focuses on the estimation and application aspects of the Erlang count logit-weighted reduced mixture of experts model (EC-LRMoE), which is a fully flexible multivariate insurance claim frequency regression model. We first prove the identifiability property of the proposed model to ensure that it is a suitable candidate for statistical inference. An expectation conditional maximization (ECM) algorithm is developed for efficient model calibrations. Three simulation studies are performed to examine the effectiveness of the proposed ECM algorithm and the versatility of the proposed model. The applicability of the EC-LRMoE is shown through fitting an European automobile insurance data set. Since the data set contains several complex features, we find it necessary to adopt such a flexible model. Apart from showing excellent fitting results, we are able to interpret the fitted model in an insurance perspective and to visualize the relationship between policyholders’ information and their risk level. Finally, we demonstrate how the fitted model may be useful for insurance ratemaking.



Hide All
Asmussen, S., Nerman, O. and Olsson, M. (1996). Fitting phase-type distributions via the EM algorithm. Scandinavian Journal of Statistics 23(4), 419441.
Badescu, A. L., Lin, X. S., Tang, D. and Valdez, E. A. (2015). Multivariate Pascal mixture regression models for correlated claim frequencies. Available in SSRN:
Bermúdez, L. (2009). A priori ratemaking using bivariate Poisson regression models. Insurance: Mathematics and Economics 44(1), 135141.
Bermúdez, L. and Karlis, D. (2011). Bayesian multivariate Poisson models for insurance ratemaking. Insurance: Mathematics and Economics 48(2), 226236.
Cappé, O., Moulines, E. and Rydén, T. (2005). Inference in Hidden Markov Models. Heidelberg: Springer.
Conway, R. W. and Maxwell, W. L. (1962). A queuing model with state dependent service rates. Journal of Industrial Engineering 12(2), 132136.
Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1), 138.
Fan, J. and Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96(456), 13481360.
Frees, E. W., Lee, G. and Yang, L. (2016). Multivariate frequency/severity regression models in insurance. Risks 4(1), 4.
Friedman, J. H. (2001). Greedy function approximation: a gradient boosting machine. The Annals of Statistics 29(5), 11891232.
Fung, T. C., Badescu, A. L. and Lin, X. S. (2019a). A class of mixture of experts models for general insurance: Theoretical developments. Submitted.
Fung, T. C., Badescu, A. L. and Lin, X. S. (2019b). Multivariate Cox hidden Markov models with an application to operational risk. Scandinavian Actuarial Journal, accepted.
Gui, W., Huang, R. and Lin, X. S. (2018). Fitting the Erlang mixture model to data via a GEM-CMM algorithm. Journal of Computational and Applied Mathematics 343, 189205.
Jacobs, R. A., Jordan, M. I., Nowlan, S. J. and Hinton, G. E. (1991). Adaptive mixtures of local experts. Neural Computation 3(1), 7987.
Jiang, W. and Tanner, M. A. (1999). On the identifiability of mixtures-of-experts. Neural Networks 12(9), 12531258.
Jordan, M. I. and Jacobs, R. A. (1994). Hierarchical mixtures of experts and the EM algorithm. Neural Computation 6(2), 181214.
Kuha, J. (2004). AIC and BIC: Comparisons of assumptions and performance. Sociological Methods & Research 33(2), 188229.
Lee, S. C. K. and Lin, X. S. (2010). Modeling and evaluating insurance losses via mixtures of Erlang distributions. North American Actuarial Journal 14(1), 107130.
Lee, S. C. K. and Lin, X. S. (2012). Modeling dependent risks with multivariate Erlang mixtures. ASTIN Bulletin: The Journal of the IAA 42(1), 153180.
Lord, D., Guikema, S. D. and Geedipally, S. R. (2008). Application of the Conway–Maxwell–Poisson generalized linear model for analyzing motor vehicle crashes. Accident Analysis & Prevention 40(3), 11231134.
McLachlan, G. and Peel, D. (2000). Finite Mixture Models. Wiley Series in Probability and Statistics.
Meng, X.-L. and Rubin, D. B. (1993). Maximum likelihood estimation via the ECM algorithm: A general framework. Biometrika 80(2), 267278.
Shi, P. and Valdez, E. A. (2014). Multivariate negative binomial models for insurance claim counts. Insurance: Mathematics and Economics 55, 1829.
Teicher, H. (1963). Identifiability of finite mixtures. The Annals of Mathematical Statistics, 34(4), 12651269.
Teicher, H. (1967). Identifiability of mixtures of product measures. The Annals of Mathematical Statistics 38(4), 13001302.
Wedel, M. and DeSarbo, W. S. (1995). A mixture likelihood approach for generalized linear models. Journal of Classification 12(1), 2155.
Winkelmann, R. (1995). Duration dependence and dispersion in count-data models. Journal of Business & Economic Statistics 13(4), 467474.
Yin, C. and Lin, X. S. (2016). Efficient estimation of Erlang mixtures using iSCAD penalty with insurance application. ASTIN Bulletin: The Journal of the IAA 46(3), 779799.



  • Tsz Chai Fung (a1), Andrei L. Badescu (a2) and X. Sheldon Lin (a3)


Full text views

Total number of HTML views: 0
Total number of PDF views: 0 *
Loading metrics...

Abstract views

Total abstract views: 0 *
Loading metrics...

* Views captured on Cambridge Core between <date>. This data will be updated every 24 hours.

Usage data cannot currently be displayed.