Tree-based models for variable annuity valuation: parameter tuning and empirical analysis

Zhiyu Quan; Guojun Gan; Emiliano Valdez

doi:10.1017/S1748499521000075

Tree-based models for variable annuity valuation: parameter tuning and empirical analysis

Published online by Cambridge University Press: 16 March 2021

Zhiyu Quan ,

Guojun Gan and

Emiliano Valdez

Show author details

Zhiyu Quan: Affiliation:
Department of Mathematics, University of Illinois at Urbana-Champaign, Champaign, IL61801, USA
Guojun Gan*: Affiliation:
Department of Mathematics, University of Connecticut, Storrs, CT06269-1009, USA
Emiliano Valdez: Affiliation:
Department of Mathematics, University of Connecticut, Storrs, CT06269-1009, USA
*: *Corresponding author. E-mail: guojun.gan@uconn.edu

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

Variable annuities have become popular retirement and investment vehicles due to their attractive guarantee features. Nonetheless, managing the financial risks associated with the guarantees poses great challenges for insurers. One challenge is risk quantification, which involves frequent valuation of the guarantees. Insurers rely on the use of Monte Carlo simulation for valuation as the guarantees are too complicated to be valued by closed-form formulas. However, Monte Carlo simulation is computationally intensive. In this paper, we empirically explore the use of tree-based models for constructing metamodels for the valuation of the guarantees. In particular, we consider traditional regression trees, tree ensembles, and trees based on unbiased recursive partitioning. We compare the performance of tree-based models to that of existing models such as ordinary kriging and generalised beta of the second kind (GB2) regression. Our results show that tree-based models are efficient in producing accurate predictions and the gradient boosting method is considered the most superior in terms of prediction accuracy.

Keywords

Tree-based model Variable annuity Portfolio valuation Metamodelling

Type: Original Research Paper
Information: Annals of Actuarial Science , Volume 16 , Issue 1 , March 2022 , pp. 95 - 118

DOI: https://doi.org/10.1017/S1748499521000075 [Opens in a new window]
Copyright: © The Author(s), 2021. Published by Cambridge University Press on behalf of Institute and Faculty of Actuaries

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Allen, D.M. (1974). The relationship between variable selection and data agumentation and a method for prediction. Technometrics, 16(1), 125–127.CrossRef Google Scholar

Arlot, S. & Celisse, A. (2010). A survey of cross-validation procedures for model selection. Statistics Surveys, 4, 40–79.CrossRef Google Scholar

Bergstra, J. & Bengio, Y. (2012). Random search for hyper-parameter optimization. Journal of Machine Learning Research, 13, 281–305.Google Scholar

Bergstra, J.S., Bardenet, R., Bengio, Y. & Kégl, B. (2011). Algorithms for hyper-parameter optimization. In Advances in Neural Information Processing Systems (pp. 2546–2554).Google Scholar

Breiman, L. (1996). Bagging predictors. Machine Learning, 24(2), 123–140.CrossRef Google Scholar

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32.CrossRef Google Scholar

Breiman, L., Friedman, J.H., Olshen, R.A. & Stone, C.J. (1984). Classification and Regression Trees. Taylor & Francis Group, LLC, Boca Raton, FL.Google Scholar

Dang, O., Feng, M. & Hardy, M.R. (2019). Efficient nested simulation for conditional tail expectation of variable annuities. North American Actuarial Journal, 24(2), 187–210.CrossRef Google Scholar

Devroye, L. & Wagner, T. (1979). Distribution-free performance bounds for potential function rules. IEEE Transactions on Information Theory, 25(5), 601–604.CrossRef Google Scholar

Feng, B.M., Tan, Z. & Zheng, J. (2020). Efficient simulation designs for valuation of large variable annuity portfolios. North American Actuarial Journal, 24(2), 275–289.CrossRef Google Scholar

Freund, Y. & Schapire, R.E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119–139.CrossRef Google Scholar

Friedman, J.H. (2001). Greedy function approximation: a gradient boosting machine. Annals of Statistics, 29(5), 1189–1232.CrossRef Google Scholar

Gan, G. (2013). Application of data clustering and machine learning in variable annuity valuation. Insurance: Mathematics and Economics, 53(3), 795–801.Google Scholar

Gan, G. (2018). Valuation of large variable annuity portfolios using linear models with interactions. Risks, 6(3), 71.CrossRef Google Scholar

Gan, G. & Lin, X.S. (2015). Valuation of large variable annuity portfolios under nested simulation: a functional data approach. Insurance: Mathematics and Economics, 62, 138–150.Google Scholar

Gan, G. & Lin, X.S. (2017). Efficient greek calculation of variable annuity portfolios for dynamic hedging: a two-level metamodeling approach. North American Actuarial Journal, 21(2), 161–177.CrossRef Google Scholar

Gan, G. & Valdez, E.A. (2017a). Modeling partial greeks of variable annuities with dependence. Insurance: Mathematics and Economics, 76, 118–134.Google Scholar

Gan, G. & Valdez, E.A. (2017b). Valuation of large variable annuity portfolios: Monte Carlo simulation and synthetic datasets. Dependence Modeling, 5, 354–374.CrossRef Google Scholar

Gan, G. & Valdez, E.A. (2018). Regression modeling for the valuation of large variable annuity portfolios. North American Actuarial Journal, 22(1), 40–54.CrossRef Google Scholar

Geisser, S. (1974). A predictive approach to the random effect model. Biometrika, 61(1), 101–107.CrossRef Google Scholar

Geisser, S. (1975). The predictive sample reuse method with applications. Journal of the American statistical Association, 70(350), 320–328.CrossRef Google Scholar

Guelman, L., Guillén, M. & Pérez-Marn, A.M. (2014). A survey of personalized treatment models for pricing strategies in insurance. Insurance: Mathematics and Economics, 58, 68–76.Google Scholar

Gweon, H., Li, S. & Mamon, R. (2020). An effective bias-corrected bagging method for the valuation of large variable annuity portfolios. ASTIN Bulletin, 50(3), 853–871.CrossRef Google Scholar

Hardy, M. (2003). Investment Guarantees: Modeling and Risk Management for Equity-Linked Life Insurance, vol. 215. John Wiley & Sons, Hoboken, NJ.Google Scholar

Hejazi, S.A. & Jackson, K.R. (2016). A neural network approach to efficient valuation of large portfolios of variable annuities. Insurance: Mathematics and Economics, 70, 169–181.Google Scholar

Hejazi, S.A., Jackson, K.R. & Gan, G. (2017). A spatial interpolation framework for efficient valuation of large portfolios of variable annuities. Quantitative Finance and Economics, 1(2), 125–144.Google Scholar

Hothorn, T., Hornik, K. & Zeileis, A. (2006). Unbiased recursive partitioning: a conditional inference framework. Journal of Computational and Graphical Statistics, 15(3), 651–674.CrossRef Google Scholar

Ishwaran, H. (2007). Variable importance in binary regression trees and forests. Electronic Journal of Statistics, 1, 519–537.CrossRef Google Scholar

Kohavi, R. (1995). A study of cross-validation and bootstrap for accuracy estimation and model selection. In International Joint Conferences on Artificial Intelligence (IJCAI) (pp. 1137–1145), vol. 14. Montreal, Canada.Google Scholar

Larsen, J., Hansen, L.K., Svarer, C. & Ohlsson, M. (1996). Design and regularization of neural networks: the optimal use of a validation set. In Neural Networks for Signal Processing VI. Proceedings of the 1996 IEEE Signal Processing Society Workshop (pp. 62–71).CrossRef Google Scholar

Lee, S.C. & Lin, S. (2018). Delta boosting machine with application to general insurance. North American Actuarial Journal, 22(3), 405–425.CrossRef Google Scholar

Leung, D.H.-Y. (2005). Cross-validation in nonparametric regression with outliers. The Annals of Statistics, 33(5), 2291–2310.CrossRef Google Scholar

Lin, X.S. & Yang, S. (2020). Fast and efficient nested simulation for large variable annuity portfolios: a surrogate modeling approach. Insurance: Mathematics and Economics, 91, 85–103.Google Scholar

Liu, K. & Tan, K.S. (2020). Real-time valuation of large variable annuity portfolios: a green mesh approach. North American Actuarial Journal, In Press.Google Scholar

Martinez-Cantin, R. (2014). Bayesopt: a Bayesian optimization library for nonlinear optimization, experimental design and bandits. The Journal of Machine Learning Research, 15(1), 3735–3739.Google Scholar

Moors, E. (1920). On the reciprocal of the general algebraic matrix. Bulletin of the American Mathematical Society, 26, 394–395.Google Scholar

Morgan, J.N. & Sonquist, J.A. (1963). Problems in the analysis of survey data, and a proposal. Journal of the American Statistical Association, 58(302), 415–434.CrossRef Google Scholar

Mosteller, F. & Tukey, J.W. (1968). Data analysis, including statistics. Handbook of Social Psychology, 2, 80–203.Google Scholar

Nister, D. & Stewenius, H. (2006). Scalable recognition with a vocabulary tree. In 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06) (pp. 2161–2168), vol. 2.CrossRef Google Scholar

O’Brien, S.M. (2004). Cutpoint selection for categorizing a continuous predictor. Biometrics, 60(2), 504–509.CrossRef Google Scholar PubMed

Opsomer, J., Wang, Y. & Yang, Y. (2001). Nonparametric regression with correlated errors. Statistical Science, 16(2), 134–153.CrossRef Google Scholar

Penrose, R. (1955). A generalized inverse for matrices. In Mathematical Proceedings of the Cambridge Philosophical Society (pp. 406–413), vol. 51. Cambridge University Press.CrossRef Google Scholar

Quan, Z. & Valdez, E.A. (2018). Predictive analytics of insurance claims using multivariate decision trees. Dependence Modeling, 6(1), 377–407.CrossRef Google Scholar

Shao, J. (1993). Linear model selection by cross-validation. Journal of the American Statistical Association, 88(422), 486–494.CrossRef Google Scholar

Snoek, J., Larochelle, H. & Adams, R.P. (2012). Practical Bayesian optimization of machine learning algorithms. In Advances in Neural Information Processing Systems (pp. 2951–2959).Google Scholar

Stone, M. (1974). Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society. Series B (Methodological), 36(2), 111–147.CrossRef Google Scholar

Strasser, H. & Weber, C. (1999). On the asymptotic theory of permutation statistics, Vienna University of Economics and Business Administration, Augasse 2–6, A–1090 Vienna, Austria.Google Scholar

Strobl, C., Boulesteix, A.-L., Zeileis, A. & Hothorn, T. (2007). Bias in random forest variable importance measures: illustrations, sources and a solution. BMC Bioinformatics, 8(1), 25.CrossRef Google Scholar PubMed

Strobl, C., Malley, J. & Tutz, G. (2009). An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychological Methods, 14(4), 323.CrossRef Google Scholar PubMed

Westfall, P.H. & Young, S. (1993). Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment. John Wiley & Sons, Inc., New York, NY.Google Scholar

Xu, W., Chen, Y., Coleman, C. & Coleman, T.F. (2018). Moment matching machine learning methods for risk management of large variable annuity portfolios. Journal of Economic Dynamics and Control, 87, 1–20.CrossRef Google Scholar

Article contents

Tree-based models for variable annuity valuation: parameter tuning and empirical analysis

Abstract

Keywords

Access options

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests