Unsupervised Learning

doi:10.1017/CBO9781139342674.012

12 - Unsupervised Learning

from II - Predictive Modeling Methods

Published online by Cambridge University Press: 05 August 2014

Louise Francis

Edited by

Edward W. Frees ,

Richard A. Derrig and

Glenn Meyers

Show author details

Edward W. Frees: Affiliation:
University of Wisconsin, Madison
Richard A. Derrig: Affiliation:
Temple University, Philadelphia
Glenn Meyers: Affiliation:
ISO Innovative Analytics, New Jersey

Book contents

Get access

Summary

Chapter Preview. The focus of this chapter is on various methods of unsupervised learning. Unsupervised learning is contrasted with supervised learning, and the role of unsupervised learning in a supervised analysis is also discussed. The concept of dimension reduction is presented first, followed by the common methods of dimension reduction, principal components/factor analysis, and clustering. More recent developments regarding classic techniques such as fuzzy clustering are then introduced. Illustrative examples that use publicly available databases are presented. At the end of the chapter there are exercises that use data supplied with the chapter. Free R code and datasets are available on the book's website.

Introduction

Even before any of us took a formal course in statistics, we were familiar with supervised learning, though it is not referred to as such. For instance, we may read in the newspaper that people who text while driving experience an increase in accidents. When the research about texting and driving was performed, there was a dependent variable (occurrence of accident or near accident) and independent variables or predictors (use of cell phone along with other variables that predict accidents).

In finance class, students may learn about the capital asset pricing model (CAPM)

R = α + βRM + ε,

where the return on an individual stock R is a constant α plus beta times the return for market RM plus an error ε.

Type: Chapter
Information: Predictive Modeling Applications in Actuarial Science , pp. 280 - 312

DOI: https://doi.org/10.1017/CBO9781139342674.012 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2014

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Aldenderfer, M. S. and R. K., Blashfield (1984). Cluster Analysis. Sage Publications.CrossRef Google Scholar

Brockett, P. L., R. A., Derrig, L. L., Golden, A., Levine, and M., Alpert (2002). Fraud classification using principal component analysis of RIDITs. Journal of Risk and Insurance 69(3), 341–371.CrossRef Google Scholar

Brzezinski, J. R. (1981). Patterns in persistency. Transactions 33, 203.Google Scholar

Christopherson, S. and D., Werland (1996). Using a geographic information system to identify territory boundaries. Casualty Actuarial Society Forum.

Coaley, K. (2010). An Introduction to Psychological Assessment and Psychometrics. Sage Publications.Google Scholar

Crawley, M. (2007). The R book.

Erin, Research (2003). Comparison analysis implications report of employer and member research. Prepared for the Society of Actuaries.

Everitt, B., S., Landan, M., Leese, and D., Stahl (2011). Cluster Analysis. Wiley, New York.CrossRef Google Scholar

Francis, L. (2001). Neural networks demystified. Casualty Actuarial Society Forum, 253–320.Google Scholar

Francis, L. (2003). Martian chronicles: Is MARS better than neural networks? Casualty Actuarial Society Forum, 75–102.Google Scholar

Francis, L. (2006). Review of PRIDIT. CAS Ratemaking Seminar.

Francis, L. and M., Flynn (2010). Text mining handbook. Casualty Actuarial Society E-Forum, Spring 2010, 1.Google Scholar

Francis, L. and V. R., Prevosto (2010). Data and disaster: The role of data in the financial crisis. Casualty Actuarial Society E-Forum, Spring 2010, 62.Google Scholar

Freedman, A. and C., Reynolds (2009). Cluster modeling: A new technique to improve model efficiency. CompAct. The Society of Actuaries.Google Scholar

Gorden, R. L. (1977). Unidimensional Scaling of Social Variables: Concepts and Procedures. Free Press, New York.Google Scholar

Kaufman, L. and P. J., Rousseeuw (1990). Finding Groups in Data. Wiley, New York.CrossRef Google Scholar

Kim, J.-O. and C. W., Mueller (1978). Factor Analysis: Statistical Methods and Practical Issues, Volume 14. Sage Publications, Thousand Oaks, CA.CrossRef Google Scholar

Maechler, M. (2012). Package “cluster” downloaded from CRAN website. www.r-project.org.

Mahmoud, O. (2008). A multivariate model for predicting the efficiency of financial performance of property liability insurance companies. CAS Discussion Paper Program.

Maranell, G. M. (1974). Scaling: A Sourcebook for Behavioral Scientists. Transaction Publishers.Google Scholar

Oksanen, J. (2012). Cluster analysis: Tutorial with R. http://www.statmethods.net/index.html.

Polon, J. (2008). Dental insurance fraud detection with predictive modeling. Presented at Society of Actuaries Spring Health Meeting.

Sanche, R. and K., Lonergan (2006). Variable reduction for predictive modeling with clustering. Casualty Actuarial Society Forum, 89–100.Google Scholar

Smith, L. (2002). A tutorial on principal components. www.sccg.sk/~haladova/principal_components.pdf.

Struyf, A., M., Hubert, and P., Rousseeuw (1997). Clustering in an object-oriented environment. Journal of Statistical Software 1(4), 1–30.Google Scholar

Venables, W. N., B. D., Ripley, and W., Venables (1999). Modern Applied Statistics with S-PLUS. Springer, New York.CrossRef Google Scholar

Weibel, E. J. and J. P., Walsh (2008). Territory analysis with mixed models and clustering. Presented at Casualty Actuarial Society Spring Meeting.

Zhang, P., X., Wang, and P. X.-K., Song (2006). Clustering categorical data based on distance vectors. Journal of the American Statistical Association 101(473), 355–367.CrossRef Google Scholar

Book contents

12 - Unsupervised Learning

Summary

Access options

References

Save book to Kindle

Save book to Dropbox

Save book to Google Drive