Skip to main content Accessibility help
×
Hostname: page-component-76fb5796d-45l2p Total loading time: 0 Render date: 2024-04-25T11:57:11.571Z Has data issue: false hasContentIssue false

12 - Unsupervised Learning

from II - Predictive Modeling Methods

Published online by Cambridge University Press:  05 August 2014

Edward W. Frees
Affiliation:
University of Wisconsin, Madison
Richard A. Derrig
Affiliation:
Temple University, Philadelphia
Glenn Meyers
Affiliation:
ISO Innovative Analytics, New Jersey
Get access

Summary

Chapter Preview. The focus of this chapter is on various methods of unsupervised learning. Unsupervised learning is contrasted with supervised learning, and the role of unsupervised learning in a supervised analysis is also discussed. The concept of dimension reduction is presented first, followed by the common methods of dimension reduction, principal components/factor analysis, and clustering. More recent developments regarding classic techniques such as fuzzy clustering are then introduced. Illustrative examples that use publicly available databases are presented. At the end of the chapter there are exercises that use data supplied with the chapter. Free R code and datasets are available on the book's website.

Introduction

Even before any of us took a formal course in statistics, we were familiar with supervised learning, though it is not referred to as such. For instance, we may read in the newspaper that people who text while driving experience an increase in accidents. When the research about texting and driving was performed, there was a dependent variable (occurrence of accident or near accident) and independent variables or predictors (use of cell phone along with other variables that predict accidents).

In finance class, students may learn about the capital asset pricing model (CAPM)

R = α + βRM + ε,

where the return on an individual stock R is a constant α plus beta times the return for market RM plus an error ε.

Type
Chapter
Information
Publisher: Cambridge University Press
Print publication year: 2014

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Aldenderfer, M. S. and R. K., Blashfield (1984). Cluster Analysis. Sage Publications.CrossRefGoogle Scholar
Brockett, P. L., R. A., Derrig, L. L., Golden, A., Levine, and M., Alpert (2002). Fraud classification using principal component analysis of RIDITs. Journal of Risk and Insurance 69(3), 341–371.CrossRefGoogle Scholar
Brzezinski, J. R. (1981). Patterns in persistency. Transactions 33, 203.Google Scholar
Christopherson, S. and D., Werland (1996). Using a geographic information system to identify territory boundaries. Casualty Actuarial Society Forum.
Coaley, K. (2010). An Introduction to Psychological Assessment and Psychometrics. Sage Publications.Google Scholar
Crawley, M. (2007). The R book.
Erin, Research (2003). Comparison analysis implications report of employer and member research. Prepared for the Society of Actuaries.
Everitt, B., S., Landan, M., Leese, and D., Stahl (2011). Cluster Analysis. Wiley, New York.CrossRefGoogle Scholar
Francis, L. (2001). Neural networks demystified. Casualty Actuarial Society Forum, 253–320.Google Scholar
Francis, L. (2003). Martian chronicles: Is MARS better than neural networks? Casualty Actuarial Society Forum, 75–102.Google Scholar
Francis, L. (2006). Review of PRIDIT. CAS Ratemaking Seminar.
Francis, L. and M., Flynn (2010). Text mining handbook. Casualty Actuarial Society E-Forum, Spring 2010, 1.Google Scholar
Francis, L. and V. R., Prevosto (2010). Data and disaster: The role of data in the financial crisis. Casualty Actuarial Society E-Forum, Spring 2010, 62.Google Scholar
Freedman, A. and C., Reynolds (2009). Cluster modeling: A new technique to improve model efficiency. CompAct. The Society of Actuaries.Google Scholar
Gorden, R. L. (1977). Unidimensional Scaling of Social Variables: Concepts and Procedures. Free Press, New York.Google Scholar
Kaufman, L. and P. J., Rousseeuw (1990). Finding Groups in Data. Wiley, New York.CrossRefGoogle Scholar
Kim, J.-O. and C. W., Mueller (1978). Factor Analysis: Statistical Methods and Practical Issues, Volume 14. Sage Publications, Thousand Oaks, CA.CrossRefGoogle Scholar
Maechler, M. (2012). Package “cluster” downloaded from CRAN website. www.r-project.org.
Mahmoud, O. (2008). A multivariate model for predicting the efficiency of financial performance of property liability insurance companies. CAS Discussion Paper Program.
Maranell, G. M. (1974). Scaling: A Sourcebook for Behavioral Scientists. Transaction Publishers.Google Scholar
Oksanen, J. (2012). Cluster analysis: Tutorial with R. http://www.statmethods.net/index.html.
Polon, J. (2008). Dental insurance fraud detection with predictive modeling. Presented at Society of Actuaries Spring Health Meeting.
Sanche, R. and K., Lonergan (2006). Variable reduction for predictive modeling with clustering. Casualty Actuarial Society Forum, 89–100.Google Scholar
Smith, L. (2002). A tutorial on principal components. www.sccg.sk/~haladova/principal_components.pdf.
Struyf, A., M., Hubert, and P., Rousseeuw (1997). Clustering in an object-oriented environment. Journal of Statistical Software 1(4), 1–30.Google Scholar
Venables, W. N., B. D., Ripley, and W., Venables (1999). Modern Applied Statistics with S-PLUS. Springer, New York.CrossRefGoogle Scholar
Weibel, E. J. and J. P., Walsh (2008). Territory analysis with mixed models and clustering. Presented at Casualty Actuarial Society Spring Meeting.
Zhang, P., X., Wang, and P. X.-K., Song (2006). Clustering categorical data based on distance vectors. Journal of the American Statistical Association 101(473), 355–367.CrossRefGoogle Scholar

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×