Book contents
- Frontmatter
- Contents
- Preface
- Notation
- 1 The Learning Methodology
- 2 Linear Learning Machines
- 3 Kernel-Induced Feature Spaces
- 4 Generalisation Theory
- 5 Optimisation Theory
- 6 Support Vector Machines
- 7 Implementation Techniques
- 8 Applications of Support Vector Machines
- A Pseudocode for the SMO Algorithm
- B Background Mathematics
- References
- Index
4 - Generalisation Theory
Published online by Cambridge University Press: 05 March 2013
Summary
The introduction of kernels greatly increases the expressive power of learning machines while retaining the underlying linearity that ensures learning remains tractable. The increased flexibility, however, raises the risk of overfitting, since the large number of degrees of freedom makes the choice of separating hyperplane increasingly ill-posed.
In Chapter 1 we made several references to the reliability of the statistical inferences inherent in the learning methodology. Successfully controlling the increased flexibility of kernel-induced feature spaces requires a sophisticated theory of generalisation, one that describes precisely which factors must be controlled in the learning machine in order to guarantee good generalisation. Several learning theories can be applied to this problem. The theory of Vapnik and Chervonenkis (VC) is the most appropriate for describing SVMs, and historically it motivated them, but it is also possible to give a Bayesian interpretation, among others.
In this chapter we review the main results of VC theory, which place reliable bounds on the generalisation of linear classifiers and hence indicate how to control the complexity of linear functions in kernel spaces. We also briefly review results from Bayesian statistics and compression schemes that can be used to describe such systems and to suggest which parameters to control in order to improve generalisation.
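The flavour of the VC-style bounds reviewed in the chapter can be conveyed by a small numeric sketch. The snippet below, which is illustrative rather than taken from the book (the function name and parameter choices are assumptions), evaluates the classical Vapnik bound: with probability at least 1 − δ over a sample of size m, the true risk of a classifier drawn from a class of VC dimension h exceeds its empirical risk by at most a capacity term that grows with h and shrinks with m.

```python
import math

def vc_bound(emp_risk, h, m, delta=0.05):
    """Classical VC generalisation bound (a sketch): empirical risk plus
    a capacity term depending on the VC dimension h, sample size m, and
    confidence parameter delta. Requires m > h so the log argument is > 1."""
    capacity = math.sqrt((h * (math.log(2 * m / h) + 1) + math.log(4 / delta)) / m)
    return emp_risk + capacity

# The bound tightens as the sample grows and loosens as capacity grows:
loose = vc_bound(0.05, h=100, m=1_000)      # small sample, same class
tight = vc_bound(0.05, h=100, m=100_000)    # larger sample -> smaller bound
rich = vc_bound(0.05, h=500, m=1_000)       # richer class -> larger bound
```

The qualitative behaviour is the point: controlling the capacity of the function class (here summarised by h) relative to the amount of data m is exactly what the generalisation theory of this chapter prescribes.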
- Type: Chapter
- Publisher: Cambridge University Press
- Print publication year: 2000