Book contents
- Frontmatter
- Contents
- Preface
- Notation
- 1 The Learning Methodology
- 2 Linear Learning Machines
- 3 Kernel-Induced Feature Spaces
- 4 Generalisation Theory
- 5 Optimisation Theory
- 6 Support Vector Machines
- 7 Implementation Techniques
- 8 Applications of Support Vector Machines
- A Pseudocode for the SMO Algorithm
- B Background Mathematics
- References
- Index
5 - Optimisation Theory
Published online by Cambridge University Press: 05 March 2013
Summary
All of the inductive strategies presented in Chapter 4 have a similar form. The hypothesis function should be chosen to minimise (or maximise) a certain functional. In the case of linear learning machines (LLMs), this amounts to finding a vector of parameters that minimises (or maximises) a certain cost function, typically subject to some constraints. Optimisation theory is the branch of mathematics concerned with characterising the solutions of classes of such problems, and developing effective algorithms for finding them. The machine learning problem has therefore been converted into a form that can be analysed within the framework of optimisation theory.
Depending on the specific cost function and on the nature of the constraints, we can distinguish a number of classes of optimisation problems that are well understood and for which efficient solution strategies exist. In this chapter we will describe some of the results that apply to cases in which the cost function is a convex quadratic function and the constraints are linear. This class of optimisation problems is called convex quadratic programming, and it is this class that proves adequate for the task of training SVMs.
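As a concrete illustration of what such a problem looks like, consider a convex quadratic programme with only equality constraints: its solution is characterised by a single linear system of optimality (Karush-Kuhn-Tucker) conditions. The sketch below, which uses NumPy and assumes a positive definite Hessian, is illustrative only; the function name and toy data are not from the book, and training an SVM additionally involves inequality constraints that this sketch does not handle.

```python
import numpy as np

def solve_eq_qp(Q, c, A, b):
    """Minimise (1/2) x^T Q x + c^T x subject to A x = b,
    for positive definite Q, by solving the KKT linear system
        [Q  A^T] [x  ]   [-c]
        [A   0 ] [lam] = [ b].
    Returns the primal solution x and the Lagrange multipliers lam."""
    n, m = Q.shape[0], A.shape[0]
    K = np.block([[Q, A.T], [A, np.zeros((m, m))]])
    rhs = np.concatenate([-c, b])
    sol = np.linalg.solve(K, rhs)
    return sol[:n], sol[n:]

# Toy example: minimise x1^2 + x2^2 subject to x1 + x2 = 1
Q = 2.0 * np.eye(2)
c = np.zeros(2)
A = np.array([[1.0, 1.0]])
b = np.array([1.0])
x, lam = solve_eq_qp(Q, c, A, b)
print(x)  # approximately [0.5 0.5]
```

The symmetry of the toy problem makes the answer easy to check by hand: the closest point to the origin on the line x1 + x2 = 1 is (0.5, 0.5).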
Optimisation theory will not only provide us with algorithmic techniques, but also define the necessary and sufficient conditions for a given function to be a solution. An example is the theory of duality, which gives a natural interpretation of the dual representation of LLMs presented in the previous chapters. Furthermore, a deeper understanding of the mathematical structure of solutions will inspire many of the specific algorithmic heuristics and implementation techniques described in Chapter 7.
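The flavour of duality can be seen numerically on a toy convex quadratic programme: minimise x1^2 + x2^2 subject to x1 + x2 = 1. Minimising the Lagrangian over the primal variables yields a dual function of the multiplier alone, whose maximum equals the primal optimum when strong duality holds. The sketch below, assuming NumPy (names and data are illustrative, not from the book), evaluates that dual function on a grid and checks this agreement.

```python
import numpy as np

# Toy QP: minimise (1/2) x^T Q x + c^T x subject to A x = b,
# i.e. minimise x1^2 + x2^2 subject to x1 + x2 = 1.
Q = 2.0 * np.eye(2)
c = np.zeros(2)
A = np.array([[1.0, 1.0]])
b = np.array([1.0])
Qinv = np.linalg.inv(Q)

def dual(lam):
    """Lagrangian dual g(lam) = min_x L(x, lam).
    For this QP, g(lam) = -(1/2)(c + A^T lam)^T Q^{-1} (c + A^T lam) - lam^T b."""
    v = c + A.T @ lam
    return -0.5 * v @ Qinv @ v - lam @ b

# Maximise the (concave) dual over a grid of multiplier values
lams = np.linspace(-3.0, 1.0, 401)
g = np.array([dual(np.array([l])) for l in lams])
print(lams[g.argmax()], g.max())  # maximiser is close to -1.0, value close to 0.5
```

The dual maximum (about 0.5, attained near lam = -1) matches the primal optimal value x1^2 + x2^2 = 0.5 at (0.5, 0.5), a numerical instance of strong duality for convex quadratic programmes with linear constraints.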
Type: Chapter
Publisher: Cambridge University Press. Print publication year: 2000