The Pattern Classification Problem

Martin Anthony; Peter L. Bartlett

doi:10.1017/CBO9780511624216.003

2 - The Pattern Classification Problem

Published online by Cambridge University Press: 26 February 2010

Martin Anthony and

Peter L. Bartlett

Show author details

Martin Anthony: Affiliation:
London School of Economics and Political Science
Peter L. Bartlett: Affiliation:
Australian National University, Canberra

Book contents

Get access

Summary

The Learning Problem

Introduction

In this section we describe the basic model of learning we use in this part of the book. This model is applicable to neural networks with one output unit that computes either the value 0 or 1; that is, it concerns the types of neural network used for binary classification problems. Later in the book we develop more general models of learning applicable to many other types of neural network, such as those with a real-valued output.

The definition of learning we use is formally described using the language of probability theory. For the moment, however, we move towards the definition in a fairly non-technical manner, providing some informal motivation for the technical definitions that will follow.

In very general terms, in a supervised learning environment, neural network ‘learning’ is the adjustment of the network's state in response to data generated by the environment. We assume this data is generated by some random mechanism, which is, for many applications, reasonable. The method by which the state of the network is adjusted in response to the data constitutes a learning algorithm. That is, a learning algorithm describes how to change the state in response to training data. We assume that the ‘learner’ knows little about the process generating the data. This is a reasonable assumption for many applications of neural networks: if it is known that the data is generated according to a particular type of statistical process, then in practice it might be better to take advantage of this information by using a more restricted class of functions rather than a neural network.

Type: Chapter
Information: Neural Network Learning
Theoretical Foundations
, pp. 13 - 28

DOI: https://doi.org/10.1017/CBO9780511624216.003 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 1999

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

2 - The Pattern Classification Problem

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive