
10 - Annealed On-line Learning in Multilayer Neural Networks

Published online by Cambridge University Press: 28 January 2010

Siegfried Bös
Affiliation: Brain Science Institute, RIKEN, Wako-shi, Saitama 351-0198, Japan

Shun-Ichi Amari
Affiliation: Brain Science Institute, RIKEN, Wako-shi, Saitama 351-0198, Japan

David Saad
Affiliation: Aston University

Abstract

In this article we examine online learning with an annealed learning rate. Annealing the learning rate is necessary if online learning is to reach the optimal solution: with a fixed learning rate, the system approximates the best solution only up to fluctuations whose size is proportional to the fixed rate. It has been shown that optimal annealing makes online learning asymptotically efficient, meaning that asymptotically it learns as fast as possible. Until now, these results have been realized only in very simple networks, such as single-layer perceptrons (section 3). Even the simplest multilayer network, the soft committee machine, exhibits an additional phenomenon that makes straightforward annealing ineffective: at the beginning of learning, the committee machine is attracted to a metastable, suboptimal solution (section 4). The system stays in this metastable state for a long time and can leave it only if the learning rate is not too small, which delays the start of annealing considerably. Here we show that a non-local, or matrix, update can prevent the system from becoming trapped in the metastable phase, allowing annealing to start much earlier (section 5). Section 6 contains some remarks on the influence of the initial conditions and a possible candidate for theoretical support. The paper ends with a summary of future tasks and a conclusion.
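
The fixed-rate fluctuation and the gain from a 1/t schedule can be made concrete with a small numerical sketch. The Python snippet below is our illustration, not the chapter's soft committee machine: it trains a linear student on a noisy linear teacher by online gradient descent, and all names and parameter values (N, the base rate 0.5, the schedule eta(t) = 0.5/(1 + t/100), the noise level) are assumptions chosen for demonstration.

```python
import numpy as np

# Hedged illustration: a linear teacher-student task, not the chapter's
# soft committee machine.  All parameter values here are assumptions.
rng = np.random.default_rng(0)
N = 50                                        # input dimension
teacher = rng.normal(size=N) / np.sqrt(N)     # teacher weight vector

def run(schedule, steps=20_000, noise=0.5):
    """Online gradient descent on squared error, one fresh example per step."""
    w = np.zeros(N)                           # student weights
    for t in range(1, steps + 1):
        x = rng.normal(size=N)                # new random input
        y = teacher @ x + noise * rng.normal()  # noisy teacher output
        eta = schedule(t)                     # learning rate for this step
        w += eta * (y - w @ x) * x / N        # online delta-rule update
    return 0.5 * np.sum((w - teacher) ** 2)   # final weight error

fixed = run(lambda t: 0.5)                     # fixed rate: error plateaus
annealed = run(lambda t: 0.5 / (1 + t / 100))  # ~1/t annealing: error keeps shrinking

print(f"final error, fixed rate:    {fixed:.2e}")
print(f"final error, annealed rate: {annealed:.2e}")
```

With the fixed rate, the weight error settles at a residual level proportional to the learning rate, whereas the annealed run keeps shrinking roughly like 1/t. The chapter's concern is how to reach this annealing regime quickly in the multilayer case, where a metastable plateau must first be escaped.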

Introduction

One of the most attractive properties of artificial neural networks is their ability to learn from examples and to generalize the acquired knowledge to unknown data.

Type: Chapter
Publisher: Cambridge University Press
Print publication year: 1999
