Speech Recognition and Understanding Systems

Nils J. Nilsson

doi:10.1017/CBO9780511819346.021

17 - Speech Recognition and Understanding Systems

Published online by Cambridge University Press: 05 August 2013

Nils J. Nilsson

Show author details

Nils J. Nilsson: Affiliation:
Stanford University

Book contents

Get access

Summary

Speech Processing

The NLP systems I have already described required that their English input be in text format. Yet, there are several instances in which speaking to a computer would be preferable to typing at one. People can generally speak faster than they can type (about three words per second versus about one word per second), and they can speak while they are moving about. Also, speaking does not tie up hands or eyes.

In discussing the problem of computer processing of speech, it is important to make some distinctions. One involves the difference between recognizing an isolated spoken word versus processing a continuous stream of speech. Most AI research has concentrated on the second and harder of these problems. Another distinction is between speech recognition and speech understanding.

By speech recognition is meant the process of converting an acoustic stream of speech input, as gathered by a microphone and associated electronic equipment, into a text representation of its component words. This process is difficult because many acoustic streams sound similar but are composed of quite different words. (Consider, for example, the spoken versions of “There are many ways to recognize speech,” and “There are many ways to wreck a nice beach.”) Speech understanding, in contrast, requires that what is spoken be understood. An utterance can be said to be understood if it elicits an appropriate action or response, and this might even be possible without recognizing all of its words.

Type: Chapter
Information: The Quest for Artificial Intelligence , pp. 209 - 223

DOI: https://doi.org/10.1017/CBO9780511819346.021 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2009

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

17 - Speech Recognition and Understanding Systems

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive