2 - Basic audio processing

Published online by Cambridge University Press:  05 June 2016

Ian Vince McLoughlin
Affiliation:
University of Kent

Summary

Most speech and audio researchers use MATLAB as their preferred tool for audio processing, although many of us also make use of other specialised tools from time to time. These include sox for command line audio processing (particularly when there are a large number of files to convert or process, something it can do with a single command line option), and the sound capture and editing tool audacity, which can record, edit, manipulate, convert and play back numerous types of audio file. Both of these programs are extremely capable open source tools, with far more options than could be described here. However, while very useful, neither tool can replace the ability of MATLAB to easily develop scripts that make use of hundreds of built-in functions and operators, and to plot or visualise speech and other sounds in a multitude of ways.

Recorded speech or other sounds are stored within MATLAB (as well as in many other computer-based tools) as a vector of samples, with each individual value being a double precision floating point number. A sampled sound can be completely specified by the vector of these numbers as long as one other item of information is known: the sample rate at which the data was recorded. To replay the sampled sound, it is only necessary to sequentially output a voltage proportional to the stored vector information, with a gap between samples equivalent to the inverse of the sample rate.
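The relationship between the sample vector, the sample rate and playback timing can be illustrated with a minimal sketch. It is written in Python rather than MATLAB, using only the standard library, and is not taken from the chapter. The helper `make_tone`, the 440 Hz tone and the 8000 Hz sample rate are assumptions chosen purely for illustration. Python floats are double precision, so the list below mirrors the storage described above.

```python
import math

def make_tone(freq_hz, sample_rate_hz, duration_s):
    """Return a list of float samples for a sine tone (hypothetical helper)."""
    n_samples = int(round(sample_rate_hz * duration_s))
    return [math.sin(2.0 * math.pi * freq_hz * n / sample_rate_hz)
            for n in range(n_samples)]

# Assumed example values, not taken from the text.
sample_rate_hz = 8000
samples = make_tone(440.0, sample_rate_hz, 0.5)

# The gap between successive samples is the inverse of the sample rate.
sample_period_s = 1.0 / sample_rate_hz

# The vector alone gives a sample count; the rate turns it into a duration.
duration_s = len(samples) / sample_rate_hz

print(len(samples), sample_period_s, duration_s)
```

Without `sample_rate_hz`, the same list of numbers could represent a sound of any duration and pitch, which is why the sample rate must accompany the vector.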

General audio programs and tools store audio information similarly, except that they tend to use fixed point numbers rather than floating point. This can reduce the storage requirement by a factor of four (for example, when 16-bit samples replace 64-bit double precision values) at the expense of very little degradation, assuming the system is correctly designed. In particular, overflow and underflow effects usually need to be considered when designing a system that uses fixed point storage for audio, whereas in floating point-based tools such as MATLAB this is rarely a concern in practice.
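The storage saving and the overflow concern can both be seen in a small sketch, again in Python with only the standard library. It assumes 16-bit signed samples and a nominal floating point range of -1.0 to 1.0; neither assumption is stated in the text, and `float_to_int16` is a hypothetical helper. Saturation (clipping to the largest representable value) is one common way of handling overflow, not necessarily the one used by any particular tool.

```python
import struct

def float_to_int16(sample):
    """Map a float in [-1.0, 1.0] to a 16-bit integer, saturating on overflow."""
    scaled = int(round(sample * 32767.0))
    # Clamp values that a 16-bit signed integer cannot represent.
    return max(-32768, min(32767, scaled))

# The last two values exceed the nominal range and would overflow
# without the clamp above.
float_samples = [0.0, 0.5, -0.5, 1.0, 1.5, -2.0]
fixed_samples = [float_to_int16(s) for s in float_samples]

# Standard sizes: 8 bytes per double, 2 bytes per 16-bit integer.
bytes_double = struct.calcsize("<d") * len(float_samples)
bytes_int16 = struct.calcsize("<h") * len(fixed_samples)

# Pack the fixed point samples as little-endian 16-bit integers.
packed = struct.pack("<%dh" % len(fixed_samples), *fixed_samples)

print(fixed_samples, bytes_double, bytes_int16, len(packed))
```

The out-of-range inputs are clipped rather than wrapped around, which distorts the signal but avoids the far more audible sign flip that unchecked integer overflow would cause.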

Any operation that MATLAB can perform on a general vector can, in theory, be performed on stored audio. In fact, this is how we typically perform audio processing within MATLAB, and the audio vector can be loaded and saved in much the same way as any other MATLAB variable. Likewise it can be processed, added, plotted, inverted, transformed and so on.
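As a rough illustration of treating audio as an ordinary vector, the sketch below applies a few element-wise operations to short lists of samples. It is written in Python rather than MATLAB, and the function names `scale`, `invert` and `mix` are hypothetical. Polarity inversion is assumed here as one reading of "inverted"; the sample values are arbitrary.

```python
def scale(x, gain):
    """Multiply every sample by a gain factor."""
    return [gain * v for v in x]

def invert(x):
    """Flip the polarity of every sample."""
    return [-v for v in x]

def mix(x, y):
    """Add two equal-length signals sample by sample."""
    if len(x) != len(y):
        raise ValueError("signals must have the same length")
    return [a + b for a, b in zip(x, y)]

# Arbitrary example samples, chosen to be exact in binary floating point.
a = [0.0, 0.25, 0.5, -0.25]
b = [0.5, 0.5, -0.5, 0.0]

quieter = scale(a, 0.5)        # halve the amplitude
flipped = invert(a)            # polarity inversion
mixed = mix(a, b)              # sum of the two signals
backwards = list(reversed(a))  # time reversal

print(quieter, flipped, mixed, backwards)
```

None of these functions knows that its input is audio, which is the point: any general vector operation applies equally to a stored sound.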

Type: Chapter
Information: Speech and Audio Processing: A MATLAB-based Approach, pp. 9-53
Publisher: Cambridge University Press
Print publication year: 2016
