Published online by Cambridge University Press: 05 July 2012
This book is an introduction to multimodal signal processing. In it, we use the goal of building applications that can understand meetings as a way to focus and motivate the processing we describe. Multimodal signal processing takes the outputs of capture devices running at the same time – primarily cameras and microphones, but also electronic whiteboards and pens – and automatically analyzes them to make sense of what is happening in the space being recorded. For instance, these analyses might indicate who spoke, what was said, whether there was an active discussion, and who was dominant in it. These analyses require the capture of multimodal data from a range of signals, followed by low-level automatic annotation, with further layers of annotation built up until information that meets user requirements is extracted.
Multimodal signal processing can be done in real time, that is, fast enough to build applications that influence the group while they are together, or offline, often (though not always) at higher quality, for later review of what went on. It can also be done for groups that are all together in one space, typically an instrumented meeting room, or for groups that are in different spaces but use technology such as videoconferencing to communicate. The book thus introduces automatic approaches to capturing, processing, and ultimately understanding human interaction in meetings, and describes the state of the art for all technologies involved.