University of Surrey


Multimodal speech separation

Rivet, B and Chambers, J (2010) Multimodal speech separation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 5933 L. pp. 1-11.

Full text not available from this repository.


The work of Bernstein and Benoît has confirmed that it is advantageous to use multiple senses, for example to employ both audio and visual modalities, in speech perception. As a consequence, looking at the speaker's face can be useful to better hear a speech signal in a noisy environment and to extract it from competing sources, as originally identified by Cherry, who posed the so-called "Cocktail Party" problem. To exploit the intrinsic coherence between audition and vision within a machine, the method of blind source separation (BSS) is particularly attractive. © 2010 Springer-Verlag.

Item Type: Article
Divisions : Surrey research (other units)
Authors : Rivet, B
Date : 30 April 2010
DOI : 10.1007/978-3-642-11509-7_1
Depositing User : Symplectic Elements
Date Deposited : 17 May 2017 13:25
Last Modified : 24 Jan 2020 23:55

