University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Multimodal speech separation

Rivet, B and Chambers, J (2010) Multimodal speech separation Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 5933 L. pp. 1-11.

Full text not available from this repository.

Abstract

The work of Bernstein and Benoît has confirmed that it is advantageous to use multiple senses, for example to employ both audio and visual modalities, in speech perception. As a consequence, looking at the speaker's face can be useful to better hear a speech signal in a noisy environment and to extract it from competing sources, as originally identified by Cherry, who posed the so-called "Cocktail Party" problem. To exploit the intrinsic coherence between audition and vision within a machine, the method of blind source separation (BSS) is particularly attractive. © 2010 Springer-Verlag.

Item Type: Article
Authors :
NameEmailORCID
Rivet, BUNSPECIFIEDUNSPECIFIED
Chambers, Jj.a.chambers@surrey.ac.ukUNSPECIFIED
Date : 30 April 2010
Identification Number : https://doi.org/10.1007/978-3-642-11509-7_1
Depositing User : Symplectic Elements
Date Deposited : 17 May 2017 13:25
Last Modified : 17 May 2017 13:25
URI: http://epubs.surrey.ac.uk/id/eprint/839155

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800