Multimodal speech separation
Rivet, B and Chambers, J (2010) Multimodal speech separation Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 5933 L. pp. 1-11.
Full text not available from this repository.Abstract
The work of Bernstein and Benoît has confirmed that it is advantageous to use multiple senses, for example to employ both audio and visual modalities, in speech perception. As a consequence, looking at the speaker's face can be useful to better hear a speech signal in a noisy environment and to extract it from competing sources, as originally identified by Cherry, who posed the so-called "Cocktail Party" problem. To exploit the intrinsic coherence between audition and vision within a machine, the method of blind source separation (BSS) is particularly attractive. © 2010 Springer-Verlag.
Item Type: | Article | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Divisions : | Surrey research (other units) | |||||||||
Authors : |
|
|||||||||
Date : | 30 April 2010 | |||||||||
DOI : | 10.1007/978-3-642-11509-7_1 | |||||||||
Depositing User : | Symplectic Elements | |||||||||
Date Deposited : | 17 May 2017 13:25 | |||||||||
Last Modified : | 24 Jan 2020 23:55 | |||||||||
URI: | http://epubs.surrey.ac.uk/id/eprint/839155 |
Actions (login required)
![]() |
View Item |
Downloads
Downloads per month over past year