Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS
Liu, Q, Wang, W and Jackson, PJB (2010) Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS In: 9th International Conference on Latent Variable Analysis and Signal Separation (formerly the International Conference on Independent Component Analysis and Signal Separation), 2010-09-27 - 2010-09-30, St. Malo, France.
LiuWangJackson_LVA10.pdf - Accepted Version
Available under License : See the attached licence file.
Plain Text (licence)
Recent studies show that visual information contained in visual speech can be helpful for the performance enhancement of audio-only blind source separation (BSS) algorithms. Such information is exploited through the statistical characterisation of the coherence between the audio and visual speech using, e.g. a Gaussian mixture model (GMM). In this paper, we present two new contributions. An adapted expectation maximization (AEM) algorithm is proposed in the training process to model the audio-visual coherence upon the extracted features. The coherence is exploited to solve the permutation problem in the frequency domain using a new sorting scheme. We test our algorithm on the XM2VTS multimodal database. The experimental results show that our proposed algorithm outperforms traditional audio-only BSS.
|Item Type:||Conference or Workshop Item (Paper)|
|Additional Information:||The original publication is available at http://www.springerlink.com|
|Divisions:||Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing|
|Depositing User:||Symplectic Elements|
|Date Deposited:||09 Dec 2011 11:27|
|Last Modified:||09 Jun 2014 13:45|
Actions (login required)
Downloads per month over past year