Source localization and separation using random sample consensus with phase cues
Litwic, Ł and Jackson, PJB (2011) Source localization and separation using random sample consensus with phase cues. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. 337 - 340. ISSN 1931-1168
In this paper we present a system for localization and separation of multiple speech sources using phase cues. The novelty of the method is its use of a Random Sample Consensus (RANSAC) approach to find consistency of interaural phase differences (IPDs) across the whole frequency range. This approach is inherently free from phase ambiguity problems and enables all phase data to contribute to localization. RANSAC is also robust against outliers, which enables multiple-source localization with phase data contaminated by reverberation noise. The results of RANSAC-based localization are fed into a mixture model to generate time-frequency binary masks for separation. The system is compared against other well-known methods and shows similar or improved performance in reverberant conditions.
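The idea of finding IPD consistency across frequency with RANSAC can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single source, a simple free-field model in which the IPD at frequency f is the wrapped value of 2πfτ for an interaural delay τ, and hypothetical names and parameters (`ransac_delay`, `max_delay`, `tol`, `n_iter`). Each iteration draws one random (frequency, IPD) sample, enumerates every delay consistent with it under phase wrapping, and keeps the delay that the most samples agree with. Because residuals are compared after wrapping, no phase unwrapping is needed.

```python
import numpy as np

def wrap_phase(x):
    """Wrap angles in radians to the interval [-pi, pi)."""
    return (x + np.pi) % (2 * np.pi) - np.pi

def ransac_delay(freqs, ipds, max_delay=1e-3, tol=0.3, n_iter=200, seed=0):
    """Estimate one interaural delay (seconds) from wrapped IPDs.

    Assumed model (not taken from the paper): ipd = wrap(2*pi*f*tau).
    Returns the delay with the largest consensus set and its inlier mask.
    """
    freqs = np.asarray(freqs, dtype=float)
    ipds = np.asarray(ipds, dtype=float)
    rng = np.random.default_rng(seed)
    best_tau, best_count = 0.0, -1
    for _ in range(n_iter):
        i = rng.integers(len(freqs))
        f, phi = freqs[i], ipds[i]
        if f <= 0:
            continue
        # One sample is consistent with several delays, one per 2*pi wrap.
        k_max = int(np.floor(f * max_delay + 0.5))
        for k in range(-k_max, k_max + 1):
            tau = (phi + 2 * np.pi * k) / (2 * np.pi * f)
            if abs(tau) > max_delay:
                continue
            # Score the hypothesis by counting samples with a small wrapped residual.
            resid = wrap_phase(ipds - 2 * np.pi * freqs * tau)
            count = int(np.sum(np.abs(resid) < tol))
            if count > best_count:
                best_tau, best_count = tau, count
    inliers = np.abs(wrap_phase(ipds - 2 * np.pi * freqs * best_tau)) < tol
    return best_tau, inliers

# Synthetic check: one source at 0.4 ms, with 40% of samples replaced by
# uniformly random phases standing in for reverberant outliers.
rng = np.random.default_rng(1)
freqs = np.linspace(100.0, 8000.0, 400)
true_tau = 0.4e-3
ipds = wrap_phase(2 * np.pi * freqs * true_tau + 0.05 * rng.standard_normal(freqs.size))
outliers = rng.random(freqs.size) < 0.4
ipds[outliers] = rng.uniform(-np.pi, np.pi, outliers.sum())

tau_hat, inliers = ransac_delay(freqs, ipds)
print(f"estimated delay: {tau_hat * 1e3:.3f} ms, inliers: {inliers.sum()} of {freqs.size}")
```

Under the same assumptions, further sources could be sought by removing the inliers of the first delay and running the search again; how the paper handles multiple sources and the subsequent mixture model is not specified in this record.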
|Divisions :||Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing|
|Date :||19 October 2011|
|Identification Number :||10.1109/ASPAA.2011.6082334|
|Additional Information :||Copyright 2008 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.|
|Depositing User :||Symplectic Elements|
|Date Deposited :||17 Feb 2012 15:58|
|Last Modified :||23 Sep 2013 19:03|