University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Sparse coding with adaptive dictionary learning for underdetermined blind speech separation

Xu, T, Wang, W and Dai, W (2013) Sparse coding with adaptive dictionary learning for underdetermined blind speech separation Speech Communication, 55 (3). pp. 432-450.

[img] Text
XuWD_SPCOM_2013.pdf - ["content_typename_Accepted version (post-print)" not defined]
Restricted to Repository staff only
Available under License : See the attached licence file.

Download (1MB)
[img] Text
XuWD_SPCOM_2013_preprint.pdf - ["content_typename_Accepted version (post-print)" not defined]
Restricted to Repository staff only
Available under License : See the attached licence file.

Download (4MB)
[img] PDF (licence)
SRI_deposit_agreement.pdf
Restricted to Repository staff only
Available under License : See the attached licence file.

Download (33kB)

Abstract

A block-based approach coupled with adaptive dictionary learning is presented for underdetermined blind speech separation. The proposed algorithm, derived as a multi-stage method, is established by reformulating the underdetermined blind source separation problem as a sparse coding problem. First, the mixing matrix is estimated in the transform domain by a clustering algorithm. Then a dictionary is learned by an adaptive learning algorithm for which three algorithms have been tested, including the simultaneous codeword optimization (SimCO) technique that we have proposed recently. Using the estimated mixing matrix and the learned dictionary, the sources are recovered from the blocked mixtures by a signal recovery approach. The separated source components from all the blocks are concatenated to reconstruct the whole signal. The block-based operation has the advantage of improving considerably the computational efficiency of the source recovery process without degrading its separation performance. Numerical experiments are provided to show the competitive separation performance of the proposed algorithm, as compared with the state-of-the-art approaches. Using mutual coherence and sparsity index, the performance of a variety of dictionaries that are applied in underdetermined speech separation is compared and analyzed, such as the dictionaries learned from speech mixtures and ground truth speech sources, as well as those predefined by mathematical transforms such as discrete cosine transform (DCT) and short time Fourier transform (STFT). © 2012 Elsevier B.V. All rights reserved.

Item Type: Article
Authors :
AuthorsEmailORCID
Xu, TUNSPECIFIEDUNSPECIFIED
Wang, WUNSPECIFIEDUNSPECIFIED
Dai, WUNSPECIFIEDUNSPECIFIED
Date : 2013
Identification Number : https://doi.org/10.1016/j.specom.2012.12.003
Depositing User : Symplectic Elements
Date Deposited : 28 Mar 2017 13:25
Last Modified : 28 Mar 2017 13:25
URI: http://epubs.surrey.ac.uk/id/eprint/804519

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800