University of Surrey


Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks

Grais, EM and Erdogan, H (2011) Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks

Full text not available from this repository.

Abstract

A single channel speech-music separation algorithm based on nonnegative matrix factorization (NMF) with sliding windows and spectral masks is proposed in this work. We train a set of basis vectors for each source signal using NMF in the magnitude spectral domain. Rather than forming each column of the matrices to be decomposed by NMF from a single spectral frame, we build the columns from multiple spectral frames stacked together. After observing the mixed signal, NMF is used to decompose its magnitude spectra into a weighted linear combination of the trained basis vectors for both sources. An initial spectrogram estimate for each source is found, and a spectral mask is built using these initial estimates. This mask is used to weight the mixed signal spectrogram to find the contributions of each source signal in the mixed signal. The method is shown to perform better than the conventional NMF approach. Copyright © 2011 ISCA.
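The pipeline described in the abstract can be sketched in a few lines of NumPy. This is an illustrative sketch only, not the authors' implementation: the function names (`nmf`, `stack_frames`, `unstack_frames`, `separate`), the Euclidean-cost multiplicative updates, the window length `L`, the number of bases `rank`, the mask exponent `p`, and the averaging of overlapping stacked frames are all assumptions made for the example, since the record does not state these details.

```python
import numpy as np

def nmf(V, rank, n_iter=200, W=None, seed=0, eps=1e-9):
    """Multiplicative-update NMF (Euclidean cost, an assumption).
    If W is supplied it is held fixed and only the weights H are learned."""
    rng = np.random.default_rng(seed)
    update_W = W is None
    if update_W:
        W = rng.random((V.shape[0], rank)) + eps
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        if update_W:
            W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def stack_frames(S, L):
    """Sliding window: stack L consecutive spectral frames into one column."""
    T = S.shape[1]
    return np.concatenate([S[:, i:T - L + 1 + i] for i in range(L)], axis=0)

def unstack_frames(X, L, F):
    """Map stacked columns back to an F x T spectrogram by averaging
    the overlapping estimates of each frame (an assumption)."""
    n = X.shape[1]
    out = np.zeros((F, n + L - 1))
    count = np.zeros(n + L - 1)
    for i in range(L):
        out[:, i:i + n] += X[i * F:(i + 1) * F, :]
        count[i:i + n] += 1
    return out / count

def separate(mix_mag, speech_train, music_train, L=3, rank=8, p=2):
    """Return (speech, music) magnitude estimates for a mixture spectrogram."""
    F = mix_mag.shape[0]
    # Training: one set of basis vectors per source, on stacked frames.
    W_s, _ = nmf(stack_frames(speech_train, L), rank, seed=0)
    W_m, _ = nmf(stack_frames(music_train, L), rank, seed=1)
    # Separation: keep the trained bases fixed, learn only the weights.
    W = np.hstack([W_s, W_m])
    _, H = nmf(stack_frames(mix_mag, L), None, W=W)
    # Initial spectrogram estimates for each source.
    S0 = unstack_frames(W_s @ H[:rank], L, F)
    M0 = unstack_frames(W_m @ H[rank:], L, F)
    # Spectral mask built from the initial estimates, applied to the mixture.
    mask = S0**p / (S0**p + M0**p + 1e-9)
    return mask * mix_mag, (1.0 - mask) * mix_mag
```

With this form of mask the two outputs sum to the mixture magnitude spectrogram; a time-domain signal would then typically be obtained by combining each estimate with the mixture phase and inverting the STFT, which is outside this sketch.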

Item Type: Conference or Workshop Item (UNSPECIFIED)
Authors :
Name | Email | ORCID
Grais, EM | UNSPECIFIED | UNSPECIFIED
Erdogan, H | UNSPECIFIED | UNSPECIFIED
Date : 1 December 2011
Depositing User : Symplectic Elements
Date Deposited : 17 May 2017 13:54
Last Modified : 17 May 2017 13:54
URI: http://epubs.surrey.ac.uk/id/eprint/840744

