University of Surrey

Adaptation of speaker-specific bases in non-negative matrix factorization for single channel speech-music separation

Grais, EM and Erdogan, H (2011) Adaptation of speaker-specific bases in non-negative matrix factorization for single channel speech-music separation

Full text not available from this repository.

Abstract

This paper introduces a speaker adaptation algorithm for non-negative matrix factorization (NMF) models. The proposed algorithm combines Bayesian adaptation with subspace model adaptation. The adapted model is used to separate a speech signal from a background music signal in a single-channel recording. Training speech data from multiple speakers is used with NMF to train a set of basis vectors that serves as a general model for speech signals. The probabilistic interpretation of NMF is used to perform Bayesian adaptation, which adjusts the general model to the actual properties of the speech signal observed in the mixed signal. The Bayesian-adapted model is then adapted again by a linear transform, which changes the subspace spanned by the model so that it better matches the speech signal in the mixture. The experimental results show that combining Bayesian adaptation with linear transform adaptation improves the separation results. Copyright © 2011 ISCA.
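The full text is not available here, so the following is only a minimal sketch of the baseline that the abstract builds on: supervised NMF separation with pre-trained speech and music bases. It assumes magnitude spectrograms as input, KL-divergence multiplicative updates, and a soft mask for reconstruction; none of these choices are confirmed by the abstract. The function names `nmf_kl` and `separate` are hypothetical, and the paper's Bayesian and linear transform adaptation steps are not implemented.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero


def nmf_kl(V, n_bases, n_iter=200, B=None, seed=0):
    """Approximate V ~= B @ G with KL-divergence multiplicative updates.

    If B is supplied it is held fixed and only the gains G are estimated
    (n_bases is then ignored in favour of B.shape[1]).
    """
    rng = np.random.default_rng(seed)
    update_B = B is None
    if update_B:
        B = rng.random((V.shape[0], n_bases)) + EPS
    G = rng.random((B.shape[1], V.shape[1])) + EPS
    ones = np.ones_like(V)
    for _ in range(n_iter):
        R = V / (B @ G + EPS)
        G *= (B.T @ R) / (B.T @ ones + EPS)
        if update_B:
            R = V / (B @ G + EPS)
            B *= (R @ G.T) / (ones @ G.T + EPS)
            # Normalise each basis vector to unit sum; move the scale into G.
            scale = B.sum(axis=0, keepdims=True) + EPS
            B /= scale
            G *= scale.T
    return B, G


def separate(V_mix, B_speech, B_music, n_iter=200):
    """Split a mixture spectrogram using fixed speech and music bases."""
    B = np.hstack([B_speech, B_music])
    _, G = nmf_kl(V_mix, B.shape[1], n_iter=n_iter, B=B)
    k = B_speech.shape[1]
    S = B_speech @ G[:k]   # speech part of the model
    M = B_music @ G[k:]    # music part of the model
    mask = S / (S + M + EPS)  # soft mask (an assumption of this sketch)
    return mask * V_mix, (1.0 - mask) * V_mix
```

In this sketch, `B_speech` would come from calling `nmf_kl` on spectrograms of multi-speaker training speech, which corresponds to the general speech model described in the abstract. The adaptation steps the paper proposes would modify `B_speech` before or during the call to `separate`.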

Item Type: Conference or Workshop Item (UNSPECIFIED)
Authors:
Name | Email | ORCID
Grais, EM | UNSPECIFIED | UNSPECIFIED
Erdogan, H | UNSPECIFIED | UNSPECIFIED
Date: 1 December 2011
Depositing User: Symplectic Elements
Date Deposited: 17 May 2017 13:54
Last Modified: 17 May 2017 13:54
URI: http://epubs.surrey.ac.uk/id/eprint/840743

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800