University of Surrey


Audio source separation with a signal-adaptive local cosine transform

Nesbit, Andrew, Plumbley, Mark D. and Davies, Mike E. (2007) Audio source separation with a signal-adaptive local cosine transform. Signal Processing, 87 (8). pp. 1848-1858.

NesbitPlumbleyDavies07-sigpro_preprint_notice.pdf - Submitted Manuscript Under Review



Audio source separation is a very challenging problem, and many different approaches have been proposed in attempts to solve it. We consider the problem of separating sources from two-channel instantaneous audio mixtures. One approach to this is to transform the mixtures into the time-frequency domain to obtain approximately disjoint representations of the sources, and then separate the sources using time-frequency masking. We focus on demixing the sources by binary masking, and assume that the mixing parameters are known. In this paper, we investigate the application of cosine packet (CP) trees as a foundation for the transform.
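As a rough illustration of binary masking with known mixing parameters, the sketch below assigns each pair of mixture coefficients to the source whose mixing direction best explains it. It is not the authors' implementation: the function name `binary_mask_demix`, the parameterisation of each source by a single mixing angle with gains (cos, sin), and the toy coefficient values are all assumptions made for the example, and the coefficients could come from any linear transform applied identically to both channels.

```python
import math

def binary_mask_demix(coeffs, angles):
    """Demix two-channel transform coefficients by binary masking.

    coeffs: list of (x1, x2) pairs, one per transform coefficient,
            taken from the two mixture channels.
    angles: assumed-known mixing angle of each source; source j is taken
            to enter the mixture with gains (cos(angle), sin(angle)).
    Returns one list of estimated coefficients per source.
    """
    directions = [(math.cos(a), math.sin(a)) for a in angles]
    estimates = [[0.0] * len(coeffs) for _ in angles]
    for k, (x1, x2) in enumerate(coeffs):
        # Project the mixture pair onto each source's mixing direction.
        projections = [d1 * x1 + d2 * x2 for d1, d2 in directions]
        # Binary mask: the whole coefficient goes to the best-matching source.
        winner = max(range(len(angles)), key=lambda j: abs(projections[j]))
        estimates[winner][k] = projections[winner]
    return estimates

# Toy example: two sources whose coefficients never overlap (disjoint).
angles = [0.3, 1.2]
sources = [[1.0, 0.0, 2.0, 0.0],
           [0.0, 3.0, 0.0, -1.0]]
coeffs = [
    (sum(math.cos(a) * s[k] for a, s in zip(angles, sources)),
     sum(math.sin(a) * s[k] for a, s in zip(angles, sources)))
    for k in range(4)
]
estimates = binary_mask_demix(coeffs, angles)
```

When the source coefficients are exactly disjoint, as in this toy case, the mask recovers them exactly. With only approximately disjoint representations, each coefficient is still assigned wholly to one source, which is why the choice of transform matters.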

We determine an appropriate transform by applying a computationally efficient best basis algorithm to a set of possible local cosine bases organised in a tree structure. We develop a heuristically motivated cost function which maximises the energy of the transform coefficients associated with a particular source. Finally, we objectively evaluate our proposed transform method by comparing it against fixed-basis transforms such as the short-time Fourier transform (STFT) and modified discrete cosine transform (MDCT). Evaluation results indicate that our proposed transform method outperforms the MDCT and is competitive with the STFT, and informal listening tests suggest that the proposed method exhibits less objectionable noise than the STFT.
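The best basis search over a tree of candidate bases can be sketched as a bottom-up comparison: a node is kept if its own cost is no worse than the combined best cost of its two children, and is split otherwise. The sketch below assumes a binary tree, an additive cost that is minimised, and made-up node costs. It does not reproduce the paper's cost function, and the name `best_basis` and the `(depth, index)` node keys are hypothetical.

```python
def best_basis(cost, max_depth):
    """Select the cheapest set of nodes covering a binary tree.

    cost: dict mapping (depth, index) to an additive cost for the basis
          attached to that node; children of (d, i) are (d+1, 2i) and
          (d+1, 2i+1). The values here are illustrative only.
    Returns (total_cost, chosen_nodes).
    """
    def solve(depth, index):
        own = cost[(depth, index)]
        if depth == max_depth:
            # Leaves cannot be split further.
            return own, [(depth, index)]
        left_cost, left_nodes = solve(depth + 1, 2 * index)
        right_cost, right_nodes = solve(depth + 1, 2 * index + 1)
        if left_cost + right_cost < own:
            # Splitting this node is cheaper than keeping it whole.
            return left_cost + right_cost, left_nodes + right_nodes
        return own, [(depth, index)]

    return solve(0, 0)

# Hypothetical costs for a tree of depth 2.
example_cost = {
    (0, 0): 10,
    (1, 0): 4, (1, 1): 7,
    (2, 0): 3, (2, 1): 2, (2, 2): 2, (2, 3): 3,
}
total, nodes = best_basis(example_cost, max_depth=2)
# total == 9, nodes == [(1, 0), (2, 2), (2, 3)]
```

Each node is visited once, so the search is linear in the number of nodes. A criterion that is to be maximised, such as an energy measure, could be plugged in by negating it, under the assumption that it remains additive across nodes.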

Item Type : Article
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
Nesbit, Andrew
Plumbley, Mark
Davies, Mike E.
Date : 7 March 2007
Funders : Engineering and Physical Sciences Research Council (EPSRC)
DOI : 10.1016/j.sigpro.2007.02.011
Copyright Disclaimer : © 2007 The Authors.
Uncontrolled Keywords : Audio source separation; Time-frequency masking; Local cosine basis
Depositing User : Symplectic Elements
Date Deposited : 17 May 2017 13:20
Last Modified : 17 Jan 2019 10:57

