University of Surrey


On the disjointess of sources in music using different time-frequency representations

Giannoulis, D, Barchiesi, D, Klapuri, A and Plumbley, MD (2011) On the disjointess of sources in music using different time-frequency representations In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2011-10-16 - 2011-10-19, New Paltz, NY.

GianoullisBarchiesiKlapuriP11-waspaa_accepted_notice.pdf - Accepted version Manuscript



This paper studies the disjointness of the time-frequency representations of simultaneously playing musical instruments. As a measure of disjointness, we use the approximate W-disjoint orthogonality proposed by Yilmaz and Rickard [1], which (loosely speaking) measures the degree of overlap of different sources in the time-frequency domain. The motivation for this study is to find a maximally disjoint representation in order to facilitate the separation and recognition of musical instruments in mixture signals. The transforms investigated in this paper include the short-time Fourier transform (STFT), the constant-Q transform, the modified discrete cosine transform (MDCT), and pitch-synchronous lapped orthogonal transforms. Simulation results are reported for a database of polyphonic music for which the multitrack data (instrument signals before mixing) were available. Absolute performance varies depending on the instrument source in question, but on average the MDCT with a 93 ms frame size performed best.
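
As a rough illustration of the disjointness measure named in the abstract, the sketch below computes an approximate W-disjoint orthogonality score for one source against the sum of all other sources, using a 0 dB binary mask (keep a time-frequency bin when the target is stronger than the interference). It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `approx_wdo` and its arguments are hypothetical, and the inputs are assumed to be coefficient arrays of equal shape produced by the same linear transform (for example STFT or MDCT).

```python
import numpy as np


def approx_wdo(target_tf, interference_tf):
    """Approximate W-disjoint orthogonality of a target source.

    Hypothetical helper for illustration only.
    target_tf       : time-frequency coefficients of the source of interest
    interference_tf : coefficients of the sum of all other sources
                      (same transform, same shape)
    Returns a value of at most 1; 1 means the target does not overlap
    with the interference inside the mask.
    """
    target_power = np.abs(np.asarray(target_tf)) ** 2
    interference_power = np.abs(np.asarray(interference_tf)) ** 2

    # 0 dB binary mask: keep bins where the target dominates.
    mask = target_power > interference_power

    total_power = target_power.sum()
    if total_power == 0:
        raise ValueError("target source has no energy")

    # (preserved target energy - leaked interference energy) / target energy
    preserved = target_power[mask].sum()
    leaked = interference_power[mask].sum()
    return (preserved - leaked) / total_power
```

With this definition, two sources occupying entirely different bins score 1, while a target that is never stronger than the interference scores 0. Comparing the score across transforms and frame sizes on the same multitrack data would mirror the kind of comparison the abstract describes.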

Item Type: Conference or Workshop Item (Conference Paper)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
Giannoulis, D
Barchiesi, D
Klapuri, A
Date : 16 October 2011
DOI : 10.1109/ASPAA.2011.6082321
Copyright Disclaimer : © 2011 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Uncontrolled Keywords : Source separation, W-disjoint orthogonality, constant Q transform, MDCT, pitch-synchronous analysis
Depositing User : Symplectic Elements
Date Deposited : 17 May 2017 13:19
Last Modified : 16 Jan 2019 18:43
