University of Surrey

Test tubes in the lab Research in the ATI Dance Research

On the disjointess of sources in music using different time-frequency representations

Giannoulis, D, Barchiesi, D, Klapuri, A and Plumbley, MD (2011) On the disjointess of sources in music using different time-frequency representations In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2011-10-16 - 2011-10-19, New Paltz, NY.

[img]
Preview
Text
GianoullisBarchiesiKlapuriP11-waspaa_accepted_notice.pdf - Accepted version Manuscript

Download (133kB) | Preview

Abstract

This paper studies the disjointness of the time-frequency representations of simultaneously playing musical instruments. As a measure of disjointness, we use the approximate W-disjoint orthogonality as proposed by Yilmaz and Rickard [1], which (loosely speaking) measures the degree of overlap of different sources in the time-frequency domain. The motivation for this study is to find a maximally disjoint representation in order to facilitate the separation and recognition of musical instruments in mixture signals. The transforms investigated in this paper include the short-time Fourier transform (STFT), constant-Q transform, modified discrete cosine transform (MDCT), and pitch-synchronous lapped orthogonal transforms. Simulation results are reported for a database of polyphonic music where the multitrack data (instrument signals before mixing) were available. Absolute performance varies depending on the instrument source in question, but on the average MDCT with 93 ms frame size performed best.

Item Type: Conference or Workshop Item (Conference Paper)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
NameEmailORCID
Giannoulis, DUNSPECIFIEDUNSPECIFIED
Barchiesi, DUNSPECIFIEDUNSPECIFIED
Klapuri, AUNSPECIFIEDUNSPECIFIED
Plumbley, MDm.plumbley@surrey.ac.ukUNSPECIFIED
Date : 16 October 2011
Identification Number : 10.1109/ASPAA.2011.6082321
Copyright Disclaimer : © 2011 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Contributors :
ContributionNameEmailORCID
UNSPECIFIEDIEEE, UNSPECIFIEDUNSPECIFIED
Uncontrolled Keywords : Source separation, W-disjoint orthogonality, constant Q transform, MDCT, pitch-synchronous analysis
Related URLs :
Depositing User : Symplectic Elements
Date Deposited : 17 May 2017 13:19
Last Modified : 29 Nov 2017 12:07
URI: http://epubs.surrey.ac.uk/id/eprint/838783

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800