University of Surrey


A source separation evaluation method in object-based spatial audio

Liu, Q, Wang, W, Jackson, PJB and Cox, TJ (2015) A source separation evaluation method in object-based spatial audio. In: 23rd European Signal Processing Conference (EUSIPCO), Nice, France.

Text: LiuEtAl_EUSIPCO15_Preprint_ASourceSeparationEvaluationMethodinObjectBasedSpatialAudio.pdf - Submitted version (pre-print) (402kB)
Available under License : See the attached licence file.

PDF (licence): SRI_deposit_agreement.pdf (33kB)
Available under License : See the attached licence file.

Abstract

Representing a complex acoustic scene with audio objects is desirable but challenging in object-based spatial audio production and reproduction, especially when concurrent sound signals are present in the scene. Source separation (SS) provides a potentially useful and enabling tool for audio object extraction. These extracted objects are often remixed to reconstruct a sound field in the reproduction stage. A suitable SS method is expected to produce audio objects that ultimately deliver high-quality audio after remixing. The performance of these SS algorithms therefore needs to be evaluated in this context. Existing metrics for SS performance evaluation, however, do not take into account the essential sound field reconstruction process. To address this problem, we propose a new SS evaluation method which employs a remixing strategy similar to the panning law, and provides a framework to incorporate the conventional SS metrics. We have tested our proposed method on real-room recordings processed with four SS methods, including two state-of-the-art blind source separation (BSS) methods and two classic beamforming algorithms. The evaluation results based on three conventional SS metrics are analysed.
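The abstract does not give the exact remixing rule or name the three metrics, so the following is only an illustrative sketch of the general idea: pan the separated objects into a stereo mix, pan the reference sources the same way, and score the difference between the two mixes. It assumes a constant-power sine/cosine panning law and a plain signal-to-error ratio as a stand-in for a conventional SS metric. The names `pan_gains`, `remix` and `signal_to_error_db`, the test tones, the pan angles and the 10% leakage are all hypothetical and are not taken from the paper.

```python
import math


def pan_gains(theta):
    """Constant-power stereo gains for a pan angle theta in [0, pi/2].

    Assumed sine/cosine law: theta = 0 is hard left, pi/2 is hard right.
    """
    return math.cos(theta), math.sin(theta)


def remix(sources, angles):
    """Pan each mono source to stereo and sum the results into one mix."""
    n = len(sources[0])
    left = [0.0] * n
    right = [0.0] * n
    for signal, theta in zip(sources, angles):
        g_left, g_right = pan_gains(theta)
        for i, x in enumerate(signal):
            left[i] += g_left * x
            right[i] += g_right * x
    return left, right


def signal_to_error_db(reference, estimate):
    """Energy ratio (dB) between a reference mix and its error, over all channels."""
    sig = sum(r * r for channel in reference for r in channel)
    err = sum(
        (r - e) ** 2
        for ref_ch, est_ch in zip(reference, estimate)
        for r, e in zip(ref_ch, est_ch)
    )
    if err == 0:
        return math.inf
    return 10 * math.log10(sig / err)


# Two hypothetical reference sources: sine tones over 100 samples.
n = 100
s1 = [math.sin(2 * math.pi * 5 * i / n) for i in range(n)]
s2 = [math.sin(2 * math.pi * 13 * i / n) for i in range(n)]

# Hypothetical separation output: each estimate leaks 10% of the other source.
e1 = [a + 0.1 * b for a, b in zip(s1, s2)]
e2 = [b + 0.1 * a for a, b in zip(s1, s2)]

# Remix references and estimates at the same (assumed) object positions.
angles = [math.pi / 6, math.pi / 3]
reference_mix = remix([s1, s2], angles)
estimated_mix = remix([e1, e2], angles)

# Score the separation in the remixed domain rather than per source.
score = signal_to_error_db(reference_mix, estimated_mix)
print(f"signal-to-error ratio after remix: {score:.2f} dB")
```

Scoring the remixed signals rather than each separated source reflects the point made in the abstract: errors that cancel or mask one another once the objects are recombined should not be penalised in the same way as errors that remain audible in the reconstructed sound field.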

Item Type: Conference or Workshop Item (Conference Poster)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing
Authors :
Authors | Email | ORCID
Liu, Q | UNSPECIFIED | UNSPECIFIED
Wang, W | UNSPECIFIED | UNSPECIFIED
Jackson, PJB | UNSPECIFIED | UNSPECIFIED
Cox, TJ | UNSPECIFIED | UNSPECIFIED
Date : September 2015
Additional Information : (c) 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
Depositing User : Symplectic Elements
Date Deposited : 23 Dec 2015 11:57
Last Modified : 23 Dec 2015 11:57
URI: http://epubs.surrey.ac.uk/id/eprint/809390


© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800