University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Perceptual evaluation of blind source separation in object-based audio production

Coleman, Philip, Liu, Qingju, Francombe, Jon and Jackson, Philip (2018) Perceptual evaluation of blind source separation in object-based audio production In: 14th International Conference on Latent Variable Analysis and Signal Separation, 02-06 Jul 2018, University of Surrey, Guildford, UK.

Perceptual evaluation of blind source separation in object-based audio production.pdf - Accepted version Manuscript

Download (244kB) | Preview


Object-based audio has the potential to enable multime- dia content to be tailored to individual listeners and their reproduc- tion equipment. In general, object-based production assumes that the objects|the assets comprising the scene|are free of noise and inter- ference. However, there are many applications in which signal separa- tion could be useful to an object-based audio work ow, e.g., extracting individual objects from channel-based recordings or legacy content, or recording a sound scene with a single microphone array. This paper de- scribes the application and evaluation of blind source separation (BSS) for sound recording in a hybrid channel-based and object-based workflow, in which BSS-estimated objects are mixed with the original stereo recording. A subjective experiment was conducted using simultaneously spoken speech recorded with omnidirectional microphones in a rever- berant room. Listeners mixed a BSS-extracted speech object into the scene to make the quieter talker clearer, while retaining acceptable au- dio quality, compared to the raw stereo recording. Objective evaluations show that the relative short-term objective intelligibility and speech qual- ity scores increase using BSS. Further objective evaluations are used to discuss the in uence of the BSS method on the remixing scenario; the scenario shown by human listeners to be useful in object-based audio is shown to be a worse-case scenario.

Item Type: Conference or Workshop Item (Conference Paper)
Divisions : Faculty of Arts and Social Sciences > Department of Music and Media
Authors :
Date : 6 June 2018
Funders : Engineering and Physical Sciences Research Council (EPSRC)
DOI : 10.1007/978-3-319-93764-9_51
Grant Title : EPSRC Programme Grant S3A: Future Spatial Audio for an Immersive Listener Experience at Home
Copyright Disclaimer : © 2018 Springer Verlag
Related URLs :
Additional Information : Part of the Lecture Notes in Computer Science book series (LNCS, volume 10891)
Depositing User : Clive Harris
Date Deposited : 05 Apr 2018 13:37
Last Modified : 05 Mar 2019 10:13

Actions (login required)

View Item View Item


Downloads per month over past year

Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800