University of Surrey

Test tubes in the lab Research in the ATI Dance Research

On the Ideal Ratio Mask as the Goal of Computational Auditory Scene Analysis

Hummersone, C, Stokes, T and Brookes, T (2014) On the Ideal Ratio Mask as the Goal of Computational Auditory Scene Analysis In: Blind Source Separation: Advances in Theory, Algorithms and Applications. Signals and Communication Technology (12). Springer, Berlin/Heidelberg, pp. 349-368. ISBN 3642550169

[img] Text
IRM_as_CASA_Goal.pdf
Restricted to Repository staff only
Available under License : See the attached licence file.

Download (1MB)
[img] PDF (licence)
SRI_deposit_agreement.pdf
Restricted to Repository staff only
Available under License : See the attached licence file.

Download (33kB)

Abstract

The ideal binary mask (IBM) is widely considered to be the benchmark for time–frequency-based sound source separation techniques such as computational auditory scene analysis (CASA). However, it is well known that binary masking introduces objectionable distortion, especially musical noise. This can make binary masking unsuitable for sound source separation applications where the output is auditioned. It has been suggested that soft masking reduces musical noise and leads to a higher quality output. A previously defined soft mask, the ideal ratio mask (IRM), is found to have similar properties to the IBM, may correspond more closely to auditory processes, and offers additional computational advantages. Consequently, the IRM is proposed as the goal of CASA. To further support this position, a number of studies are reviewed that show soft masks to provide superior performance to the IBM in applications such as automatic speech recognition and speech intelligibility. A brief empirical study provides additional evidence demonstrating the objective and perceptual superiority of the IRM over the IBM.

Item Type: Book Section
Authors :
AuthorsEmailORCID
Hummersone, CUNSPECIFIEDUNSPECIFIED
Stokes, TUNSPECIFIEDUNSPECIFIED
Brookes, TUNSPECIFIEDUNSPECIFIED
Date : 22 May 2014
Uncontrolled Keywords : TECHNOLOGY & ENGINEERING
Related URLs :
Depositing User : Symplectic Elements
Date Deposited : 28 Mar 2017 13:11
Last Modified : 28 Mar 2017 13:11
URI: http://epubs.surrey.ac.uk/id/eprint/805914

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800