University of Surrey


Perceptual quality of audio separated using sigmoidal masks

Stokes, T, Hummersone, C, Brookes, T and Mason, A (2014) Perceptual quality of audio separated using sigmoidal masks. In: 137th Audio Engineering Society Convention 2014, 2014/10/09 - 2014/10/12, Los Angeles, USA.


Abstract

Separation of underdetermined audio mixtures is often performed in the Time-Frequency (TF) domain by masking each TF element according to its target-to-mixture ratio. This work uses sigmoidal functions to map the target-to-mixture ratio to mask values. The series of functions used encompasses the ratio mask and an approximation of the binary mask. Mixtures are chosen to represent a range of different amounts of TF overlap, then separated and evaluated using objective measures. PEASS results show improved interferer suppression and artifact scores can be achieved using softer masking than that applied by binary or ratio masks. The improvement in these scores gives an improved overall perceptual score; this observation is repeated at multiple TF resolutions.
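
As a rough illustration of the mapping described in the abstract, the sketch below turns a target-to-mixture ratio into a mask value using a logistic function. The parametrisation is an assumption made for illustration and is not taken from the paper: the names `sigmoidal_mask` and `mask_grid`, the `slope` parameter, the definition of the ratio as target / (target + interferer), and the choice to apply the logistic function to the log-odds of that ratio are all hypothetical. Under these assumptions, a slope of 1 reproduces the ratio mask, a large slope approaches a binary mask with its threshold at a ratio of 0.5, and a slope below 1 gives masking softer than the ratio mask.

```python
import math


def sigmoidal_mask(ratio: float, slope: float) -> float:
    """Map a target-to-mixture ratio in [0, 1] to a mask value in [0, 1].

    Hypothetical parametrisation: logistic function of slope * logit(ratio).
    """
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must lie in [0, 1]")
    if slope <= 0.0:
        raise ValueError("slope must be positive")
    # The end points map to themselves for every slope.
    if ratio == 0.0:
        return 0.0
    if ratio == 1.0:
        return 1.0
    # Log-odds of the ratio, scaled by the slope.
    x = slope * (math.log(ratio) - math.log(1.0 - ratio))
    # Numerically stable logistic function.
    if x >= 0.0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def mask_grid(target, interferer, slope):
    """Compute a mask for each TF element from toy target/interferer values.

    `target` and `interferer` are equally shaped lists of lists holding
    non-negative values (assumed here to be per-element energies).
    """
    grid = []
    for t_row, i_row in zip(target, interferer):
        row = []
        for t, i in zip(t_row, i_row):
            total = t + i
            ratio = t / total if total > 0.0 else 0.0
            row.append(sigmoidal_mask(ratio, slope))
        grid.append(row)
    return grid
```

With this sketch, `sigmoidal_mask(0.8, 1.0)` returns the ratio itself (the ratio mask), `sigmoidal_mask(0.8, 0.5)` returns about 0.667 (a softer mask), and `sigmoidal_mask(0.8, 50.0)` is very close to 1 (near-binary behaviour). Multiplying each mixture TF element by its mask value would then give the separated estimate.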

Item Type: Conference or Workshop Item (Conference Paper)
Subjects : Media
Divisions : Faculty of Arts and Social Sciences > School of Arts > Sound Recording
Authors :
Authors | Email | ORCID
Stokes, T | UNSPECIFIED | UNSPECIFIED
Hummersone, C | UNSPECIFIED | UNSPECIFIED
Brookes, T | UNSPECIFIED | UNSPECIFIED
Mason, A | UNSPECIFIED | UNSPECIFIED
Date : 9 October 2014
Copyright Disclaimer : © 2014 Audio Engineering Society. This paper was peer-reviewed as a complete manuscript for presentation at this Convention. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA; also see www.aes.org. All rights reserved. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering Society.
Depositing User : Symplectic Elements
Date Deposited : 18 Aug 2016 15:19
Last Modified : 18 Aug 2016 15:19
URI: http://epubs.surrey.ac.uk/id/eprint/808894

