University of Surrey


Improving the perceptual quality of ideal binary masked speech

Lightburn, L, De Sena, Enzo, Moore, A, Naylor, PA and Brookes, M (2017) Improving the perceptual quality of ideal binary masked speech. In: ICASSP 2017 - 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017-03-05 - 2017-03-09, New Orleans, USA.

Text: Paper_v29.pdf - Accepted version Manuscript (199kB)
Restricted to Repository staff only
Available under License : See the attached licence file.

Text (licence): SRI_deposit_agreement.pdf (33kB)
Available under License : See the attached licence file.

Abstract

It is known that applying a time-frequency binary mask to very noisy speech can improve its intelligibility but results in poor perceptual quality. In this paper we propose a new approach to applying a binary mask that combines the intelligibility gains of conventional binary masking with the perceptual quality gains of a classical speech enhancer. The binary mask is not applied directly as a time-frequency gain as in most previous studies. Instead, the mask is used to supply prior information to a classical speech enhancer about the probability of speech presence in different time-frequency regions. Using an oracle ideal binary mask, we show that the proposed method results in a higher predicted quality than other methods of applying a binary mask whilst preserving the improvements in predicted intelligibility.
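The contrast drawn in the abstract, between applying the mask directly as a gain and using it as prior information on speech presence, can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract does not specify the enhancer, so the code below assumes a generic soft-decision gain (posterior speech presence probability under a complex Gaussian model, multiplied by a Wiener gain). The function names and the parameters `lc_db`, `xi_h1_db`, `p_hi` and `p_lo` are hypothetical choices made for illustration only.

```python
import math


def ideal_binary_mask(speech_pow, noise_pow, lc_db=0.0):
    """Oracle mask: 1 where the local SNR exceeds lc_db, else 0.

    speech_pow and noise_pow are lists of frames, each a list of
    per-frequency power values (assumed known, hence "oracle").
    """
    threshold = 10.0 ** (lc_db / 10.0)
    return [
        [1 if s > threshold * n else 0 for s, n in zip(s_row, n_row)]
        for s_row, n_row in zip(speech_pow, noise_pow)
    ]


def direct_mask_gain(mask):
    """Conventional use: the mask itself is the time-frequency gain."""
    return [[float(m) for m in row] for row in mask]


def spp_prior_gain(mask, noisy_pow, noise_pow,
                   xi_h1_db=15.0, p_hi=0.9, p_lo=0.1):
    """Illustrative alternative: the mask only sets the PRIOR probability
    of speech presence (p_hi where mask is 1, p_lo where it is 0).

    The posterior speech presence probability is computed under a
    complex Gaussian model with a fixed a priori SNR xi_h1 (an assumed
    value), and the gain is posterior * Wiener gain.
    """
    xi = 10.0 ** (xi_h1_db / 10.0)
    wiener = xi / (1.0 + xi)
    gains = []
    for m_row, y_row, n_row in zip(mask, noisy_pow, noise_pow):
        g_row = []
        for m, y, n in zip(m_row, y_row, n_row):
            prior = p_hi if m else p_lo
            gamma = y / n  # a posteriori SNR of the noisy cell
            ratio = (1.0 - prior) / prior
            posterior = 1.0 / (1.0 + ratio * (1.0 + xi)
                               * math.exp(-gamma * wiener))
            g_row.append(posterior * wiener)
        gains.append(g_row)
    return gains


# Toy example: one frame, two frequency bins (made-up power values).
speech = [[4.0, 0.1]]
noise = [[1.0, 1.0]]
noisy = [[s + n for s, n in zip(speech[0], noise[0])]]

mask = ideal_binary_mask(speech, noise)
hard = direct_mask_gain(mask)
soft = spp_prior_gain(mask, noisy, noise)
```

In this sketch the direct approach yields gains of exactly 0 or 1, whereas the prior-based approach yields gains strictly between 0 and 1 that still depend on the observed noisy power in each cell.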

Item Type: Conference or Workshop Item (Conference Paper)
Subjects : Music and Media
Divisions : Faculty of Arts and Social Sciences > School of Arts > Music
Authors :
Name | Email | ORCID
Lightburn, L | UNSPECIFIED | UNSPECIFIED
De Sena, Enzo | e.desena@surrey.ac.uk | UNSPECIFIED
Moore, A | UNSPECIFIED | UNSPECIFIED
Naylor, PA | UNSPECIFIED | UNSPECIFIED
Brookes, M | UNSPECIFIED | UNSPECIFIED
Date : 19 June 2017
Identification Number : 10.1109/ICASSP.2017.7952238
Copyright Disclaimer : (c) 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
Contributors :
Contribution | Name | Email | ORCID
UNSPECIFIED | IEEE | UNSPECIFIED | UNSPECIFIED
Uncontrolled Keywords : Binary mask, speech quality, speech intelligibility, speech enhancement, speech presence probability
Related URLs :
Depositing User : Symplectic Elements
Date Deposited : 18 Jan 2017 10:22
Last Modified : 24 Aug 2017 13:15
URI: http://epubs.surrey.ac.uk/id/eprint/813318

