University of Surrey


Deep neural network based audio source separation

Zermini, Alfredo, Yu, Yang, Xu, Yong, Plumbley, Mark and Wang, Wenwu (2016) Deep neural network based audio source separation. In: 11th IMA International Conference on Mathematics in Signal Processing, 2016-12-12 - 2016-12-14, IET Austin Court, Birmingham, UK.

Deep neural network based audio source separation.pdf - Accepted version Manuscript

Abstract

Audio source separation aims to extract individual sources from mixtures of multiple sound sources. Many techniques have been developed for this task, such as independent component analysis, computational auditory scene analysis, and non-negative matrix factorisation. A method based on Deep Neural Networks (DNNs) and time-frequency (T-F) masking has recently been developed for binaural audio source separation. In this method, the DNNs are used to predict the Direction Of Arrival (DOA) of the audio sources with respect to the listener, and the predicted DOA is then used to generate soft T-F masks for the recovery/estimation of the individual audio sources.
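
The masking step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a DNN has already produced, for every T-F unit, a probability distribution over a set of candidate DOAs, and it replaces that network's output with random stand-in values. The function names, the array shapes, the 19 candidate DOAs and the chosen source indices are all hypothetical.

```python
import numpy as np


def soft_masks_from_doa_posteriors(doa_posteriors, source_doa_indices):
    """Build one soft T-F mask per source from per-unit DOA probabilities.

    doa_posteriors: array (frames, bins, n_doas); each T-F unit sums to 1.
    source_doa_indices: index of the DOA class assumed for each source.
    Returns an array (n_sources, frames, bins) whose masks sum to 1
    across sources at every T-F unit.
    """
    selected = doa_posteriors[:, :, source_doa_indices]
    total = selected.sum(axis=-1, keepdims=True)
    masks = selected / np.maximum(total, 1e-12)  # guard against division by zero
    return np.moveaxis(masks, -1, 0)


def apply_masks(mixture_stft, masks):
    """Weight the complex mixture spectrogram by each soft mask."""
    return masks * mixture_stft[np.newaxis, :, :]


# Stand-in data (assumption): random values in place of real DNN outputs
# and of a real mixture spectrogram.
rng = np.random.default_rng(0)
frames, bins, n_doas = 50, 257, 19
logits = rng.normal(size=(frames, bins, n_doas))
doa_posteriors = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)
mixture_stft = (rng.normal(size=(frames, bins))
                + 1j * rng.normal(size=(frames, bins)))

# Two sources assumed to sit at DOA classes 3 and 12 (hypothetical).
masks = soft_masks_from_doa_posteriors(doa_posteriors, [3, 12])
estimates = apply_masks(mixture_stft, masks)
```

Because the masks are normalised across sources, the estimated source spectrograms add back up to the mixture; each would then be inverted to a time-domain signal with an inverse STFT.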

Item Type: Conference or Workshop Item (Conference Paper)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
Name | Email | ORCID
Zermini, Alfredo | alfredo.zermini@surrey.ac.uk | UNSPECIFIED
Yu, Yang | UNSPECIFIED | UNSPECIFIED
Xu, Yong | yong.xu@surrey.ac.uk | UNSPECIFIED
Plumbley, Mark | m.plumbley@surrey.ac.uk | UNSPECIFIED
Wang, Wenwu | W.Wang@surrey.ac.uk | UNSPECIFIED
Date : 14 December 2016
Copyright Disclaimer : © 2017 The Authors.
Depositing User : Clive Harris
Date Deposited : 11 Aug 2017 08:19
Last Modified : 11 Aug 2017 08:19
URI: http://epubs.surrey.ac.uk/id/eprint/841889
