University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Hierarchical Learning for DNN-Based Acoustic Scene Classification

Xu, Y, Huang, Q, Wang, W and Plumbley, MD (2016) Hierarchical Learning for DNN-Based Acoustic Scene Classification In: DCASE2016 Workshop (Workshop on Detection and Classification of Acoustic Scenes and Events), 2016-09-03 - 2016-09-03, Budapest, Hungary.

[img]
Preview
Text
HIERARCHICAL LEARNING FOR DNN-BASED ACOUSTIC SCENE CLASSIFICATION.pdf - Version of Record
Available under License : See the attached licence file.

Download (514kB) | Preview
[img]
Preview
PDF (licence)
SRI_deposit_agreement.pdf
Available under License : See the attached licence file.

Download (33kB) | Preview

Abstract

In this paper, we present a deep neural network (DNN)-based acoustic scene classification framework. Two hierarchical learning methods are proposed to improve the DNN baseline performance by incorporating the hierarchical taxonomy information of environmental sounds. Firstly, the parameters of the DNN are initialized by the proposed hierarchical pre-training. Multi-level objective function is then adopted to add more constraint on the cross-entropy based loss function. A series of experiments were conducted on the Task1 of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2016 challenge. The final DNN-based system achieved a 22.9% relative improvement on average scene classification error as compared with the Gaussian Mixture Model (GMM)-based benchmark system across four standard folds.

Item Type: Conference or Workshop Item (Conference Paper)
Subjects : Electronic Engineering
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
AuthorsEmailORCID
Xu, YUNSPECIFIEDUNSPECIFIED
Huang, QUNSPECIFIEDUNSPECIFIED
Wang, WUNSPECIFIEDUNSPECIFIED
Plumbley, MDUNSPECIFIEDUNSPECIFIED
Date : 3 September 2016
Funders : Engineering and Physical Sciences Research (EPSRC)
Copyright Disclaimer : This work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
Contributors :
ContributionNameEmailORCID
UNSPECIFIEDVirtanen, TUNSPECIFIEDUNSPECIFIED
UNSPECIFIEDMesaros, AUNSPECIFIEDUNSPECIFIED
UNSPECIFIEDHeittola, TUNSPECIFIEDUNSPECIFIED
UNSPECIFIEDPlumbley, MDUNSPECIFIEDUNSPECIFIED
UNSPECIFIEDFoster, PUNSPECIFIEDUNSPECIFIED
UNSPECIFIEDBenetos, EUNSPECIFIEDUNSPECIFIED
UNSPECIFIEDLagrange, MUNSPECIFIEDUNSPECIFIED
Uncontrolled Keywords : Acoustic scene classification, Deep neural network, Hierarchical pre-training, Multi-level objective function
Related URLs :
Depositing User : Symplectic Elements
Date Deposited : 20 Sep 2016 12:06
Last Modified : 20 Sep 2016 12:06
URI: http://epubs.surrey.ac.uk/id/eprint/812241

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800