University of Surrey

Test tubes in the lab Research in the ATI Dance Research

A joint detection-classification model for audio tagging of weakly labelled data

Kong, Qiuqiang, Xu, Yong, Wang, Wenwu and Plumbley, Mark (2017) A joint detection-classification model for audio tagging of weakly labelled data In: ICASSP 2017, The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing, 2017-03-05 - 2017-03-09, New Orleans, USA.

[img]
Preview
Text
A joint detection-classification model for audio tagging of weakly labelled data v0.6.pdf - Accepted version Manuscript
Available under License : See the attached licence file.

Download (358kB) | Preview
[img]
Preview
PDF (licence)
SRI_deposit_agreement.pdf
Available under License : See the attached licence file.

Download (33kB) | Preview

Abstract

Audio tagging aims to assign one or several tags to an audio clip. Most of the datasets are weakly labelled, which means only the tags of the clip are known, without knowing the occurrence time of the tags. The labeling of an audio clip is often based on the audio events in the clip and no event level label is provided to the user. Previous works have used the bag of frames model assume the tags occur all the time, which is not the case in practice. We propose a joint detection-classification (JDC) model to detect and classify the audio clip simultaneously. The JDC model has the ability to attend to informative and ignore uninformative sounds. Then only informative regions are used for classification. Experimental results on the “CHiME Home” dataset show that the JDC model reduces the equal error rate (EER) from 19.0% to 16.9%. More interestingly, the audio event detector is trained successfully without needing the event level label.

Item Type: Conference or Workshop Item (Conference Poster)
Subjects : Electronic Engineering
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
NameEmailORCID
Kong, Qiuqiangq.kong@surrey.ac.ukUNSPECIFIED
Xu, Yongyong.xu@surrey.ac.ukUNSPECIFIED
Wang, WenwuW.Wang@surrey.ac.ukUNSPECIFIED
Plumbley, Markm.plumbley@surrey.ac.ukUNSPECIFIED
Date : 2017
Copyright Disclaimer : © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Contributors :
ContributionNameEmailORCID
UNSPECIFIEDIEEE, UNSPECIFIEDUNSPECIFIED
Uncontrolled Keywords : audio tagging, weakly labelled data, joint detection-classification model, acoustic event detection
Depositing User : Symplectic Elements
Date Deposited : 16 Dec 2016 14:38
Last Modified : 18 Jul 2017 15:14
URI: http://epubs.surrey.ac.uk/id/eprint/813128

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800