University of Surrey


Multi-level attention model for weakly supervised audio classification

Yu, Changsong, Barsim, Karim Said, Kong, Qiuqiang and Yang, Bin (2018) Multi-level attention model for weakly supervised audio classification. In: DCASE2018 Workshop on Detection and Classification of Acoustic Scenes and Events, 19-20 November 2018, Surrey, UK.

MULTI-LEVEL ATTENTION MODEL FOR WEAKLY SUPERVISED AUDIO CLASSIFICATION (1).pdf - Accepted version Manuscript


Abstract

In this paper, we propose a multi-level attention model for the weakly labelled audio classification problem. The objective of audio classification is to predict the presence or absence of sound events in an audio clip. Recently, Google published AudioSet, a large-scale weakly labelled dataset containing 2 million audio clips annotated only with the presence or absence of sound events, without their onset and offset times. Previously proposed attention models applied only a single attention module on the last layer of a neural network, which limited the capacity of the attention model. We propose a multi-level attention model that consists of multiple attention modules applied on the intermediate neural network layers. The outputs of these attention modules are concatenated into a vector, which is followed by a fully connected layer to obtain the final prediction for each class. Experiments show that the proposed multi-level attention model achieves a state-of-the-art mean average precision (mAP) of 0.360, outperforming the single-attention model (mAP 0.327) and the Google baseline system (mAP 0.314).
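
The architecture described in the abstract can be illustrated with a minimal NumPy sketch of the forward pass. This is not the authors' implementation. The abstract does not specify the form of the attention module, so the sketch assumes a common weakly supervised pooling: per-frame sigmoid class scores averaged over time with softmax-normalised weights. All names (`attention_pool`, `multi_level_attention`, `init_params`), the layer sizes, the ReLU hidden layers, and the random weights are hypothetical and chosen only for illustration.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def softmax(x, axis):
    # Subtract the maximum for numerical stability before exponentiating.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)


def attention_pool(h, W_cls, W_att):
    """Pool frame-level features h of shape (T, D) into a clip-level vector (K,).

    Assumed attention form: each frame gives a class score and an attention
    weight; the weights are normalised over the time axis.
    """
    cls = sigmoid(h @ W_cls)          # (T, K) per-frame class scores
    att = softmax(h @ W_att, axis=0)  # (T, K) weights summing to 1 over time
    return (att * cls).sum(axis=0)    # (K,) weighted average over frames


def init_params(rng, in_dim, hidden, n_classes, n_layers, attend_at):
    """Random weights for illustration; attend_at holds the indices of the
    hidden layers that receive an attention module."""
    layers, d = [], in_dim
    for i in range(n_layers):
        layer = {"W": rng.normal(0, 0.1, (d, hidden)), "b": np.zeros(hidden)}
        if i in attend_at:
            layer["W_cls"] = rng.normal(0, 0.1, (hidden, n_classes))
            layer["W_att"] = rng.normal(0, 0.1, (hidden, n_classes))
        layers.append(layer)
        d = hidden
    n_att = len(attend_at)
    return {
        "layers": layers,
        "W_out": rng.normal(0, 0.1, (n_att * n_classes, n_classes)),
        "b_out": np.zeros(n_classes),
    }


def multi_level_attention(x, params):
    """x: (T, in_dim) frame-level features of one clip -> (K,) class scores."""
    h, pooled = x, []
    for layer in params["layers"]:
        h = np.maximum(0.0, h @ layer["W"] + layer["b"])  # ReLU hidden layer
        if "W_cls" in layer:
            # Attention module applied on this intermediate layer.
            pooled.append(attention_pool(h, layer["W_cls"], layer["W_att"]))
    u = np.concatenate(pooled)  # concatenate the outputs of all modules
    # Fully connected layer giving the final prediction for each class.
    return sigmoid(u @ params["W_out"] + params["b_out"])


rng = np.random.default_rng(0)
params = init_params(rng, in_dim=128, hidden=64, n_classes=5,
                     n_layers=3, attend_at={1, 2})
clip = rng.normal(size=(10, 128))  # 10 frames of hypothetical features
scores = multi_level_attention(clip, params)
print(scores.shape)
```

Because each attention module averages over the time axis, the sketch accepts clips with any number of frames and needs only clip-level labels for training, which is the weakly labelled setting the abstract describes. Setting `attend_at` to a single final layer gives a single-attention variant of the same sketch.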

Item Type: Conference or Workshop Item (Conference Paper)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
Yu, Changsong
Barsim, Karim Said
Kong, Qiuqiang (q.kong@surrey.ac.uk)
Yang, Bin
Date : 2018
Uncontrolled Keywords : AudioSet, audio classification, attention model
Depositing User : Melanie Hughes
Date Deposited : 10 Oct 2018 07:48
Last Modified : 19 Nov 2018 02:08
URI: http://epubs.surrey.ac.uk/id/eprint/849626
