University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Sound event detection with sequentially labelled data based on Connectionist temporal classification and unsupervised clustering

Hou, Yuanbo, Kong, Qiuqiang, Li, Shengchen and Plumbley, Mark D. (2019) Sound event detection with sequentially labelled data based on Connectionist temporal classification and unsupervised clustering In: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), 12-17 May 2019, Brighton, UK.

[img]
Preview
Text
Sound event detection - AAM.pdf - Accepted version Manuscript

Download (414kB) | Preview

Abstract

Sound event detection (SED) methods typically rely on either strongly labelled data or weakly labelled data. As an alternative, sequentially labelled data (SLD) was proposed. In SLD, the events and the order of events in audio clips are known, without knowing the occurrence time of events. This paper proposes a connectionist temporal classification (CTC) based SED system that uses SLD instead of strongly labelled data, with a novel unsupervised clustering stage. Experiments on 41 classes of sound events show that the proposed two-stage method trained on SLD achieves performance comparable to the previous state-of-the-art SED system trained on strongly labelled data, and is far better than another state-of-the-art SED system trained on weakly labelled data, which indicates the effectiveness of the proposed two-stage method trained on SLD without any onset/offset time of sound events.

Item Type: Conference or Workshop Item (Conference Paper)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
NameEmailORCID
Hou, Yuanbo
Kong, Qiuqiangq.kong@surrey.ac.uk
Li, Shengchen
Plumbley, Mark D.m.plumbley@surrey.ac.uk
Date : 2019
Funders : Engineering and Physical Sciences Research Council (EPSRC)
Grant Title : Making Sense of Sounds
Copyright Disclaimer : © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Uncontrolled Keywords : Sound event detection; Sequentially labelled data; Convolutional recurrent neural network; Connectionist temporal classification; Unsupervised clustering
Related URLs :
Depositing User : Clive Harris
Date Deposited : 20 Mar 2019 08:54
Last Modified : 20 Mar 2019 10:23
URI: http://epubs.surrey.ac.uk/id/eprint/850807

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800