University of Surrey

Test tubes in the lab Research in the ATI Dance Research

JSpeech: a multi-lingual conversational speech corpus

Choobbasti, Ali Janalizadeh, Gholamian, Mohammad Erfan, Vaheb, Amir and Safavi, Saeid (2018) JSpeech: a multi-lingual conversational speech corpus In: 2018 IEEE Spoken Language Technology Workshop (SLT 2018), 18-21 Dec 2018, Athens, Greece.

[img]
Preview
Text
JSPEECH.pdf - Accepted version Manuscript

Download (544kB) | Preview

Abstract

Speech processing, automatic speech and speaker recognition are the major area of interests in the field of computational linguistics. Research and development of computer and human interaction, forensic technologies and dialogue systems have been the motivating factor behind this interest. In this paper, JSpeech is introduced, a multilingual corpus. This corpus contains 1332 hours of conversational speech from 47 different languages. This corpus can be used in a variety of studies, created from 106 public chat group the effect of language variability on the performance of speaker recognition systems and automatic language detection. To this end, we include speaker verification results obtained for this corpus using a state of the art method based on 3D convolutional neural network.

Item Type: Conference or Workshop Item (Conference Paper)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
NameEmailORCID
Choobbasti, Ali Janalizadeh
Gholamian, Mohammad Erfan
Vaheb, Amir
Safavi, Saeids.safavi@surrey.ac.uk
Date : 18 December 2018
Funders : European Union's Horizon 2020
Grant Title : AudioCommons
Copyright Disclaimer : © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Uncontrolled Keywords : Multilingual speech corpus; Speaker recognition; Convolutional neural network; Machine learning
Related URLs :
Depositing User : Clive Harris
Date Deposited : 04 Feb 2019 14:28
Last Modified : 04 Feb 2019 14:28
URI: http://epubs.surrey.ac.uk/id/eprint/850355

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800