University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Investigating Language Variability on the Performance of Speaker Verification Systems

Vaheb, Amir, Choobbasti, Ali Janalizadeh, Najafabadi, S. H. E. Mortazavi and Safavi, Saeid (2018) Investigating Language Variability on the Performance of Speaker Verification Systems In: 20th International Conference on Speech and Computer (SPECOM 2018), 18-22 Sep 2018, Leipzig, Germany.

Investigating Language Variability on the Performance of Speaker Verification Systems.pdf - Accepted version Manuscript

Download (403kB) | Preview


In recent years, speaker verification technologies have received an extensive amount of attention. Designing and developing machines that could communicate with humans are believed to be one of the primary motivations behind such developments. Speaker verification technologies are applied to numerous fields such as security, Biometrics, and forensics.

In this paper, the authors study the effects of different languages on the performance of the automatic speaker verification (ASV) system. The MirasVoice speech corpus (MVSC), a bilingual English and Farsi speech corpus, is used in this study. This study collects results from both an I-vector based ASV system and a GMM-UBM based ASV system. The experimental results show that a mismatch between the enrolled data used for training and verification data can lead to a significant decrease in the overall system efficiency. This study shows that it is best to use an i-vector based framework with data from the English language used in the enrollment phase to improve the robustness of the ASV systems. The achieved results in this study indicate that this can narrow the degradation gap caused by the language mismatch.

Item Type: Conference or Workshop Item (Conference Paper)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
Vaheb, Amir
Choobbasti, Ali Janalizadeh
Najafabadi, S. H. E. Mortazavi
Date : 25 August 2018
DOI : 10.1007/978-3-319-99579-3_73
Copyright Disclaimer : © Springer Nature Switzerland AG 2018 This is a post-peer-review, pre-copyedit version of an article published in Lecture Notes in Computer Science. The final authenticated version is available online at:
Uncontrolled Keywords : Speaker verification; Bilingual speech corpus; Gaussian mixture model; i-vector system
Depositing User : Clive Harris
Date Deposited : 04 Feb 2019 15:43
Last Modified : 04 Feb 2019 15:43

Actions (login required)

View Item View Item


Downloads per month over past year

Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800