Covariation and weighting of harmonically decomposed streams for ASR
Jackson, PJB, Moreno, DM, Russell, MJ and Hernando, J (2003) Covariation and weighting of harmonically decomposed streams for ASR In: 8th European Conference on Speech Communication and Technology. Eurospeech 2003, 2003-09-01 - 2003-09-04, Geneva.
Available under License : See the attached licence file.
Decomposition of speech signals into simultaneous streams of periodic and aperiodic information has been successfully applied to speech analysis, enhancement, modification and recently recognition. This paper examines the effect of different weightings of the two streams in a conventional HMM system in digit recognition tests on the Aurora 2.0 database. Comparison of the results from using matched weights during training showed a small improvement of approximately 10% relative to unmatched ones, under clean test conditions. Principal component analysis of the covariation amongst the periodic and aperiodic features indicated that only 45 (51) of the 78 coefficients were required to account for 99% of the variance, for clean (multi-condition) training, which yielded an 18.4% (10.3%) absolute increase in accuracy with respect to the baseline. These findings provide further evidence of the potential for harmonically-decomposed streams to improve performance and substantially to enhance recognition accuracy in noise.
|Item Type:||Conference or Workshop Item (Conference Paper)|
|Divisions :||Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing|
|Date :||1 September 2003|
|Additional Information :||Posted here by permission of ISCA|
|Depositing User :||Symplectic Elements|
|Date Deposited :||14 May 2013 17:13|
|Last Modified :||23 Sep 2013 20:07|
Actions (login required)
Downloads per month over past year