University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Pitch synchronous speech coding techniques.

Sturt, Christian. (2003) Pitch synchronous speech coding techniques. Doctoral thesis, University of Surrey (United Kingdom)..

Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (50MB) | Preview


Efficient source coding techniques are necessary to make optimal use of the limited bandwidth available in mobile phone networks. Most current mobile telephone communication systems compress the speech waveform by using speech coders based on the Code Excited Linear Prediction (CELP) model. Such coders give high quality speech at bit rates of 8 kbps and above. Below 8 kbps, the quality of the coded speech degrades rapidly. At rates of 6 kbps and below, parametric speech coders offer better speech quality. These coders reduce the required bit rate by transmitting certain characteristics of the speech waveform to the decoder, rather than attempting to code the waveform itself. The disadvantage of parametric coders is that the maximum achievable quality is limited by assumptions made during the coding of the speech signal. The aim of the research presented is to investigate and eliminate the factors that limit the speech quality of parametric coders. A new pitch synchronous coding model is proposed that operates on individual pitch cycle waveforms of speech rather than longer, fixed length frames as used in classic techniques. In order to implement a pitch synchronous coder, new pitch cycle detection algorithms have been proposed. Pitch synchronous parameter analysis was investigated and several new techniques have been developed. A novel pitch synchronous split-band voicing estimator has been proposed that utilises only the phase of the speech harmonics rather than the periodicity used in traditional techniques. Fixed rate quantisation of pitch synchronous speech parameters has been investigated and a joint quantisation/interpolation scheme has been proposed. This scheme has been applied to the quantisation of the pitch synchronous parameters and has been shown to outperform traditional quantisation techniques. A comparison of a reference parametric coder with its pitch synchronous counterpart has shown that the pitch synchronous paradigm eliminates some of the main factors that limit the speech quality in parametric coders. It is expected that this will lead to the development of speech coders that can produce speech of higher quality than current parametric coders operating at the same bit rate. Key words: Speech Coding, Pitch Synchronous, Sinusoidal Coding, Split-Band LPC Coding.

Item Type: Thesis (Doctoral)
Divisions : Theses
Authors :
Date : 2003
Contributors :
Depositing User : EPrints Services
Date Deposited : 09 Nov 2017 12:14
Last Modified : 15 Mar 2018 21:56

Actions (login required)

View Item View Item


Downloads per month over past year

Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800