University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Speaker authentication using video-based lip information

Goswami, B, Chan, C, Kittler, J and Christmas, W (2011) Speaker authentication using video-based lip information In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2011-05-22 - 2011-05-27, Prague, Czech Republic.

[img] Text
Restricted to Repository staff only
Available under License : See the attached licence file.

Download (171kB)
[img] Text (licence)
Restricted to Repository staff only

Download (33kB)


The lip-region can be interpreted as either a genetic or behavioural biometric trait depending on whether static or dynamic information is used. In this paper, we use a texture descriptor called Local Ordinal Contrast Pattern (LOCP) in conjunction with a novel spatiotemporal sampling method called Windowed Three Orthogonal Planes (WTOP) to represent both appearance and dynamics features observed in visual speech. This representation, with standard speaker verification engines, is shown to improve the performance of the lipbiometric trait compared to the state-of-the-art. The improvement obtained suggests that there is enough discriminative information in the mouth-region to enable its use as a primary biometric as opposed to a "soft" biometric trait.

Item Type: Conference or Workshop Item (UNSPECIFIED)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing
Authors :
Goswami, B
Chan, C
Kittler, J
Christmas, W
Date : 2011
DOI : 10.1109/ICASSP.2011.5946880
Depositing User : Symplectic Elements
Date Deposited : 28 Mar 2017 14:59
Last Modified : 31 Oct 2017 14:20

Actions (login required)

View Item View Item


Downloads per month over past year

Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800