University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Recent developments in automated lip-reading

Bowden, R, Cox, S, Harvey, R, Lan, Y, Ong, E-J, Owen, G and Theobald, B-J (2013) Recent developments in automated lip-reading In: Conference on Optics and Photonics for Counterterrorism, Crime Fighting and Defence IX; and Optical Materials and Biomaterials in Security and Defence Systems Technology X, 2013-09-23 - 2013-09-25, Dresden, GERMANY.

[img]
Preview
Text
SPIE_2013.pdf - ["content_typename_Submitted version (pre-print)" not defined]
Available under License : See the attached licence file.

Download (9MB) | Preview
[img]
Preview
PDF (licence)
SRI_deposit_agreement.pdf
Available under License : See the attached licence file.

Download (33kB) | Preview

Abstract

Human lip-readers are increasingly being presented as useful in the gathering of forensic evidence but, like all humans, suffer from unreliability. Here we report the results of a long-term study in automatic lip-reading with the objective of converting video-to-text (V2T). The V2T problem is surprising in that some aspects that look tricky, such as real-time tracking of the lips on poor-quality interlaced video from hand-held cameras, but prove to be relatively tractable. Whereas the problem of speaker independent lip-reading is very demanding due to unpredictable variations between people. Here we review the problem of automatic lip-reading for crime fighting and identify the critical parts of the problem.

Item Type: Conference or Workshop Item (Conference Paper)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing
Authors :
AuthorsEmailORCID
Bowden, RUNSPECIFIEDUNSPECIFIED
Cox, SUNSPECIFIEDUNSPECIFIED
Harvey, RUNSPECIFIEDUNSPECIFIED
Lan, YUNSPECIFIEDUNSPECIFIED
Ong, E-JUNSPECIFIEDUNSPECIFIED
Owen, GUNSPECIFIEDUNSPECIFIED
Theobald, B-JUNSPECIFIEDUNSPECIFIED
Date : 1 January 2013
Identification Number : 10.1117/12.2029464
Contributors :
ContributionNameEmailORCID
EditorBurgess, DUNSPECIFIEDUNSPECIFIED
EditorOwen, GUNSPECIFIEDUNSPECIFIED
EditorZamboni, RUNSPECIFIEDUNSPECIFIED
EditorKajzar, FUNSPECIFIEDUNSPECIFIED
EditorSzep, AAUNSPECIFIEDUNSPECIFIED
PublisherSPIE-INT SOC OPTICAL ENGINEERING, UNSPECIFIEDUNSPECIFIED
Uncontrolled Keywords : Science & Technology, Technology, Physical Sciences, Materials Science, Biomaterials, Optics, Materials Science, Lip-reading, speech recognition, pattern recognition
Related URLs :
Additional Information : Copyright 2013 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited.
Depositing User : Symplectic Elements
Date Deposited : 17 Nov 2015 17:20
Last Modified : 17 Nov 2015 17:20
URI: http://epubs.surrey.ac.uk/id/eprint/808967

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800