University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Recent developments in automated lip-reading

Bowden, R, Cox, S, Harvey, R, Lan, Y, Ong, E-J, Owen, G and Theobald, B-J (2013) Recent developments in automated lip-reading In: Conference on Optics and Photonics for Counterterrorism, Crime Fighting and Defence IX; and Optical Materials and Biomaterials in Security and Defence Systems Technology X, 2013-09-23 - 2013-09-25, Dresden, GERMANY.

[img]
Preview
Text
SPIE_2013.pdf - ["content_typename_Submitted version (pre-print)" not defined]
Available under License : See the attached licence file.

Download (9MB) | Preview
[img]
Preview
Text (licence)
SRI_deposit_agreement.pdf
Available under License : See the attached licence file.

Download (33kB) | Preview

Abstract

Human lip-readers are increasingly being presented as useful in the gathering of forensic evidence but, like all humans, suffer from unreliability. Here we report the results of a long-term study in automatic lip-reading with the objective of converting video-to-text (V2T). The V2T problem is surprising in that some aspects that look tricky, such as real-time tracking of the lips on poor-quality interlaced video from hand-held cameras, but prove to be relatively tractable. Whereas the problem of speaker independent lip-reading is very demanding due to unpredictable variations between people. Here we review the problem of automatic lip-reading for crime fighting and identify the critical parts of the problem.

Item Type: Conference or Workshop Item (Conference Paper)
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing
Authors :
NameEmailORCID
Bowden, R
Cox, S
Harvey, R
Lan, Y
Ong, E-J
Owen, G
Theobald, B-J
Date : 1 January 2013
Identification Number : 10.1117/12.2029464
Contributors :
ContributionNameEmailORCID
http://www.loc.gov/loc.terms/relators/EDTBurgess, D
http://www.loc.gov/loc.terms/relators/EDTOwen, G
http://www.loc.gov/loc.terms/relators/EDTZamboni, R
http://www.loc.gov/loc.terms/relators/EDTKajzar, F
http://www.loc.gov/loc.terms/relators/EDTSzep, AA
http://www.loc.gov/loc.terms/relators/PBLSPIE-INT SOC OPTICAL ENGINEERING,
Uncontrolled Keywords : Science & Technology, Technology, Physical Sciences, Materials Science, Biomaterials, Optics, Materials Science, Lip-reading, speech recognition, pattern recognition
Related URLs :
Additional Information : Copyright 2013 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited.
Depositing User : Symplectic Elements
Date Deposited : 17 Nov 2015 17:20
Last Modified : 31 Oct 2017 17:46
URI: http://epubs.surrey.ac.uk/id/eprint/808967

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800