University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Progressive sparse representation-based classification using local discrete cosine transform evaluation for image recognition

Song, Xiaoning, Feng, Zhenhua, Hu, Guosheng, Yang, Xibei, Qi, Yunsong and Yang, Jingyu (2015) Progressive sparse representation-based classification using local discrete cosine transform evaluation for image recognition Journal of Electronic Imaging, 24 (5), 053010.


Download (3MB) | Preview


This paper proposes a progressive sparse representation-based classification algorithm using local discrete cosine transform (DCT) evaluation to perform face recognition. Specifically, the sum of the contributions of all training samples of each subject is first taken as the contribution of this subject, then the redundant subject with the smallest contribution to the test sample is iteratively eliminated. Second, the progressive method aims at representing the test sample as a linear combination of all the remaining training samples, by which the representation capability of each training sample is exploited to determine the optimal “nearest neighbors” for the test sample. Third, the transformed DCT evaluation is constructed to measure the similarity between the test sample and each local training sample using cosine distance metrics in the DCT domain. The final goal of the proposed method is to determine an optimal weighted sum of nearest neighbors that are obtained under the local correlative degree evaluation, which is approximately equal to the test sample, and we can use this weighted linear combination to perform robust classification. Experimental results conducted on the ORL database of faces (created by the Olivetti Research Laboratory in Cambridge), the FERET face database (managed by the Defense Advanced Research Projects Agency and the National Institute of Standards and Technology), AR face database (created by Aleix Martinez and Robert Benavente in the Computer Vision Center at U.A.B), and USPS handwritten digit database (gathered at the Center of Excellence in Document Analysis and Recognition at SUNY Buffalo) demonstrate the effectiveness of the proposed method.

Item Type: Article
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing
Authors :
Song, Xiaoningxnsong@hotmail.comUNSPECIFIED
Hu, Guoshenghuguosheng100@gmail.comUNSPECIFIED
Date : 18 August 2015
Identification Number :
Additional Information : Copyright 2015 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited.
Depositing User : Zhenhua Feng
Date Deposited : 05 Feb 2016 12:41
Last Modified : 05 Feb 2016 12:41

Actions (login required)

View Item View Item


Downloads per month over past year

Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800