Progressive sparse representation-based classification using local discrete cosine transform evaluation for image recognition
Song, Xiaoning, Feng, Zhenhua, Hu, Guosheng, Yang, Xibei, Qi, Yunsong and Yang, Jingyu (2015) Progressive sparse representation-based classification using local discrete cosine transform evaluation for image recognition Journal of Electronic Imaging, 24 (5), 053010.
Download (3MB) | Preview
This paper proposes a progressive sparse representation-based classification algorithm using local discrete cosine transform (DCT) evaluation to perform face recognition. Specifically, the sum of the contributions of all training samples of each subject is first taken as the contribution of this subject, then the redundant subject with the smallest contribution to the test sample is iteratively eliminated. Second, the progressive method aims at representing the test sample as a linear combination of all the remaining training samples, by which the representation capability of each training sample is exploited to determine the optimal “nearest neighbors” for the test sample. Third, the transformed DCT evaluation is constructed to measure the similarity between the test sample and each local training sample using cosine distance metrics in the DCT domain. The final goal of the proposed method is to determine an optimal weighted sum of nearest neighbors that are obtained under the local correlative degree evaluation, which is approximately equal to the test sample, and we can use this weighted linear combination to perform robust classification. Experimental results conducted on the ORL database of faces (created by the Olivetti Research Laboratory in Cambridge), the FERET face database (managed by the Defense Advanced Research Projects Agency and the National Institute of Standards and Technology), AR face database (created by Aleix Martinez and Robert Benavente in the Computer Vision Center at U.A.B), and USPS handwritten digit database (gathered at the Center of Excellence in Document Analysis and Recognition at SUNY Buffalo) demonstrate the effectiveness of the proposed method.
|Divisions :||Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing|
|Date :||18 August 2015|
|Identification Number :||10.1117/1.JEI.24.5.053010|
|Additional Information :||Copyright 2015 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited.|
|Depositing User :||Zhenhua Feng|
|Date Deposited :||05 Feb 2016 12:41|
|Last Modified :||05 Feb 2016 12:41|
Actions (login required)
Downloads per month over past year