University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Efficient learning of local image descriptors.

Balntas, Vassileios (2016) Efficient learning of local image descriptors. Doctoral thesis, University of Surrey.

PhD-Thesis.pdf - Version of Record
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (4MB) | Preview


One of the most important tasks of modern computer vision with a vast amount of applications is finding correspondences between local patches extracted from different views of a physical scene. In this thesis, we investigate three main axes of this problem. We first provide a critical review of the prior work related to methods for extracting local image descriptors. Next, we show that the intrinsic visual characteristics of a patch may fundamentally alter its matching process, and we show how to exploit this phenomenon to improve the matching performance. One of the main contributions of this thesis is a novel approach to describing and matching image patches. We introduce a per-patch adapted method which makes it possible to generate feature descriptors that use simple binary tests, but match the performance of methods of significantly higher complexity. We also demonstrate that our technique can be successfully generalised to other descriptors, thus showing its potential for more general applications. We then propose novel methods to learn compact and efficient patch representations using convolutional neural networks. We show that typically used approaches such as architectural expansions or hard negative mining are not essential for the success of such methods. Our convolutional descriptors outperform the state of the art approaches at a significant fraction of the computational cost. Lastly, we demonstrate that most of the work in the area suffers from non-reproducibilty and inconsistency of evaluation results. To that end, we introduce a novel dataset accompanied with improved protocols and benchmarks that will allow for reproducible results. More importantly, the scale of our dataset allows for experimentation with learning local feature descriptors from real-world data, something that has not been feasible so far due to the lack of data. This will allow improved results and new experiments especially in the context of deep learning and convolutional neural networks.

Item Type: Thesis (Doctoral)
Subjects : Computer Vision
Divisions : Theses
Authors :
Date : 21 December 2016
Funders : Faculty of Engineering and Physical Sciences
Contributors :
ContributionNameEmailORCID,, Lilian
Related URLs :
Depositing User : Vassileios Balntas
Date Deposited : 05 Jan 2017 09:07
Last Modified : 31 Oct 2017 18:58

Actions (login required)

View Item View Item


Downloads per month over past year

Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800