University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Push and pull: Iterative grouping of media

Gilbert, A and Bowden, R (2011) Push and pull: Iterative grouping of media BMVC 2011 - Proceedings of the British Machine Vision Conference 2011. 66.1-66.12.

[img]
Preview
Text
Gilbert_paper66.pdf - ["content_typename_Submitted version (pre-print)" not defined]
Available under License : See the attached licence file.

Download (430kB) | Preview
[img]
Preview
PDF (licence)
SRI_deposit_agreement.pdf
Available under License : See the attached licence file.

Download (33kB) | Preview

Abstract

We present an approach to iteratively cluster images and video in an efficient and intuitive manor. While many techniques use the traditional approach of time consuming groundtruthing large amounts of data [10, 16, 20, 23], this is increasingly infeasible as dataset size and complexity increase. Furthermore it is not applicable to the home user, who wants to intuitively group his/her own media without labelling the content. Instead we propose a solution that allows the user to select media that semantically belongs to the same class and use machine learning to "pull" this and other related content together. We introduce an "image signature" descriptor and use min-Hash and greedy clustering to efficiently present the user with clusters of the dataset using multi-dimensional scaling. The image signatures of the dataset are then adjusted by APriori data mining identifying the common elements between a small subset of image signatures. This is able to both pull together true positive clusters and push apart false positive examples. The approach is tested on real videos harvested from the web using the state of the art YouTube dataset [18]. The accuracy of correct group label increases from 60.4% to 81.7% using 15 iterations of pulling and pushing the media around. While the process takes only 1 minute to compute the pair wise similarities of the image signatures and visualise the youtube whole dataset. © 2011. The copyright of this document resides with its authors.

Item Type: Article
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing
Authors :
AuthorsEmailORCID
Gilbert, AUNSPECIFIEDUNSPECIFIED
Bowden, RUNSPECIFIEDUNSPECIFIED
Date : 1 September 2011
Identification Number : 10.5244/C25.66
Additional Information : Copyright 2011. The copyright of this document resides with its authors.
Depositing User : Symplectic Elements
Date Deposited : 17 Nov 2015 19:01
Last Modified : 17 Nov 2015 19:01
URI: http://epubs.surrey.ac.uk/id/eprint/808986

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800