Push and pull: Iterative grouping of media
Gilbert, A and Bowden, R (2011) Push and pull: Iterative grouping of media BMVC 2011 - Proceedings of the British Machine Vision Conference 2011. 66.1-66.12.
|
Text
Gilbert_paper66.pdf - ["content_typename_Submitted version (pre-print)" not defined] Available under License : See the attached licence file. Download (430kB) | Preview |
|
|
Text (licence)
SRI_deposit_agreement.pdf Available under License : See the attached licence file. Download (33kB) | Preview |
Abstract
We present an approach to iteratively cluster images and video in an efficient and intuitive manor. While many techniques use the traditional approach of time consuming groundtruthing large amounts of data [10, 16, 20, 23], this is increasingly infeasible as dataset size and complexity increase. Furthermore it is not applicable to the home user, who wants to intuitively group his/her own media without labelling the content. Instead we propose a solution that allows the user to select media that semantically belongs to the same class and use machine learning to "pull" this and other related content together. We introduce an "image signature" descriptor and use min-Hash and greedy clustering to efficiently present the user with clusters of the dataset using multi-dimensional scaling. The image signatures of the dataset are then adjusted by APriori data mining identifying the common elements between a small subset of image signatures. This is able to both pull together true positive clusters and push apart false positive examples. The approach is tested on real videos harvested from the web using the state of the art YouTube dataset [18]. The accuracy of correct group label increases from 60.4% to 81.7% using 15 iterations of pulling and pushing the media around. While the process takes only 1 minute to compute the pair wise similarities of the image signatures and visualise the youtube whole dataset. © 2011. The copyright of this document resides with its authors.
Item Type: | Article |
---|---|
Divisions : | Faculty of Engineering and Physical Sciences > Electronic Engineering > Centre for Vision Speech and Signal Processing |
Authors : | Gilbert, A and Bowden, R |
Date : | 1 September 2011 |
DOI : | 10.5244/C25.66 |
Additional Information : | Copyright 2011. The copyright of this document resides with its authors. |
Depositing User : | Symplectic Elements |
Date Deposited : | 17 Nov 2015 19:01 |
Last Modified : | 06 Jul 2019 05:15 |
URI: | http://epubs.surrey.ac.uk/id/eprint/808986 |
Actions (login required)
![]() |
View Item |
Downloads
Downloads per month over past year