University of Surrey


Unsupervised Ensemble Learning and Its Application to Temporal Data Clustering

Yang, Y (2011) Unsupervised Ensemble Learning and Its Application to Temporal Data Clustering UNSPECIFIED thesis, University of Surrey.

Full text not available from this repository.


Temporal data clustering provides underpinning techniques for discovering intrinsic structures in temporal data and for condensing or summarizing the information they contain, needs that arise in fields ranging from time series analysis to the understanding of sequential data. According to how they treat data dependency, existing temporal data clustering algorithms can be classified into three categories: model-based, temporal-proximity and feature-based clustering. However, unlike static data, temporal data have many distinct characteristics, including high dimensionality, complex time dependency, and large volume, all of which make the clustering of temporal data more challenging than conventional static data clustering. A large number of recent studies have shown that unsupervised ensemble approaches improve clustering quality by combining multiple clustering solutions into a single consolidated clustering ensemble that has the best performance among the given clustering solutions. This thesis systematically reviews existing temporal clustering and unsupervised ensemble learning techniques and proposes three unsupervised ensemble learning approaches for temporal data clustering. The first approach is based on an ensemble of HMM k-models clusterings, combined with agglomerative clustering refinement, and addresses three problems found in most forms of model-based clustering: finding the intrinsic number of clusters, sensitivity to model initialization, and computational cost. Secondly, we propose a sampling-based clustering ensemble approach, namely the iteratively constructed clustering ensemble. This approach iteratively constructs multiple partitions on subsets of the input instances selected by a smart weighting scheme, combining the strengths of both boosting and bagging approaches whilst attempting to avoid their drawbacks.
Finally, we propose a weighted ensemble learning approach to temporal data clustering which combines partitions obtained from different representations of temporal data. As a result, this approach can capture the properties of temporal data and exploit the synergy created by reconciling the diverse partitions that arise from different representations. The proposed weighted function has an outstanding ability in automatic model selection and appropriate grouping for complex temporal data.
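
As an illustration of the general idea of combining several partitions into one consensus clustering, the following is a minimal sketch of a weighted co-association consensus. It is a generic, widely used technique and is not taken from the thesis: the function names `coassociation` and `consensus`, the use of average linkage, and the assumption that partition weights are supplied by the caller are all choices made for this example only.

```python
from itertools import combinations


def coassociation(partitions, weights=None):
    """Weighted co-association matrix (illustrative helper, not from the thesis).

    Entry [i][j] is the weighted fraction of partitions that place
    items i and j in the same cluster.
    """
    m = len(partitions)
    n = len(partitions[0])
    weights = [1.0] * m if weights is None else list(weights)
    total = float(sum(weights))
    co = [[0.0] * n for _ in range(n)]
    for labels, w in zip(partitions, weights):
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    co[i][j] += w / total
    return co


def consensus(partitions, n_clusters, weights=None):
    """Average-linkage agglomeration on the distance 1 - co-association."""
    co = coassociation(partitions, weights)
    n = len(co)
    clusters = [[i] for i in range(n)]

    def avg_dist(a, b):
        # Mean pairwise distance between the members of two clusters.
        return sum(1.0 - co[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Merge the closest pair of clusters (a < b, so deleting b is safe).
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: avg_dist(clusters[p[0]], clusters[p[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]

    labels = [0] * n
    for k, members in enumerate(clusters):
        for i in members:
            labels[i] = k
    return labels


# Three input partitions of six items; the cluster labels themselves are
# arbitrary, only the groupings matter.
partitions = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],
    [0, 0, 1, 1, 1, 1],
]
print(consensus(partitions, 2))                     # [0, 0, 0, 1, 1, 1]
print(consensus(partitions, 2, weights=[1, 1, 4]))  # [0, 0, 1, 1, 1, 1]
```

With equal weights the two agreeing partitions outvote the third; giving the third partition a larger weight moves item 2 into the other group. In a weighted ensemble the weights would typically come from some measure of partition quality, which this sketch leaves to the caller.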

Item Type: Thesis (UNSPECIFIED)
Divisions : Surrey research (other units)
Authors : Yang, Y
Date : 2011
Depositing User : Symplectic Elements
Date Deposited : 17 May 2017 12:28
Last Modified : 23 Jan 2020 11:03


© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800