University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Multi-Train: A Semi-supervised Heterogeneous Ensemble Classifier

Gu, Shenkai and Jin, Yaochu (2017) Multi-Train: A Semi-supervised Heterogeneous Ensemble Classifier Neurocomputing, 249. pp. 202-211.

[img] Text
mt.pdf - Accepted version Manuscript
Restricted to Repository staff only until 4 April 2018.
Available under License : See the attached licence file.

Download (322kB)
[img]
Preview
Text (licence)
SRI_deposit_agreement.pdf
Available under License : See the attached licence file.

Download (33kB) | Preview

Abstract

Many real-world machine learning tasks have very limited labeled data but a large amount of unlabeled data. To take advantage of the unlabeled data for enhancing learning performance, several semi-supervised learning techniques have been developed. In this paper, we propose a novel semi-supervised ensemble learning algorithm, termed Multi-Train, which generates a number of heterogeneous classifiers that use different classification models and/or different features. During the training process, each classifier is refined using unlabeled data, which are labeled by the majority prediction of the rest classifiers. We hypothesize that the use of different models and different input features can promote the diversity of the ensemble, thereby improving the performance compared to existing methods such as the co-training and tri-training algorithms. Experimental results on the UCI datasets clearly demonstrated the effectiveness of using heterogeneous ensembles in semi-supervised learning.

Item Type: Article
Subjects : Computing Science
Divisions : Faculty of Engineering and Physical Sciences > Computing Science
Authors :
NameEmailORCID
Gu, Shenkaishenkai.gu@surrey.ac.ukUNSPECIFIED
Jin, YaochuYaochu.Jin@surrey.ac.ukUNSPECIFIED
Date : 4 April 2017
Identification Number : 10.1016/j.neucom.2017.03.063
Copyright Disclaimer : © 2017. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/
Uncontrolled Keywords : Unlabeled data, Classi cation, Heterogeneous ensembles, Semi-supervised learning, Tri-training, Multi-Train
Related URLs :
Depositing User : Symplectic Elements
Date Deposited : 05 Apr 2017 14:23
Last Modified : 08 Sep 2017 13:04
URI: http://epubs.surrey.ac.uk/id/eprint/813961

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800