University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Modelling for selecting vaccines against antigenically variable viruses.

Rahman, Tameera (2017) Modelling for selecting vaccines against antigenically variable viruses. Doctoral thesis, University of Surrey.

Text (PhD ethesis)
Thesis_Tameera Rahman.pdf - Version of Record
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (2MB) | Preview
[img] Text (Access restriction form)
2016_01_27_RestrictingAccessThesisForm.pdf - N/A or Unknown
Restricted to Repository staff only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (382kB)


In vitro and in vivo selection of vaccines is time consuming, expensive and the selected vaccines may not be able to provide protection against broad-spectrum viruses owing to the complexities of emerging antigenically novel disease strains. A powerful computational model that can effectively predict antigenically variant strains can minimise the amount of resources spent on exclusive serological testing of vaccines and make broad spectrum vaccines possible for many diseases. However, in silico vaccine prediction remains a grand challenge. To address this challenge, we investigate the use of linear regression, non-linear regression and support vector machine (SVM) classification models to predict the antigenic similarity between footand- mouth disease virus (FMDV) strains. The parameters of the linear regression model are estimated using the least squares method and the structure and parameters of the non-linear model are optimised using a hybrid evolutionary algorithm. We apply semi-supervised classification methods i.e. transductive SVM (TSVM) to improve our classification results due to the availability of limited labelled data. In addition, we examine two different scoring methods for weighting the type of amino acid substitutions in the classification and regression models in two different setups i.e. the entire external viral capsid protein or only antigenically important areas in the capsid proteins are considered. Statistical analysis of our data confirmed possible correlates of amino acid substitutions in antigenic areas in capsid proteins of FMDV and influenza. Across all our prediction models, we achieved the best results when the scoring method based on biochemical properties of amino acids is employed in combination with regression or classification and models based on substitutions in the antigenic areas performed better than those that took the entire exposed viral capsid protein. In our regression analysis, the non-linear regression method optimised with the evolutionary algorithm performed consistently better (throughout FMDV and influenza datasets) than the linear and non-linear models whose parameters are estimated using the least squares method. In addition, for the best models, optimised non-linear regression models consist of more terms than their linear counterparts, implying a non-linear nature of influences of amino acid substitutions. For our classification models we also used Ebola data. Our TSVM models outperformed our SVM models across all datasets i.e. FMDV, influenza and Ebola, which confirmed the benefits of using unlabelled data for boosting generalization performance. However, including additional antigenic areas in our Ebola TSVM model had no effect on the prediction ability of the model which we think is because the additional peptides were not biologically significant in terms of relaying any effect on the antigenic values which we use as our labels.

Item Type: Thesis (Doctoral)
Subjects : computer science, bioinformatics
Divisions : Theses
Authors :
Rahman, Tameera
Date : 28 February 2017
Funders : Faculty of Engineering and Physical Sciences, Faulty of Health and Medical Sciences
Contributors :
Depositing User : Tameera Rahman
Date Deposited : 09 Mar 2017 12:03
Last Modified : 09 Nov 2018 16:40

Actions (login required)

View Item View Item


Downloads per month over past year

Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800