University of Surrey


Breast Cancer Data Analytics With Missing Values: A study on Ethnic, Age and Income Groups

Tirunagari, S, Poh, N, Abdulrahman, H, Nemmour, N and Windridge, D (2015) Breast Cancer Data Analytics With Missing Values: A study on Ethnic, Age and Income Groups

Files
Text: 1503.03680v1.pdf - Author's Original (293kB)
Text (licence): SRI_deposit_agreement.pdf (33kB)
Text: 1503.03680v1.pdf - Author's Original (293kB), restricted to Repository staff only
Available under License : See the attached licence file.

Abstract

The analysis of breast cancer incidence in women, and of the relationship between ethnicity and survival rate, is an ongoing area of study in which the secondary data contain recorded missing values. In this paper, we study and report breast cancer survival rates by ethnicity, age and income group, using a dataset collected for 53593 patients in South East England between 1998 and 2003. We also predict the missing values for the ethnic groups in the dataset. The principal findings of our study suggest that: 1) women of white ethnicity in South East England have the highest survival rate when compared with women of black ethnicity, 2) high income groups have higher survival rates than lower income groups and 3) the age groups between 80 and 95 have a lower survival rate.

Item Type: Article
Subjects : Computer Science
Authors :
Authors | Email | ORCID
Tirunagari, S | UNSPECIFIED | UNSPECIFIED
Poh, N | UNSPECIFIED | UNSPECIFIED
Abdulrahman, H | UNSPECIFIED | UNSPECIFIED
Nemmour, N | UNSPECIFIED | UNSPECIFIED
Windridge, D | UNSPECIFIED | UNSPECIFIED
Date : 12 March 2015
Copyright Disclaimer : Copyright 2015 The Author(s). This is an arXiv version of the paper.
Uncontrolled Keywords : q-bio.QM
Related URLs :
Additional Information : The paper analyses a breast cancer dataset with missing values, in which the missing ethnicity values are imputed using a Naive Bayes classifier. The data were also analysed from a domain perspective, covering the effect of ethnicity, age and income on breast cancer survival.
Depositing User : Symplectic Elements
Date Deposited : 04 Nov 2016 14:39
Last Modified : 04 Nov 2016 14:39
URI: http://epubs.surrey.ac.uk/id/eprint/812307
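
Illustrative sketch : The record notes that missing ethnicity values are imputed with a Naive Bayes classifier. The following is a minimal sketch of that general idea, not the authors' implementation. The column names (age_group, income_group), the placeholder class labels "A" and "B", the Laplace smoothing constant alpha and the toy rows are all assumptions made for illustration; the rows are synthetic and are not taken from the study's dataset.

```python
from collections import Counter, defaultdict
import math


def fit_naive_bayes(rows, target, features):
    """Count class and per-feature value frequencies over rows whose target is known."""
    class_counts = Counter()
    value_counts = defaultdict(Counter)  # (feature, class) -> counts of feature values
    feature_values = defaultdict(set)    # feature -> values seen in labelled rows
    for row in rows:
        label = row[target]
        if label is None:
            continue  # rows with a missing target do not contribute to training
        class_counts[label] += 1
        for f in features:
            value_counts[(f, label)][row[f]] += 1
            feature_values[f].add(row[f])
    return class_counts, value_counts, feature_values


def predict(model, row, features, alpha=1.0):
    """Return the class with the highest log-posterior under the Naive Bayes assumption."""
    class_counts, value_counts, feature_values = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in class_counts.items():
        score = math.log(n / total)  # log prior
        for f in features:
            count = value_counts[(f, label)][row[f]]
            # Laplace-smoothed conditional probability (alpha is an assumed choice)
            score += math.log((count + alpha) / (n + alpha * len(feature_values[f])))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def impute(rows, target, features):
    """Return copies of rows with missing target values filled by the classifier."""
    model = fit_naive_bayes(rows, target, features)
    filled = []
    for row in rows:
        new_row = dict(row)
        if new_row[target] is None:
            new_row[target] = predict(model, row, features)
        filled.append(new_row)
    return filled


# Synthetic toy rows with hypothetical columns and placeholder labels.
rows = [
    {"age_group": "50-64", "income_group": "high", "ethnicity": "A"},
    {"age_group": "50-64", "income_group": "high", "ethnicity": "A"},
    {"age_group": "65-79", "income_group": "high", "ethnicity": "A"},
    {"age_group": "65-79", "income_group": "low", "ethnicity": "B"},
    {"age_group": "80-95", "income_group": "low", "ethnicity": "B"},
    {"age_group": "50-64", "income_group": "high", "ethnicity": None},
    {"age_group": "80-95", "income_group": "low", "ethnicity": None},
]

imputed = impute(rows, "ethnicity", ["age_group", "income_group"])
for row in imputed:
    print(row)
```

The classifier is trained only on rows where the target is present, and the original rows are left unmodified so that imputed and observed values can still be told apart afterwards.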


© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800