Title :
Data preparation for pre-processing on oral cancer dataset
Author :
Mohd, Farahwahida ; Abu Bakar, Zainab ; Noor, Noor Maizura Mohamad ; Rajion, Z.A.
Author_Institution :
Comput. Sci. Dept., Univ. Malaysia Terengganu (UMT), Kuala Terengganu, Malaysia
Abstract :
This paper presents data pre-processing tasks covering data interpretation, data integration, noisy data, missing data, and data inconsistency. The prepared dataset includes all the fields required for the research on oral cancer diagnosis: demographics, social habits, clinical symptoms, and histological variables. After data normalization and transformation, the study produced an oral cancer dataset with 27 attributes as part of its contribution. Only two variables, case_id and age, are continuous or numerical; the remaining variables are discrete or categorical.
Keywords :
cancer; data integration; demography; medical diagnostic computing; clinical symptoms; data inconsistency; data interpretation; data normalization; data preparation; data transformation; demographics; histological variables; missing data; noisy data; oral cancer dataset preprocessing; oral cancer diagnosis; social habit; Alcoholic beverages; Mouth; Pain; Polynomials; Tongue; Welding; clinical dataset; normalization; oral cancer; preprocessing
Conference_Title :
Control, Automation and Systems (ICCAS), 2013 13th International Conference on
Conference_Location :
Gwangju
Print_ISBN :
978-89-93215-05-2
DOI :
10.1109/ICCAS.2013.6703916