DocumentCode :
251837
Title :
Pre-processing Methods of Data Mining
Author :
Saleem, Asma ; Asif, Khadim Hussain ; Ali, Ahmad ; Awan, Shahid Mahmood ; Alghamdi, Mohammed A.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Eng. & Technol., Lahore, Pakistan
fYear :
2014
fDate :
8-11 Dec. 2014
Firstpage :
451
Lastpage :
456
Abstract :
Data generation, handling and its processing have emerged as the most reliable source of understanding and discovery of new facts, knowledge and products in the world of natural and material sciences. The emergence of the most efficient techniques in statistical or bioinformatics situations has therefore become a routine practice in research and industrial sectors. Under practical conditions, dealing with large datasets, it´s likely to have inconsistencies and anomalies of all kinds to prevent to know real outcomes for practical problems. For accurate data mining computer based techniques of data pre-processing offer solutions that help the data under processing to conform normal structures which in turn considerably improve the performance of machine learning algorithms. In this process, accurate determination of outliers, extreme values and filling up gaps poses formidable challenges. Multiple methodologies have therefore been developed to detect these deviated or inconsistent values called outliers. Different data pre-processing techniques discussed in this paper could offer most suitable solutions for handling missing values and outliers in all kinds of large datasets such as electric load and weather datasets.
Keywords :
bioinformatics; data handling; data mining; learning (artificial intelligence); statistical analysis; bioinformatics situation; data generation; data handling; data mining computer based technique; data processing; electric load; machine learning algorithm; multiple methodology; normal structure; statistical situation; weather dataset; Algorithm design and analysis; Computer science; Data mining; Databases; Educational institutions; Filling; Meteorology; data mining; data pre-processing; missing values; outliers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Utility and Cloud Computing (UCC), 2014 IEEE/ACM 7th International Conference on
Conference_Location :
London
Type :
conf
DOI :
10.1109/UCC.2014.57
Filename :
7027524
Link To Document :
بازگشت