Title :
Customer Information System Data Pre-Processing with Feature Selection Techniques for Non-Technical Losses Prediction in an Electricity Market
Author :
Nizar, Anisah Hanim ; Jun Hua Zhao ; Zhao Yang Dong
Author_Institution :
Sch. of Inf. Technol. & Electr. Eng., Univ. of Queensland, St. Lucia, QLD
Abstract :
Non-technical losses (NTL) identification and prediction are important tasks for many utilities. Data from customer information system (CIS) can be used for NTL analysis. However, in order to accurately and efficiently perform NTL analysis, the original data from CIS need to be pre-processed before any detailed NTL analysis can be carried out. In this paper, we propose a feature selection based method for CIS data pre-processing in order to extract the most relevant information for further analysis such as clustering and classifications. By removing irrelevant and redundant features, feature selection is an essential step in data mining process in finding optimal subset of features to improve the quality of result by giving faster time processing, higher accuracy and simpler results with fewer features. Detailed feature selection analysis is presented in the paper. Both time-domain and load shape data are compared based on the accuracy, consistency and statistical dependencies between features.
Keywords :
data mining; feature extraction; power engineering computing; power markets; customer information system data pre-processing; data mining process; electricity market; feature selection techniques; non-technical losses prediction; Classification tree analysis; Computational Intelligence Society; Data mining; Electricity supply industry; Information analysis; Information systems; Information technology; Performance analysis; Power systems; Propagation losses; Data Mining; Feature Selection; Load Profiling; Non-Technical Loss (NTL) analysis;
Conference_Titel :
Power System Technology, 2006. PowerCon 2006. International Conference on
Conference_Location :
Chongqing
Print_ISBN :
1-4244-0110-0
Electronic_ISBN :
1-4244-0111-9
DOI :
10.1109/ICPST.2006.321964