• DocumentCode
    1402259
  • Title

    Statistical Feature Selection From Massive Data in Distribution Fault Diagnosis

  • Author

    Cai, Yixin ; Chow, Mo-Yuen ; Lu, Wenbin ; Li, Lexin

  • Author_Institution
    Dept. of Electr. & Comput. Eng., North Carolina State Univ., Raleigh, NC, USA
  • Volume
    25
  • Issue
    2
  • fYear
    2010
  • fDate
    5/1/2010 12:00:00 AM
  • Firstpage
    642
  • Lastpage
    648
  • Abstract
    Selecting proper features to identify the root cause is a critical step in distribution fault diagnosis. Power engineers usually select features based on experience. However, engineers cannot be familiar with every local system, especially in fast growing regions. With the advancing information technologies and more powerful sensors, utilities can collect much more data on their systems than before. The phenomenon will be even more substantial for the anticipating Smart Grid environments. To help power engineers select features based on the massive data collected, this paper reviews two popular feature selection methods: 1) hypothesis test, 2) stepwise regression, and introduces another two: 3) stepwise selection by Akaike´s Information Criterion, and 4) LASSO/ALASSO. These four methods are compared in terms of their model requirements, data assumptions, and computational cost. With real-world datasets from Progress Energy Carolinas, this paper also evaluates these methods and compares fault diagnosis performance by accuracy, probability of detection and false alarm ratio. This paper discusses the advantages and limitations of each method for distribution fault diagnosis as well.
  • Keywords
    fault diagnosis; power distribution; regression analysis; Akaike Information Criterion; LASSO-ALASSO; Progress Energy Carolinas; computational cost; data assumptions; distribution fault diagnosis; hypothesis test; local system; massive data; model requirements; power engineers; smart grid environments; statistical feature selection; stepwise regression; Akaike´s information criteria; LASSO; classification; fault cause identification; feature selection; hypothesis test; logistic regression; power distribution systems; smart grid; stepwise regression;
  • fLanguage
    English
  • Journal_Title
    Power Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0885-8950
  • Type

    jour

  • DOI
    10.1109/TPWRS.2009.2036924
  • Filename
    5405081