• DocumentCode
    2908681
  • Title

    A Comparative Analysis of the Influence of Methods for Outliers Detection on the Performance of Data Driven Models

  • Author

    Di Bella, A. ; Fortuna, L. ; Graziani, Salvatore ; Napoli, G. ; Xibilia, M.G.

  • Author_Institution
    Univ. degli Studi di Catania, Catania
  • fYear
    2007
  • fDate
    1-3 May 2007
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    In this paper we describe, test, and compare the performance of a number of techniques used for outlier detection to improve modeling capabilities of soft sensors on the basis of the quality of available data. We analyze methods based on standard deviation of population, on residuals of a linear input-output regression, on the structure correlation of the data, on principal components and partial least squares (both linear and nonlinear) in multi dimensional space (2D, 3D, 4D), on Q and T2 statistics, on the distance of each observation from the mean of the data, and on the Mahalanobis distance. We apply techniques for outlier detection both on a fictitious model data and on real data acquired from a sulfur recovery unit of a refinery. We show that outlier removal almost always improves modeling capabilities of considered techniques.
  • Keywords
    industrial plants; measurement theory; process monitoring; sensors; statistical analysis; Mahalanobis distance; Q statistics; T2 statistics; data driven models; data structure correlation; linear input-output regression; multidimensional space; outlier detection; partial least squares; refinery sulfur recovery unit; soft sensors; Computer industry; Industrial control; Instrumentation and measurement; Least squares methods; Performance analysis; Refining; Sensor phenomena and characterization; Software measurement; Statistics; Testing; empirical nonlinear modeling; outlier detection; refinery; soft sensors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Instrumentation and Measurement Technology Conference Proceedings, 2007. IMTC 2007. IEEE
  • Conference_Location
    Warsaw
  • ISSN
    1091-5281
  • Print_ISBN
    1-4244-0588-2
  • Type

    conf

  • DOI
    10.1109/IMTC.2007.379222
  • Filename
    4258091