• DocumentCode
    141676
  • Title

    Extending statistical data quality improvement with explicit domain models

  • Author

    Solomakhina, Nina ; Hubauer, Thomas ; Lamparter, Steffen ; Roshchin, Mikhail ; Grimm, Stephan

  • Author_Institution
    Tech. Univ. of Vienna, Vienna, Austria
  • fYear
    2014
  • fDate
    27-30 July 2014
  • Firstpage
    720
  • Lastpage
    725
  • Abstract
    Automatic processing of data for the purpose of determining operating states and identifying faults has become essential for many modern industrial systems. Typical sources of this data include hundreds of sensors mounted at the industrial machinery measuring qualities such as temperature, vibration, pressure, and many more. However, sensors are complex technical devices, which means that they can fail and their readings may contain noise or imprecise values. Such low quality data makes it hard to solve the original task of assessing system and process status. We present an approach which brings together several well-known techniques from computer science and statistics and enhances monitoring of technical systems by improving results of detection and correction of data quality issues in sensor data. The application domain and the dependencies between its objects are represented as a knowledge-based model, while statistics identifies data anomalies, such as outlying or missing values, in sensor measurement data. Combining information from the knowledge-based model and statistical computations allows to validate and improve data analysis results. We demonstrate the proposed approach on a real-world industrial use case from the power generation domain. Our evaluation shows that the combined solution improves precision indexes while maintaining high accuracy and recall values.
  • Keywords
    ontologies (artificial intelligence); sensor fusion; statistical analysis; application domain; automatic data processing; computer science; data analysis results; data anomalies; data quality issue correction; data quality issue detection; explicit domain models; fault identification; industrial machinery; industrial systems; knowledge-based model; ontology; operating states; power generation domain; pressure measurement; quality measurement; sensor data; sensor measurement data; sensors; statistical computations; statistical data quality improvement; statistics; technical system monitoring; temperature measurement; vibration measurement; Data models; OWL; Ontologies; Sensor phenomena and characterization; Temperature measurement; Turbines; data quality; industrial control; knowledge-based methods; statistics; time series;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Informatics (INDIN), 2014 12th IEEE International Conference on
  • Conference_Location
    Porto Alegre
  • Type

    conf

  • DOI
    10.1109/INDIN.2014.6945602
  • Filename
    6945602