• DocumentCode
    185627
  • Title

    Experience Report: Verifying Data Interaction Coverage to Improve Testing of Data-Intensive Systems: The Norwegian Customs and Excise Case Study

  • Author

    Sen, Satyaki ; Ieva, Carlo ; Sarkar, Anirban ; Sander, Atle ; Grime, Astrid

  • Author_Institution
    Simula Res. Lab., Certus V&V Center, Oslo, Norway
  • fYear
    2014
  • fDate
    3-6 Nov. 2014
  • Firstpage
    223
  • Lastpage
    234
  • Abstract
    Testing data-intensive systems is paramount to increase our reliance on information processed in e-governance, scientific/ medical research, and social networks. A common practice in the industrial testing process is to use test databases copied from live production streams to test functionality of complex database applications that manage well-formedness of data and its adherence to business rules in these systems. This practice is often based on the assumption that the test database adequately covers realistic scenarios to test, hopefully, all functionality in these applications. There is a need to systematically evaluate this assumption. We present a tool-supported method to model realistic scenarios and verify whether copied test databases actually cover them and consequently facilitate adequate testing. We conceptualize realistic scenarios as data interactions between fields cross-cutting a complex database schema and model them as test cases in a classification tree model. We present a human-in the-loop tool, DEPICT, that uses the classification tree model as input to (a) facilitate interactive selection of a connected sub graph from often many possible paths of interactions between tables specified in the model (b) automatically generate SQL queries to create an inner join between tables in the connected sub graph (c) extract records from the join and generate a visual report of satisfied and unsatisfied interactions hence quantifying test adequacy of the test database. We report our experience as a qualitative evaluation of approach and with a large industrial database from the Norwegian Customs and Excise information system TVINN featuring large and complex databases with millions of records.
  • Keywords
    database management systems; pattern classification; program testing; program verification; trees (mathematics); DEPICT; Norwegian customs; SQL queries; TVINN; business rules; classification tree model; complex database applications; complex database schema; connected subgraph; data interaction coverage verification; data-intensive systems testing; e-governance; excise information system; experience report; functionality testing; human-in the-loop tool; industrial testing process; interactive selection; medical research; scientific research; social networks; test cases; test databases; tool-supported method; Business; Data models; Databases; Finance; Industries; Manuals; Testing; classification tree modelling; coverage; data interaction; data-intensive systems; relational databases; testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Reliability Engineering (ISSRE), 2014 IEEE 25th International Symposium on
  • Conference_Location
    Naples
  • ISSN
    1071-9458
  • Print_ISBN
    978-1-4799-6032-3
  • Type

    conf

  • DOI
    10.1109/ISSRE.2014.22
  • Filename
    6982629