Title :
Study on Log-Based Change Data Capture and Handling Mechanism in Real-Time Data Warehouse
Author :
Shi, Jingang ; Bao, Yubin ; Leng, FangLing ; Yu, Ge
Author_Institution :
Dept. of Comput. Sci., Northeastern Univ., Shenyang
Abstract :
This paper proposes a framework for change data capture and data extraction. The framework captures changed data through log analysis and processes the captured data further to improve data quality. The processed data are then pushed to a data queue, which the system processes using a priority-based scheduling algorithm. Finally, the processed data are loaded into the real-time data warehouse to support decision analysis. Analysis of a test case shows that this method can capture all changed data from the source in time without changing the structure of the source system, and that it has little impact on the performance of the source system. In addition, the real-time scheduling algorithm can effectively improve the data quality and data freshness of the real-time data warehouse, giving better data support for a business's routine tactical decisions.
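The pipeline described in the abstract (capture changes from a log, queue them, drain the queue by priority, load the result) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the `op|table|key=value,...` log format, the names `ChangeRecord`, `ChangeQueue`, `parse_log_line` and `load_changes`, and the rule that priority is assigned per table with lower numbers served first. The abstract does not describe the actual log format or scheduling policy.

```python
import heapq
import itertools
from dataclasses import dataclass, field


@dataclass(order=True)
class ChangeRecord:
    # Ordering uses (priority, seq) only; lower priority value is served first.
    priority: int
    seq: int
    table: str = field(compare=False)
    op: str = field(compare=False)
    row: dict = field(compare=False)


class ChangeQueue:
    """Priority queue of captured changes (hypothetical helper)."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # keeps arrival order within a priority

    def push(self, table, op, row, priority):
        record = ChangeRecord(priority, next(self._counter), table, op, row)
        heapq.heappush(self._heap, record)

    def pop(self):
        return heapq.heappop(self._heap)

    def __len__(self):
        return len(self._heap)


def parse_log_line(line):
    # Assumed log format for this sketch: "op|table|key=value,key=value"
    op, table, payload = line.strip().split("|", 2)
    row = dict(pair.split("=", 1) for pair in payload.split(",") if pair)
    return op, table, row


def load_changes(log_lines, table_priority, default_priority=10):
    """Capture changes from log lines, queue them, and drain by priority."""
    queue = ChangeQueue()
    for line in log_lines:
        op, table, row = parse_log_line(line)
        queue.push(table, op, row, table_priority.get(table, default_priority))

    loaded = []  # stands in for the warehouse load step
    while queue:
        record = queue.pop()
        loaded.append((record.table, record.op, record.row))
    return loaded
```

In this sketch the sequence counter preserves source order among changes of equal priority, so updates to the same table are applied in the order they were logged. A real scheduler would likely also weigh data freshness or query demand, which this example does not attempt.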
Keywords :
data handling; data warehouses; data extraction; decision analysis; handling mechanism; log-based change data capture; priority-based scheduling algorithm; real-time data warehouse; Algorithm design and analysis; Computer science; Data analysis; Data mining; Data warehouses; Databases; Dictionaries; Information analysis; Scheduling algorithm; Software engineering;
Conference_Title :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
DOI :
10.1109/CSSE.2008.926