Title :
Incremental data feed maintenance of a data warehouse system derived from multiple autonomous data sources
Author :
Xu, Wei ; Li, Maoqing ; Wu, Shunxiang ; Zhu, Shunzhi ; Wang, Zhoujing ; Miao, Kehua ; Wang, Ying
Author_Institution :
Dept. of Autom., Xiamen Univ., China
Abstract :
The data acquisition process, in which the data warehouse and operational data store (ODS) are populated from operational sources, is the most technically challenging part of any business intelligence (BI) environment. Some industry experts estimate that 60 to 80 percent of a BI project's effort is spent on this process alone. Nevertheless, most previous development work has relied on manually operating visual GUI tools such as Informatica: launching the tool, entering properties, and driving the ETL process by hand. This article introduces a better, on-demand means of pulling data from modern heterogeneous data sources by integrating Informatica, Oracle, and Korn shell scripts. We present a practical production example showing how to build an efficient, scalable, controllable, and maintainable ETL (extract, transform, load) architecture. Within this infrastructure, we adopt two new techniques: process synchronization control (PSC) and time range control (TRC).
Keywords :
control engineering computing; data acquisition; data warehouses; graphical user interfaces; process control; Informatica integration; Korn shell script; Oracle; business intelligence environment; data acquisition process; data warehouse system; incremental data feed maintenance; multiple autonomous data sources; operational data store; process synchronization control; time range control; visual GUI tools; Automatic control; Automation; Bismuth; Control systems; Data acquisition; Data mining; Data warehouses; Feeds; Process control; Production; ETL; Informatica; Process synchronization control (PSC); Time range control (TRC);
Conference_Title :
Control and Automation, 2005. ICCA '05. International Conference on
Print_ISBN :
0-7803-9137-3
DOI :
10.1109/ICCA.2005.1528287