Title :
Dynamic querying of streaming data with the dQUOB system
Author :
Plale, Beth ; Schwan, Karsten
Author_Institution :
Dept. of Comput. Sci., Indiana Univ., Bloomington, IN, USA
fDate :
4/1/2003 12:00:00 AM
Abstract :
Data streaming has established itself as a viable communication abstraction in data-intensive parallel and distributed computations, occurring in applications such as scientific visualization, performance monitoring, and large-scale data transfer. A known problem in large-scale event communication is tailoring the data received at the consumer. It is the general problem of extracting data of interest from a data source, a problem that the database community has successfully addressed with SOL queries, a time tested, user-friendly way for noncomputer scientists to access data. By leveraging the efficiency of query processing provided by relational queries, the dQUOB system provides a conceptual relational data model and SOL query access over streaming data. Queries can be used to extract data, combine streams, and create new streams. The language augments queries with an action to enable more complex data transformations such as Fourier transforms. The dQUOB system has been applied to two large-scale distributed applications: a safety critical autonomous robotics simulation and scientific software visualization for global atmospheric transport modeling. In this paper, we present the dQUOB system and the results of performance evaluation undertaken to assess its applicability in data-intensive wide-area computations, where the benefit of portable data transformation must be evaluated against the cost of continuous query evaluation.
Keywords :
SQL; data handling; parallel processing; query processing; relational databases; robots; SOL; atmospheric transport modeling; autonomous robotics; dQUOB system; data streams; data-intensive computations; database query processing; distributed computations; grid computing; parallel processing; publish-subscribe event channels; relational database; Atmospheric modeling; Computer applications; Concurrent computing; Data mining; Data visualization; Distributed computing; Large-scale systems; Monitoring; Query processing; Software safety;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
DOI :
10.1109/TPDS.2003.1195413