DocumentCode
1184972
Title
Dynamic querying of streaming data with the dQUOB system
Author
Plale, Beth ; Schwan, Karsten
Author_Institution
Dept. of Comput. Sci., Indiana Univ., Bloomington, IN, USA
Volume
14
Issue
4
fYear
2003
fDate
4/1/2003 12:00:00 AM
Firstpage
422
Lastpage
432
Abstract
Data streaming has established itself as a viable communication abstraction in data-intensive parallel and distributed computations, occurring in applications such as scientific visualization, performance monitoring, and large-scale data transfer. A known problem in large-scale event communication is tailoring the data received at the consumer. It is the general problem of extracting data of interest from a data source, a problem that the database community has successfully addressed with SOL queries, a time tested, user-friendly way for noncomputer scientists to access data. By leveraging the efficiency of query processing provided by relational queries, the dQUOB system provides a conceptual relational data model and SOL query access over streaming data. Queries can be used to extract data, combine streams, and create new streams. The language augments queries with an action to enable more complex data transformations such as Fourier transforms. The dQUOB system has been applied to two large-scale distributed applications: a safety critical autonomous robotics simulation and scientific software visualization for global atmospheric transport modeling. In this paper, we present the dQUOB system and the results of performance evaluation undertaken to assess its applicability in data-intensive wide-area computations, where the benefit of portable data transformation must be evaluated against the cost of continuous query evaluation.
Keywords
SQL; data handling; parallel processing; query processing; relational databases; robots; SOL; atmospheric transport modeling; autonomous robotics; dQUOB system; data streams; data-intensive computations; database query processing; distributed computations; grid computing; parallel processing; publish-subscribe event channels; relational database; Atmospheric modeling; Computer applications; Concurrent computing; Data mining; Data visualization; Distributed computing; Large-scale systems; Monitoring; Query processing; Software safety;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2003.1195413
Filename
1195413
Link To Document