Title :
Toward full-text searching middleware over hierarchical documents
Author :
Kun Ma ; Bo Yang ; Abraham, Ajith
Author_Institution :
Shandong Provincial Key Lab. of Network Based Intell. Comput., Univ. of Jinan, Jinan, China
Abstract :
In the big data era, full-text searching can benefit from both emerging NoSQL databases and traditional indexing tools. However, current solutions have drawbacks. On one hand, the indexed documents lack hierarchy. On the other hand, big data has become the bottleneck of full-text searching. In this context, we design a full-text searching middleware over hierarchical documents and discuss its architecture in detail. In addition, we propose a structure-independent hierarchical document model to represent hierarchical documents. Moreover, a transformation engine is designed to translate rich files into these models. The core log event listener is responsible for capturing changed documents and pushing them to the indexing storage at the same time. The experimental results show that our middleware has advantages over the RDBMS-with-indexes and RDBMS-with-Lucene solutions.
Keywords :
document handling; indexing; information retrieval; middleware; relational databases; Lucene solutions; NoSQL databases; RDBMS; big data era; full-text searching middleware; indexing documents; indexing tools; log event listener; structure-independent hierarchical document model; Engines; Indexes; Middleware; Open source software; Real-time systems; Full-text searching; NoSQL; hierarchical documents; middleware;
Conference_Title :
Intelligent Systems Design and Applications (ISDA), 2013 13th International Conference on
Conference_Location :
Bangi
Print_ISBN :
978-1-4799-3515-4
DOI :
10.1109/ISDA.2013.6920734