DocumentCode :
125378
Title :
Web Service Orchestration Topic Mining
Author :
Chu, Victor W. ; Wong, Raymond K. ; Chi Hung Chi ; Hung, Patrick C. K.
Author_Institution :
Univ. of New South Wales, Sydney, NSW, Australia
fYear :
2014
fDate :
June 27 2014-July 2 2014
Firstpage :
225
Lastpage :
232
Abstract :
Due to the popularity of using web services to deliver services on the Web, a clear view of how they are being consumed is becoming critical. Researchers have been trying multiple methods to reveal actual service orchestration patterns from service logs. However, most of the discovery methods have taken deterministic approaches, and hence, they do not provide enough allowance to cater for incomplete data and noises. On the other hand, most investigations do not take combinatorial explosion into consideration leading to scalability problem. Moreover, asynchronous web service invocations and distributed executions also make it difficult to identify service patterns due to the randomness in log record generation. In this paper, probabilistic topic mining class of solutions are applied to reveal web service orchestration patterns from service logs, in which robust approximation methods are available to provide scalability. Data sparsity problem in service log is also investigated by using biterm topic model (BTM) and comparing its results with traditional latent Dirichlet allocation (LDA) model. In addition, a topic matching method is introduced based on the Hungarian method on Jensen-Shannon divergence matrix, whilst notions of aggJSD and autoJSD are also introduced to measure topic diversity between matched topic sets and within a single topic set respectively. Experiment results confirm that BTM can be used for service logs with short log entries and with sparsity larger than 90% approximately.
Keywords :
Web services; data mining; probability; BTM; Hungarian method; Jensen-Shannon divergence matrix; LDA; Web service; aggJSD; autoJSD; biterm topic model; data sparsity problem; latent dirichlet allocation; log record generation; orchestration; probabilistic topic mining class; robust approximation methods; topic matching method; topic mining; Approximation methods; Business; Data mining; Data models; Electronic mail; Probabilistic logic; Web services; data sparsity problem; orchestration; short message problem; topic model; web services;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Services (ICWS), 2014 IEEE International Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4799-5053-9
Type :
conf
DOI :
10.1109/ICWS.2014.42
Filename :
6928902
Link To Document :
بازگشت