Author/Authors :
YAVANOĞLU, Uraz Gazi Üniversitesi - Mühendislik Fakültesi - Bilgisayar Mühendisliği Bölümü, Turkey , SAĞIROĞLU, Şeref Gazi Üniversitesi - Mühendislik Fakültesi - Bilgisayar Mühendisliği Bölümü, Turkey
Title Of Article :
Intelligent document language classification and web based translation system
شماره ركورد :
28169
Abstract :
Recent developments on information and communications technologies help globally and important to access, share, translate and the documents use easily and effectively via internet media. Language identification is an important task for web information retrieval services. Automatic language identification and translation have become increasingly important, as more and more documents are being served on internet within many languages. This study presents new methods to identify web contents, containing MS Word, PDF and HTML documents in different languages and to translate them into specified languages. The identification problem can be seen as a specific instance of the more general problem of an item classification through its attributes in a limited workspace. This novel approach is based on artificial neural network model to recognize the languages. Documents content belonging to 15 languages were used in test with a new testing methodology and translating them into 64 languages automatically for language processing. The results have shown that the approaches presented in this work are very successful to meet the expectations in real-time language identification and translation accuracy and reduce the number of letters in solution space in comparison with the available two methods.
From Page :
329
NaturalLanguageKeyword :
Language identification , language translation , web based application , artificial neural network
JournalTitle :
Erciyes University Journal Of The Institute Of Science an‎d Technology
To Page :
342
Link To Document :
بازگشت