Title :
Web based machine learning for language identification and translation
Author :
Sagiroglu, Seref ; Yavanoglu, Uraz ; Guven, Esra Nergis
Author_Institution :
Gazi Univ., Ankara
Abstract :
Language identification is an important task for Web information retrieval services. This paper presents the implementation of a platform for language identification in multi-lingual documents on Web. The platform consists of five modules to achieve the tasks automatically. Furthermore, artificial neural networks were used for the identification of languages in multi-lingual documents. Results for six languages including Turkish, French, Italian, Danish and Deutsch are present. The major benefit of the approach is that the ANN based language identification system could meet the expectations in real-time language identification accuracy with the help of a developed system. Experiments have shown that system achieves the tasks in high accuracy in discriminating different languages and converting them other languages on Web pages.
Keywords :
Internet; Web sites; document handling; information retrieval; language translation; learning (artificial intelligence); natural language processing; neural nets; Web based machine learning; Web information retrieval services; Web pages; artificial neural networks; language translation; multilingual documents; real-time language identification accuracy; Application software; Artificial neural networks; Biological system modeling; Computational modeling; Computer architecture; Computer networks; Hidden Markov models; Machine learning; Natural languages; Web pages;
Conference_Titel :
Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
Conference_Location :
Cincinnati, OH
Print_ISBN :
978-0-7695-3069-7
DOI :
10.1109/ICMLA.2007.27