DocumentCode
2335292
Title
ASR and Translation for Under-Resourced Languages
Author
Besacier, L. ; Le, V.-B. ; Boitet, C. ; Berment, V.
Author_Institution
CLIPS/IMAG Lab., UJF, Grenoble
Volume
5
fYear
2006
fDate
14-19 May 2006
Abstract
There are more than 6000 languages in the world but only a small number possess the resources required for implementation of human language technologies (HLT). Thus, HLT are mostly concerned by languages for which large resources are available or which have suddenly become of interest because of the economic or political scene. On the contrary, languages from developing countries or minorities have been less worked on in the past years. One way of improving this "language divide" is do more research on portability of HLT for multilingual applications. In this paper, we concentrate on speech-to-speech translation. We present here our methodology for fast development of ASR systems for under-resourced languages or, as they are called now, pi-languages (poorly equipped). We present the resources collected for Vietnamese, and the experimental results of our first Vietnamese ASR system. The current validation of our methodology for Khmer is described next. We also discuss some issues related to machine translation and present first contributions of our laboratory in this context of "pi-languages"
Keywords
language translation; natural languages; speech recognition; automatic speech recognition; human language technologies; language translation; speech-to-speech translation; under-resourced languages; Automatic speech recognition; Dictionaries; Laboratories; Natural languages; Optical character recognition software; Printing; Speech processing; Speech synthesis; Text processing; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1661502
Filename
1661502
Link To Document