Title :
Cross-lingual lexical language discovery from audio data using multiple translations
Author :
Stahlberg, F. ; Schlippe, T. ; Vogel, S. ; Schultz, T.
Author_Institution :
Cognitive Syst. Lab., Karlsruhe Inst. of Technol., Karlsruhe, Germany
Abstract :
Zero-resource Automatic Speech Recognition (ZR ASR) addresses target languages without given pronunciation dictionary, transcribed speech, and language model. Lexical discovery for ZR ASR aims to extract word-like chunks from speech. Lexical discovery benefits from the availability of written translations in another source language. In this paper, we improve lexical discovery even more by combining multiple source languages. We present a novel method for combining noisy word segmentations resulting in up to 11.2% relative F-score gain. When we extract word pronunciations from the combined segmentations to bootstrap an ASR system, we improve accuracy by 9.1% relative compared to the best system with only one translation, and by 50.1% compared to monolingual lexical discovery.
Keywords :
natural language processing; speech recognition; audio data; cross lingual lexical language discovery; lexical discovery; multiple language translation; multiple source language; noisy word segmentation; word-like chunks; zero resource automatic speech recognition; Acoustics; Automatic speech recognition; Computational modeling; Dictionaries; Speech; Zirconium; Lexical language discovery; non-written languages; word-to-phoneme alignment; zero-resource automatic speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7179088