مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

3179637

Title :

Dealing with untranscribed speech

Author :

Prahallad, K.

Author_Institution :

Language Technol. Res. Center, Int. Inst. of Inf. Technol., Hyderabad, India

fYear :

2012

fDate :

22-25 July 2012

Firstpage :

Lastpage :

Abstract :

With the advent of social networks, there has been an exponential growth in multimedia data including speech. This speech data is typically conversational, casual and recorded in real environment. An important characteristic of this speech data is unavailability of corresponding transcripts (text) or the language information. In this work, we discuss technologies dealing with speech data without any corresponding transcripts and/or language information. A traditional way is to adopt acoustic models from existing benchmark databases (of known languages) for obtaining a first-level transcription and then perform bootstrapping. We show inherent limitations of such approaches, and argue that signal processing algorithms based on speech production knowledge play an important role in dealing with such speech data. This paper discusses some of the ongoing work at our lab in this direction which includes building audio search, speech summarization, speech synthesis and voice conversion using untranscribed speech.

Keywords :

multimedia communication; speech synthesis; audio search; benchmark databases; bootstrapping; first-level transcription; language information; multimedia data; signal processing algorithms; social networks; speech data; speech production knowledge; speech summarization; speech synthesis; untranscribed speech; voice conversion; Acoustics; Adaptation models; Buildings; Production; Signal processing algorithms; Speech; Speech processing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing and Communications (SPCOM), 2012 International Conference on

Conference_Location :

Bangalore

Print_ISBN :

978-1-4673-2013-9

Type :

conf

DOI :

10.1109/SPCOM.2012.6290249

Filename :

6290249

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3179637