DocumentCode :
591907
Title :
A grapheme-based method for automatic alignment of speech and text data
Author :
Stan, Andrei ; Bell, P. ; King, Simon
Author_Institution :
Commun. Dept., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
fYear :
2012
fDate :
2-5 Dec. 2012
Firstpage :
286
Lastpage :
290
Abstract :
This paper introduces a method for automatic alignment of speech data with unsynchronised, imperfect transcripts, for a domain where no initial acoustic models are available. Using grapheme-based acoustic models, word skip networks and orthographic speech transcripts, we are able to harvest 55% of the speech with a 93% utterance-level accuracy and 99% word accuracy for the produced transcriptions. The work is based on the assumption that there is a high degree of correspondence between the speech and text, and that a full transcription of all of the speech is not required. The method is language independent and the only prior knowledge and resources required are the speech and text transcripts, and a few minor user interventions.
Keywords :
acoustic signal processing; natural language processing; speech synthesis; text analysis; word processing; automatic speech data alignment; automatic text data alignment; grapheme-based acoustic models; language independent method; orthographic speech transcripts; text transcription; unsynchronised-imperfect transcripts; utterance-level accuracy; word accuracy; word skip networks; Acoustics; Data models; Error analysis; Hidden Markov models; Speech; Speech recognition; Training; grapheme-based models; imperfect transcripts; speech alignment; word networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4673-5125-6
Electronic_ISBN :
978-1-4673-5124-9
Type :
conf
DOI :
10.1109/SLT.2012.6424237
Filename :
6424237
Link To Document :
بازگشت