Title :
Real-time synchronization of live speech with its transcription
Author :
Lertwongkhanakool, Nat ; Punyabukkana, Proadpran ; Suchato, Atiwong
Author_Institution :
Dept. of Comput. Eng., Chulalongkorn Univ., Bangkok, Thailand
Abstract :
Most research on synchronizing audio and text has focused on synchronization at the utterance level. However, generating audio books from live speech in an unstructured language such as Thai requires a finer level of synchronization. We propose an algorithm that synchronizes live speech with its corresponding transcription in real time at the syllable level. The proposed algorithm combines a syllable endpoint detection method and a syllable landmark detection method, both using bandlimited intensity as features. The experiment was conducted on the LOTUS datasets, and the results were compared with a baseline ASR-based syllable detection method. We evaluated the algorithm by its error aberration, which is the difference between the actual number of syllables and the number of detected syllables for each phrase, and found that the proposed algorithm achieves a lower average total error aberration than the baseline: 11.54 for the proposed method versus 34.21 for the baseline. The reference deviation of our method was also better than that of the baseline.
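The error aberration metric described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the absolute difference and the plain mean over phrases used here are assumptions, and the function names `error_aberration` and `mean_error_aberration` are hypothetical; the sketch is not expected to reproduce the reported figures.

```python
from typing import Sequence


def error_aberration(actual: int, detected: int) -> int:
    """Per-phrase error: gap between actual and detected syllable counts.

    Assumption: the absolute difference is used, so over- and
    under-detection are penalized equally.
    """
    return abs(actual - detected)


def mean_error_aberration(
    actual_counts: Sequence[int], detected_counts: Sequence[int]
) -> float:
    """Average the per-phrase error aberration over a set of phrases.

    Assumption: a plain arithmetic mean; the paper's own aggregation
    may differ.
    """
    if len(actual_counts) != len(detected_counts):
        raise ValueError("both sequences must cover the same phrases")
    if not actual_counts:
        raise ValueError("at least one phrase is required")
    total = sum(
        error_aberration(a, d) for a, d in zip(actual_counts, detected_counts)
    )
    return total / len(actual_counts)
```

For example, three phrases with 5, 4, and 6 actual syllables and 3, 4, and 7 detected syllables give per-phrase errors of 2, 0, and 1.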
Keywords :
speech processing; synchronisation; text analysis; ASR based syllable detection; LOTUS datasets; Thai; audio synchronization; average total error aberration; live speech synchronization; real time synchronization; reference deviation; syllabic unit; syllable endpoint detection method; syllable landmark detection method; text synchronization; unstructured language; utterance; Accuracy; Books; Feature extraction; Real-time systems; Speech; Synchronization; Training; Automatic speech-text synchronization; Endpoint Detection; Live speech and transcription alignment; Real-Time alignment; Syllable Detection;
Conference_Title :
Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2013 10th International Conference on
Conference_Location :
Krabi
Print_ISBN :
978-1-4799-0546-1
DOI :
10.1109/ECTICon.2013.6559560