Title :
Combining open vocabulary recognition and word confusion networks
Author_Institution :
Cambridge Univ., Cambridge
Date :
March 31 2008-April 4 2008
Abstract :
A limitation of most speech recognizers is that they only recognize words from a fixed vocabulary. In this paper, we explore a technique for addressing this deficiency using automatically derived units made up of letters and phones. We show how these units can be used for letter-to-phone conversion and open-vocabulary recognition. We further show how these units can be merged to form novel words while maintaining a word lattice structure. This allows creation of a word confusion network containing both in- and out-of-vocabulary (OOV) words. Experiments show these open vocabulary confusion networks improve recognition accuracy. They also allow open vocabulary recognition to be used in concert with a convenient confusion network result representation.
Keywords :
speech recognition; vocabulary; in-vocabulary words; letter-to-phone conversion; open vocabulary recognition; out-of-vocabulary words; speech recognizers; word confusion networks; word lattice structure; Automatic speech recognition; Buildings; Dictionaries; Error analysis; Laboratories; Lattices; Merging; Natural languages
Conference_Title :
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008)
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518612