DocumentCode :
294534
Title :
PhoneBook: a phonetically-rich isolated-word telephone-speech database
Author :
Pitrelli, John F. ; Fong, Cynthia ; Wong, Suk H. ; Spitz, Judith R. ; Leung, Hong C.
Author_Institution :
NYNEX Corp., White Plains, NY, USA
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
101
Abstract :
Describes the collection of a phonetically-rich isolated-word telephone-speech database, “PhoneBook”, which was undertaken because of (1) the lack of available large-vocabulary isolated-word data, (2) anticipated continued importance of isolated-word and keyword-spotting technology to speech-recognition-based applications over the telephone, and (3) findings that continuous-speech training data is inferior to isolated-word training for isolated-word recognition. PhoneBook has nearly 8000 distinct words, selected for complete coverage of phoneme contexts enumerated using both triphones and a novel method which takes into account syllable position, lexical stress, and non-adjacent-phoneme coarticulatory effects. PhoneBook consists of more than 92000 utterances, averaging over 11 talkers for each word. A demographically-representative set of over 1300 native speakers of American English each made a single telephone call and read 75 words. The paper describes the word list design, talker enrolment procedure, recording procedure and equipment, utterance verification method, and summary statistics for PhoneBook, which will be made available through the Linguistic Data Consortium
Keywords :
speech recognition; telephony; American English; PhoneBook; demographically-representative set; equipment; lexical stress; nonadjacent-phoneme coarticulatory effect; phoneme contexts; phonetically-rich isolated-word telephone-speech database; recording procedure; speech-recognition-based applications; summary statistics; syllable position; talker enrolment procedure; telephone call; triphones; utterance verification method; word list design; Aging; Databases; Filters; Isolation technology; Speech recognition; Statistics; Stress; Telephony; Tellurium; Training data; Vocabulary; Wideband;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479283
Filename :
479283
Link To Document :
بازگشت