DocumentCode
294534
Title
PhoneBook: a phonetically-rich isolated-word telephone-speech database
Author
Pitrelli, John F. ; Fong, Cynthia ; Wong, Suk H. ; Spitz, Judith R. ; Leung, Hong C.
Author_Institution
NYNEX Corp., White Plains, NY, USA
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
101
Abstract
Describes the collection of a phonetically-rich isolated-word telephone-speech database, “PhoneBook”, which was undertaken because of (1) the lack of available large-vocabulary isolated-word data, (2) anticipated continued importance of isolated-word and keyword-spotting technology to speech-recognition-based applications over the telephone, and (3) findings that continuous-speech training data is inferior to isolated-word training for isolated-word recognition. PhoneBook has nearly 8000 distinct words, selected for complete coverage of phoneme contexts enumerated using both triphones and a novel method which takes into account syllable position, lexical stress, and non-adjacent-phoneme coarticulatory effects. PhoneBook consists of more than 92000 utterances, averaging over 11 talkers for each word. A demographically-representative set of over 1300 native speakers of American English each made a single telephone call and read 75 words. The paper describes the word list design, talker enrolment procedure, recording procedure and equipment, utterance verification method, and summary statistics for PhoneBook, which will be made available through the Linguistic Data Consortium
Keywords
speech recognition; telephony; American English; PhoneBook; demographically-representative set; equipment; lexical stress; nonadjacent-phoneme coarticulatory effect; phoneme contexts; phonetically-rich isolated-word telephone-speech database; recording procedure; speech-recognition-based applications; summary statistics; syllable position; talker enrolment procedure; telephone call; triphones; utterance verification method; word list design; Aging; Databases; Filters; Isolation technology; Speech recognition; Statistics; Stress; Telephony; Tellurium; Training data; Vocabulary; Wideband;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479283
Filename
479283
Link To Document