Title :
Macrophone: an American English telephone speech corpus for the Polyphone project
Author :
Bernstein, J. ; Taussig, Kelsey ; Godfrey, Jack
Author_Institution :
Speech Res. & Technol. Program, SRI Int., Menlo Park, CA, USA
Abstract :
Macrophone is a corpus of approximately 200,000 utterances, recorded over the telephone from a broad sample of about 5,000 American speakers. Sponsored by the Linguistic Data Consortium (LDC), it is the first of a series of similar data sets that will be collected for major languages of the world in a cooperative project called Polyphone. It is designed to provide telephone speech suitable for the development of automatic voice-interactive telephone services. In particular, Macrophone contains training material for applications in transportation, scheduling, ticketing, database access, shopping, and other automated telephone interactions. In addition to being phonetically balanced, the spoken material refers to times, locations, monetary amounts, and interactive operations. The utterances are spoken by respondents into telephone handsets and recorded directly in 8-bit mu-law digital form through a T1 connection to the usual switched telephone network. The paper describes the design of the linguistic materials in the corpus and the process of solicitation, collection, transcription, and file preparation.
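As an illustration of the 8-bit mu-law form mentioned in the abstract, the following is a minimal sketch of standard G.711 mu-law expansion to linear PCM. It is not taken from the paper: the function names are hypothetical, and the abstract does not state the corpus file layout or header format, so the sketch assumes the input is a raw sequence of mu-law bytes.

```python
# Hypothetical helper names; assumes raw G.711 mu-law bytes with no file header.
BIAS = 0x84  # bias added during mu-law compression (132)


def mulaw_decode_byte(b: int) -> int:
    """Expand one 8-bit mu-law code to a signed linear PCM sample."""
    b = ~b & 0xFF                 # mu-law codes are stored bit-inverted
    sign = b & 0x80               # top bit: 1 means negative
    exponent = (b >> 4) & 0x07    # 3-bit segment number
    mantissa = b & 0x0F           # 4-bit step within the segment
    magnitude = (((mantissa << 3) + BIAS) << exponent) - BIAS
    return -magnitude if sign else magnitude


def mulaw_decode(data: bytes) -> list[int]:
    """Expand a byte string of mu-law codes to a list of linear samples."""
    return [mulaw_decode_byte(b) for b in data]
```

For example, `mulaw_decode(b"\xff\x80\x00")` expands the code for silence and the two full-scale codes; the decoded magnitudes fit in a signed 16-bit sample.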
Keywords :
interactive systems; speech recognition; telephony; 8 bit; American English; Linguistic Data Consortium; Macrophone corpus; Polyphone; T1 connection; automatic voice-interactive telephone services; data sets; database access; interactive operations; linguistic materials; mu-law digital recording; polyphone project; scheduling; shopping; solicitation; switched telephone network; telephone handsets; telephone speech; telephone speech corpus; ticketing; training material; transportation; Databases; Instruments; Job shop scheduling; Lifting equipment; Natural languages; Rail transportation; Speech; Telephone sets; Telephony; Testing
Conference_Title :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389350