DocumentCode
290114
Title
Macrophone: an American English telephone speech corpus for the Polyphone project
Author
Bernstein, J. ; Taussig, Kelsey ; Godfrey, Jack
Author_Institution
Speech Res. & Technol. Program, SRI Int., Menlo Park, CA, USA
Volume
i
fYear
1994
fDate
19-22 Apr 1994
Abstract
Macrophone is a corpus of approximately 200,000 utterances, recorded over the telephone from a broad sample of about 5,000 American speakers. Sponsored by the Linguistic Data Consortium (LDC), it is the first of a series of similar data sets that will be collected for major languages of the world in a cooperative project called Polyphone. It is designed to provide telephone speech suitable for the development of automatic voice-interactive telephone services. In particular, Macrophone contains training material for applications in transportation, scheduling, ticketing, database access, shopping, and other automated telephone interactions. In addition to being phonetically balanced, the spoken material refers to times, locations, monetary amounts, and interactive operations. The utterances are spoken by respondents into telephone handsets and recorded directly in 8-bit mu-law digital form through a T1 connection to the usual switched telephone network. The paper describes the design of the linguistic materials in the corpus and the process of solicitation, collection, transcription, and file preparation for the Macrophone corpus.
Keywords
interactive systems; speech recognition; telephony; 8 bit; American English; Linguistic Data Consortium; Macrophone corpus; Polyphone; T1 connection; automatic voice-interactive telephone services; data sets; database access; interactive operations; linguistic materials; mu-law digital recording; polyphone project; scheduling; shopping; solicitation; switched telephone network; telephone handsets; telephone speech; telephone speech corpus; ticketing; training material; transportation; Databases; Instruments; Job shop scheduling; Lifting equipment; Natural languages; Rail transportation; Speech; Telephone sets; Telephony; Testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location
Adelaide, SA
ISSN
1520-6149
Print_ISBN
0-7803-1775-0
Type
conf
DOI
10.1109/ICASSP.1994.389350
Filename
389350