• DocumentCode
    290114
  • Title

    Macrophone: an American English telephone speech corpus for the Polyphone project

  • Author

    Bernstein, J. ; Taussig, Kelsey ; Godfrey, Jack

  • Author_Institution
    Speech Res. & Technol Program, SRI Int., Menlo Park, CA, USA
  • Volume
    i
  • fYear
    1994
  • fDate
    19-22 Apr 1994
  • Abstract
    Macrophone is a corpus of approximately 200000 utterances, recorded over the telephone from a broad sample of about 5000 American speakers. Sponsored by the Linguistic Data Consortium (LDC), it is the first of a series of similar data sets that will be collected for major languages of the world in a cooperative project called Polyphone. It is designed to provide telephone speech suitable for the development of automatic voice-interactive telephone services. In particular, Macrophone contains training material for applications in transportation, scheduling, ticketing, database access, shopping, and other automated telephone interactions. In addition to being phonetically balanced, the spoken material refers to times, locations, monetary amounts, and interactive operations. The utterances are spoken by respondents into telephone handsets and recorded directly in 8-bit mu-law digital form through a T1 connection to the usual switched telephone network. The paper describes the design of the linguistic materials in the corpus, and the process of solicitation, collection, transcription, and file preparation for the Macrophone corpus
  • Keywords
    interactive systems; speech recognition; telephony; 8 bit; American English; Linguistic Data Consortium; Macrophone corpus; Polyphone; T1 connection; automatic voice-interactive telephone services; data sets; database access; interactive operations; linguistic materials; mu-law digital recording; polyphone project; scheduling; shopping; solicitation; switched telephone network; telephone handsets; telephone speech; telephone speech corpus; ticketing; training material; transportation; Databases; Instruments; Job shop scheduling; Lifting equipment; Natural languages; Rail transportation; Speech; Telephone sets; Telephony; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
  • Conference_Location
    Adelaide, SA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-1775-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1994.389350
  • Filename
    389350