• DocumentCode
    388378
  • Title

    Automatic labeling of speech

  • Author

    Spohrer, James C. ; Brown, Peter F. ; Roth, Robert

  • Author_Institution
    Verbex, Bedford, Mass, U.S.A.
  • Volume
    7
  • fYear
    1982
  • fDate
    30072
  • Firstpage
    1641
  • Lastpage
    1644
  • Abstract
    To evaluate the performance of a speech recognition system, large databases of labeled speech, including various speakers, noise conditions, and vocabularies, are necessary. This paper describes a method for automatically labeling speech data. In the past, speech has been labeled manually, typically by listening to and viewing waveforms through real-time, interactive computer I/O stations. This process is slow and tedious, and accounts for the shortage of large speech databases. The automatic labeling method reported here uses dynamic programming to align a script which is produced as the output of a recognition system, and a known script. The alignment gives a tentative labeling which can be refined by repeating the training, recognition, and alignment processes. The method was used to label a 50 speaker database of 140,000 digits.
  • Keywords
    Databases; Dynamic programming; Humans; Labeling; Speech analysis; Speech enhancement; Speech processing; Speech recognition; Testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1982.1171490
  • Filename
    1171490