• DocumentCode
    3038043
  • Title

    Automatic thresholding for voicing detection algorithms

  • Author

    Neuburg, Edward P.

  • Author_Institution
    Department of Defense, Meade, Md
  • Volume
    4
  • fYear
    1979
  • fDate
    28946
  • Firstpage
    756
  • Lastpage
    758
  • Abstract
    Automatic voicing-decision algorithms depend on thresholds which are dependent on speaker, channel, S/N ratio, etc. Low-frequency energy (LFE) is one of the best voicing statistics when properly thresholded; it is even better if two thresholds are set, one for onset of voicing and one for offset. Two schemes are proposed for adaptive, estimation of thresholds. The first is finding stretches that are "surely" voiced or unvoiced, finding boundaries by heuristic algorithms, and setting thresholds consistent with these boundaries, in the second, one finds segments that are "surely" voiced or unvoiced according to voicing statistics other than LFE, using these to form estimates of the distribution of LFE in voiced and unvoiced cases. Both schemes successfully determine speaker-dependent thresholds in about 15 seconds, during which "standard" thresholds can be used. Overall voicing error rate using LFE with adaptive thresholds is about 1%.
  • Keywords
    Adaptive estimation; Ash; Detection algorithms; Energy measurement; Error analysis; Frequency estimation; Solids; Speech; Statistical distributions; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '79.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1979.1170782
  • Filename
    1170782