• DocumentCode
    591918
  • Title

    Automatic detection and correction of syntax-based prosody annotation errors

  • Author

    Brognaux, S. ; Drugman, Thomas ; Beaufort, R.

  • Author_Institution
    ICTEAM, Univ. Catholique de Louvain, Louvain-la-Neuve, Belgium
  • fYear
    2012
  • fDate
    2-5 Dec. 2012
  • Firstpage
    410
  • Lastpage
    415
  • Abstract
    Both unit-selection and HMM-based speech synthesis require large annotated speech corpora. To generate more natural speech, considering the prosodic nature of each phoneme of the corpus is crucial. Generally, phonemes are assigned labels which should reflect their suprasegmental characteristics. Labels often result from an automatic syntactic analysis, without checking the acoustic realization of the phoneme in the corpus. This leads to numerous errors because syntax and prosody do not always coincide. This paper proposes a method to reduce the amount of labeling errors, using acoustic information. It is applicable as a post-process to any syntax-driven prosody labeling. Acoustic features are considered, to check the syntax-based labels and suggest potential modifications. The proposed technique has the advantage of not requiring a manually prosody-labelled corpus. The evaluation on a corpus in French shows that more than 75% of the errors detected by the method are effective errors which must be corrected.
  • Keywords
    acoustic signal processing; hidden Markov models; natural language processing; speech synthesis; French corpus; HMM-based speech synthesis; acoustic features; acoustic information; automatic syntactic analysis; labeling error reduction; phonemes; speech corpora annotation; suprasegmental characteristics; syntax-based labels; syntax-based prosody annotation error automatic detection; syntax-based prosody annotation error correction; syntax-driven prosody labeling; unit-selection; Acoustics; Decision trees; Feature extraction; Labeling; Niobium; Speech; Syntactics; Annotation; Corpus; Prosody; Speech Synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2012 IEEE
  • Conference_Location
    Miami, FL
  • Print_ISBN
    978-1-4673-5125-6
  • Electronic_ISBN
    978-1-4673-5124-9
  • Type

    conf

  • DOI
    10.1109/SLT.2012.6424259
  • Filename
    6424259