• DocumentCode
    2702485
  • Title

    LSM-Based Unit Pruning for Concatenative Speech Synthesis

  • Author

    Bellegarda, J.R.

  • Author_Institution
    Speech & Language Technol., Apple Comput. Inc., Cupertino, CA, USA
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    The level of quality that can be achieved in concatenative text-to-speech synthesis is primarily governed by the inventory of units used in unit selection. This has led to the collection of ever larger corpora in the quest for ever more natural synthetic speech. As operational considerations limit the size of the unit inventory, however, pruning is critical to removing any instances that prove either spurious or superfluous. This paper proposes a novel pruning strategy based on a data-driven feature extraction framework separately optimized for each unit type in the inventory. A single distinctiveness/redundancy measure can then address, in a consistent manner, the (traditionally separate) problems of outliers and redundant units. Experimental results underscore the viability of this approach for both moderate and aggressive inventory pruning.
  • Keywords
    feature extraction; speech synthesis; LSM-based unit pruning; concatenative text-to-speech synthesis; data-driven feature extraction framework; inventory pruning; latent semantic mapping; natural synthetic speech; Character generation; Cost function; Databases; Degradation; Feature extraction; Inventory management; Natural languages; Signal generators; Signal synthesis; Speech synthesis; Text-to-speech synthesis; inventoru pruning; outlier removal; unit redundancy management; unit selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366964
  • Filename
    4218152