• DocumentCode
    310549
  • Title

    A Chinese text-to-speech system based on part-of-speech analysis, prosodic modeling and non-uniform units

  • Author

    Chou, Fu-chiang ; Tseng, Chiu-Yu ; Chen, Keh-Jiann ; Lee, Lin-shan

  • Author_Institution
    Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    923
  • Abstract
    This paper presents a new Chinese text-to-speech system that produces very natural and intelligible synthetic Mandarin speech based on part-of-speech analysis, prosodic modeling and non-uniform units. The distinguishing features and key technology for the system can be summarized as follows. (1) A text analysis module for word identification and tagging was developed based on part-of-speech modeling and using heuristic rules to achieve very high accuracy. (2) The required prosodic parameters for the synthetic speech are derived from a two-stage procedure. The prosodic structures of the input texts are first derived from a statistical model trained by a large speech database, and the prosodic parameters are then determined according to the structures. (3) A specially designed speech segments inventory constructed with non-uniform and pitch dependent units is used to improve the fluency and intelligibility of the system
  • Keywords
    natural languages; speech intelligibility; speech processing; speech synthesis; statistical analysis; Chinese text to speech system; heuristic rules; input texts; intelligible synthetic Mandarin speech; large speech database; natural synthetic Mandarin speech; nonuniform units; part of speech analysis; part of speech modeling; pitch dependent units; prosodic modeling; prosodic parameters; prosodic structures; speech segments inventory; statistical model; synthetic speech; system fluency; system intelligibility; text analysis module; word identification; word tagging; Application software; Databases; History; Information analysis; Information science; Microcomputers; Speech analysis; Speech synthesis; Tagging; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596087
  • Filename
    596087