• DocumentCode
    2403579
  • Title

    Time-frequency processing of partials for high-quality speech synthesis

  • Author

    Ciobanu, Amelia ; Negrescu, Cristian ; Stanomir, Dumitru ; Burileanu, Dragos

  • Author_Institution
    Telecommun. & Inf. Technol., Univ. Politeh. of Bucharest, Bucharest, Romania
  • fYear
    2009
  • fDate
    18-21 June 2009
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Based on the particularities offered by the chosen signal model, we introduce a novel approach regarding the chain of actions pursued in the analysis stage of the speech signal, which succeeds the level of partial extraction. According to the harmonic plus noise model (HNM) a number of successive estimation and synthesis operations are performed. The present paper proposes a method to enhance the harmonic parameters estimation. This new algorithm proves to have good behavior offering support in selecting an appropriate subset of partials. In addition to reducing the arithmetic complexity of the harmonic synthesis (which is known to be the most resource consuming module), this optimized selection of the partials allows us to perform a specific frequency correction, which enables the possibility of a simple and coherent future pitch manipulation. The performed experiments confirmed the expected complexity reduction. Moreover, we applied the proposed algorithm for partial selection and tracking followed by a frequency aligning of the harmonic components. The reconstructed signal compared to the original speech proved to be a perceptually indistinguishable replica.
  • Keywords
    feature extraction; parameter estimation; speech enhancement; speech synthesis; time-frequency analysis; tracking; frequency alignment; harmonic plus noise model; parameters estimation; partial extraction; speech enhancement; speech synthesis; time-frequency processing; tracking procedure; Frequency estimation; Information technology; Signal analysis; Signal processing; Signal synthesis; Speech analysis; Speech enhancement; Speech processing; Speech synthesis; Time frequency analysis; HNM model; partial tracking; speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech Technology and Human-Computer Dialogue, 2009. SpeD '09. Proceedings of the 5-th Conference on
  • Conference_Location
    Constant
  • Print_ISBN
    978-1-4244-4727-5
  • Type

    conf

  • DOI
    10.1109/SPED.2009.5156175
  • Filename
    5156175