• DocumentCode
    3723766
  • Title

    Development of a transformation algorithm for emotional speech signal using DWT and Adaptive Filter for a Voice Culture Training System

  • Author

    Bageshree Sathe-Pathak;Ashish Panat

  • Author_Institution
    Electronics Dept, Priyadarshini College of Engineering, Nagpur, India
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This paper develops an algorithm “Discrete Wavelet Transform with Adaptive Filter” (DWTAF) to transform Neutral speech into emotional speech like Angry, Happy or Sad and this is compared with two other emotion transformation algorithms. The other two algorithms are “Speech Transformation using Statistical Parameters and Pitch Contours” (STSPPC) and “Speech Transformation using Mel Frequency Cepstral Coefficients (MFCC) and Gaussian Mixture Model (GMM)” (STMG). STSPPC calculates statistical parameters of speech like mean and variance and they are used for transformation of emotional speech. STMG performs DTW (Dynamic Time Warping), extracts MFCC and applies a GMM for speech transformation. The proposed algorithm DWTAF, presents a novel approach to model the speech using Discrete Wavelet Transform (DWT) and emotional speech is generated by filtering the speech using an adaptive filter with Least Mean Square (LMS) algorithm. For this purpose a database of 400 English language sentences has been created with 10 sentences uttered by 10 female speakers with 4 emotions each and a comparison between the three transformation algorithms is carried out using Subjective and Objective evaluation tests. The parameters used for Objective evaluation of transformed speech are Segmental SNR (Seg SNR), Log-Likelihood Ratio (LLR) and Weighted Spectral Slope (WSS).
  • Keywords
    "Speech","Discrete wavelet transforms","Mel frequency cepstral coefficient","Adaptive filters","Databases","Training","Signal to noise ratio"
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2015 - 2015 IEEE Region 10 Conference
  • ISSN
    2159-3442
  • Print_ISBN
    978-1-4799-8639-2
  • Electronic_ISBN
    2159-3450
  • Type

    conf

  • DOI
    10.1109/TENCON.2015.7373010
  • Filename
    7373010