• DocumentCode
    454707
  • Title

    Chinese Dialect Identification Using Tone Features Based on Pitch Flux

  • Author

    Ma, Bin ; Zhu, Donglai ; Tong, Rong

  • Author_Institution
    Inst. for Infocomm Res.
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    This paper presents a method to extract tone relevant features based on pitch flux from continuous speech signal. The autocorrelations of two adjacent frames are calculated and the covariance between them is estimated to extract multi-dimensional pitch flux features. These features, together with MFCCs, are modeled in a 2-stream GMM models, and are tested in a 3-dialect identification task for Chinese. The pitch flux features have shown to be very effective in identifying tonal languages with short speech segments. For the test speech segments of 3 seconds, 2-stream model achieves more than 30% error reduction over MFCC-based model
  • Keywords
    Gaussian processes; natural languages; speech processing; speech recognition; Chinese dialect identification; GMM models; continuous speech signal; multidimensional pitch flux features; speech segments; tonal languages; tone features; Autocorrelation; Computer errors; Data mining; Feature extraction; Natural languages; Speaker recognition; Speech processing; Speech recognition; Testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660199
  • Filename
    1660199