• DocumentCode
    3531183
  • Title

    Syllable nucleus Durations Estimation using Linear Regression based ensemble model

  • Author

    Lu, Jingli ; Wang, Rulii ; De Silva, Liyanage C. ; Gao, Yang

  • Author_Institution
    Sch. of Eng. & Adv. Technol., Massey Univ., Palmerston North
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4849
  • Lastpage
    4852
  • Abstract
    Unlike conventional automatic continuous speech segmentation models that deal with each boundary time-mark individually, in this paper, we propose an interval-data-based linear regression model for syllable nucleus durations estimation (LRM-DE), which treats syllable boundary time-marks in pairs. This characteristic of LRM-DE makes it more suitable for estimating syllable durations for English sentences, which can be used for sentence stress detection. LRM-DE combines the outcomes of multiple base automatic speech segmentation machines (ASMs) to generate final boundary time-marks that miminize the average distance of the predicted and reference boundary-pairs of syllable nuclei. Experimental results show that on TIMIT dataset, LRM-DE reduces the average difference between the predicted syllable nucleus durations and their reference ones from 13.64 ms (the best result of a single ASM) to 11.81 ms. Also, LRM-DE improves the syllable nucleus segmentation accuracy from 81.59% to 83.98% within a tolerance of 20 ms.
  • Keywords
    natural language processing; regression analysis; speech processing; English sentences; automatic continuous speech segmentation models; automatic speech segmentation machines; ensemble model; linear regression; sentence stress detection; syllable nucleus durations estimation; Linear regression; Automatic speech segmentation; ensemble model; multiple linear regression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960717
  • Filename
    4960717