• DocumentCode
    672829
  • Title

    Semi-automatic syllable-like segmentation for Hindi

  • Author

    Balyan, Archana ; Agrawal, S.S. ; Dev, Amita ; Kumari, Ratnesh

  • Author_Institution
    Dept. of ECE, MSIT, New Delhi, India
  • fYear
    2013
  • fDate
    25-27 Nov. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    The goal of this study is automatic segmentation of speech at syllable level and also that the reasonable number of syllables may suffice the need for travel domain applications. This paper presents study of algorithm for identifying syllables based on linguistic rules in Hindi words. After survey of the relevant literature, a set of rules are identified and implemented as a simple easy-to-implement algorithm. The algorithm is tested on 2400 distinct words and algorithm performs with 99.5% accuracy for segmentation of written text. A baseline group delay based segmentation technique is applied on spoken speech sentences to generate labeled database at syllable level. The system is validated against a few manually segmented speech utterances. It is observed that vowels are more accurately segmented as compared to fricatives. It is seen that nearly accurate segmentation is achieved if the window scale factor is modified for each sentence.
  • Keywords
    natural languages; speech processing; text analysis; Hindi words; automatic speech segmentation; baseline group delay based segmentation technique; fricatives; labeled database generation; linguistic rules; semiautomatic syllable-like segmentation; spoken speech sentences; travel domain applications; vowels; window scale factor; written text segmentation; Databases; Delays; Histograms; Labeling; Manuals; Pragmatics; Speech; group-delay; phonology; segmentation; syllable;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
  • Conference_Location
    Gurgaon
  • Type

    conf

  • DOI
    10.1109/ICSDA.2013.6709854
  • Filename
    6709854