• DocumentCode
    2752069
  • Title

    Signal energy-based automatic speech splitter: a tool for developing speech corpus

  • Author

    Suyanto

  • Author_Institution
    Telkom Sch. of Eng. (STT Telkom) Jl, Bandung
  • fYear
    2007
  • fDate
    Oct. 30 2007-Nov. 2 2007
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper describes a tool called the automatic speech splitter (ASS), used in developing speech corpora. Two problems in developing a large speech corpus are the recording process and file renaming: recording sentences one at a time and storing each resulting utterance under its own file name is time-consuming. The proposed solution is to record n sentences in a single recording with a certain delay between consecutive sentences, split the result into n utterances automatically using ASS, and then store the n utterances under the n desired file names. The basic idea of ASS is to classify the input speech into silence and speech segments according to its short-term signal energy rate (SER). ASS successfully split a long recording containing n sentences into exactly n one-sentence utterances with a maximum error of 50 ms, which is acceptable because the tolerance is 100 ms.
  • Keywords
    speech processing; automatic speech splitter; signal energy rate; speech corpus; Costs; Delay effects; Humans; Informatics; Java; Microphones; Power engineering and energy; Software tools
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    TENCON 2007 - 2007 IEEE Region 10 Conference
  • Conference_Location
    Taipei
  • Print_ISBN
    978-1-4244-1272-3
  • Electronic_ISBN
    978-1-4244-1272-3
  • Type

    conf

  • DOI
    10.1109/TENCON.2007.4428892
  • Filename
    4428892
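
The abstract describes classifying a recording into silence and speech segments by short-term signal energy, then cutting at the silences. The following is a minimal sketch of that general idea, not the paper's implementation: the abstract does not give the frame length, the energy threshold, or the minimum pause length, so `frame_ms`, `threshold`, and `min_silence_ms` are assumed values, and the function names `frame_energies` and `split_by_energy` are hypothetical.

```python
from typing import List, Sequence, Tuple


def frame_energies(samples: Sequence[float], frame_len: int) -> List[float]:
    """Mean squared amplitude of each non-overlapping frame."""
    energies = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame) / len(frame))
    return energies


def split_by_energy(
    samples: Sequence[float],
    sample_rate: int,
    frame_ms: float = 10.0,         # assumed frame length
    threshold: float = 1e-3,        # assumed energy threshold for "speech"
    min_silence_ms: float = 300.0,  # assumed minimum pause between sentences
) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of speech segments, end exclusive."""
    frame_len = max(1, int(sample_rate * frame_ms / 1000))
    min_silence_frames = max(1, int(min_silence_ms / frame_ms))
    energies = frame_energies(samples, frame_len)

    segments = []
    seg_start = None  # frame index where the current speech segment began
    silent_run = 0    # consecutive silent frames seen inside a segment
    for i, energy in enumerate(energies):
        if energy >= threshold:
            if seg_start is None:
                seg_start = i
            silent_run = 0
        elif seg_start is not None:
            silent_run += 1
            if silent_run >= min_silence_frames:
                # The pause is long enough: close the segment at the
                # first silent frame of this run.
                seg_end = i - silent_run + 1
                segments.append((seg_start * frame_len, seg_end * frame_len))
                seg_start = None
                silent_run = 0

    if seg_start is not None:
        # The recording ended while a segment was still open.
        seg_end = len(energies) - silent_run
        segments.append(
            (seg_start * frame_len, min(seg_end * frame_len, len(samples)))
        )
    return segments
```

Each returned pair could then be written to its own file under the desired name. With non-overlapping frames the boundary resolution is one frame, so the 10 ms frame assumed here sits well inside the 100 ms tolerance quoted in the abstract; short pauses below `min_silence_ms` are kept inside a segment so that brief gaps within a sentence do not trigger a split.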