• DocumentCode
    1769284
  • Title

    Open domain continuous filipino speech recognition with code-switching

  • Author

    Ang, Federico ; Miyanaga, Yoshikazu ; Guevara, Rowena Cristina ; Cajote, Rhandley ; Bayona, Michael Gringo Angelo

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Hokkaido Univ., Sapporo, Japan
  • fYear
    2014
  • fDate
    1-5 June 2014
  • Firstpage
    2301
  • Lastpage
    2304
  • Abstract
    It is widely known that database quality has a huge impact on speech recognition system performance, most especially when the expected domain is well represented. In this paper, we use this idea as leverage for a data-driven solution to the problem of code-switching in Filipino. Practical Filipino conversations often contain English and other loan words in varying frequencies, demanding better training of parameters and models for its speech recognition system. We alleviate the underrepresentation of loan words through the development of a new speech database for training, and applied appropriate data analysis to make reliable evaluation results. The best system was searched via lattice rescoring from a cross-validation set containing almost three hours of unknown speech data. The description and results of our experiments serve as a new and competent baseline model for succeeding future developments.
  • Keywords
    audio databases; natural language processing; speech recognition; code switching; competent baseline model; data analysis; data driven solution; database quality; open domain continuous Filipino speech recognition; practical Filipino conversations; speech database; speech recognition system performance; unknown speech data; Acoustics; Databases; Decoding; Hidden Markov models; Speech; Speech recognition; Training; Filipino speech; HMM; automatic speech recognition; code-switching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems (ISCAS), 2014 IEEE International Symposium on
  • Conference_Location
    Melbourne VIC
  • Print_ISBN
    978-1-4799-3431-7
  • Type

    conf

  • DOI
    10.1109/ISCAS.2014.6865631
  • Filename
    6865631