• DocumentCode
    162623
  • Title
    Design and implementation of a music composition application using speech recognition
  • Author
    Parra, Rodrigo ; Ramirez, J. ; Abente Lahaye, Martin
  • Author_Institution
    Fac. Politec., Univ. Nac. de Asuncion, San Lorenzo, Paraguay
  • fYear
    2014
  • fDate
    15-19 Sept. 2014
  • Firstpage
    1
  • Lastpage
    12
  • Abstract
    This paper describes speech recognition as a research area by studying its application to a non-trivial problem, music composition. It describes the design and implementation of TamTam Listens, a voice-user interface for music composition. The application was developed using open-source tools: the speech recognition engine PocketSphinx, the acoustic models provided by Voxforge, and the Sugar learning platform. Finally, the results of a usability test performed with TamTam Listens are presented. These results support conclusions about the applicability of speech recognition to user interfaces, as well as recommendations for the design and implementation of such interfaces.
  • Keywords
    acoustic signal processing; music; public domain software; speech recognition; speech-based user interfaces; PocketSphinx; Sugar learning platform; TamTam Listens; Voxforge; acoustic models; music composition application; open-source tools; speech recognition applicability; speech recognition engine; usability test; voice-user interface; Computational modeling; Electronic mail; Google; Hidden Markov models; Monitoring; Speech recognition; Sugar; Speech recognition; accessibility; e-Education; open-source; usability;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computing Conference (CLEI), 2014 XL Latin American
  • Conference_Location
    Montevideo
  • Type
    conf
  • DOI
    10.1109/CLEI.2014.6965110
  • Filename
    6965110