Design and implementation of a music composition application using speech recognition

Author

Parra, Rodrigo ; Ramirez, J. ; Abente Lahaye, Martin

Author_Institution

Fac. Politec., Univ. Nac. de Asuncion, San Lorenzo, Paraguay

fYear

2014

fDate

15-19 Sept. 2014

Firstpage

1

Lastpage

12

Abstract

This paper examines speech recognition as a research area by studying its application to a nontrivial problem: music composition. It describes the design and implementation of TamTam Listens, a voice-user interface for music composition. The application was developed using open-source tools, including the PocketSphinx speech recognition engine, the acoustic models provided by Voxforge, and the Sugar learning platform. Finally, the results of a usability test performed with TamTam Listens are presented. These results support conclusions about the applicability of speech recognition to user interfaces, as well as recommendations for their design and implementation.

Keywords

acoustic signal processing; music; public domain software; speech recognition; speech-based user interfaces; PocketSphinx; Sugar learning platform; TamTam Listens; Voxforge; acoustic models; music composition application; open-source tools; speech recognition applicability; speech recognition engine; usability test; voice-user interface; Computational modeling; Electronic mail; Google; Hidden Markov models; Monitoring; Sugar; accessibility; e-Education; open-source; usability

fLanguage

English

Publisher

IEEE

Conference_Titel

Computing Conference (CLEI), 2014 XL Latin American

Conference_Location

Montevideo

Type

conf

DOI

10.1109/CLEI.2014.6965110

Filename

6965110