DocumentCode
162623
Title
Design and implementation of a music composition application using speech recognition
Author
Parra, Rodrigo ; Ramirez, J. ; Abente Lahaye, Martin
Author_Institution
Fac. Politec., Univ. Nac. de Asuncion, San Lorenzo, Paraguay
fYear
2014
fDate
15-19 Sept. 2014
Firstpage
1
Lastpage
12
Abstract
This paper describes speech recognition as a research area, by studying its application to a non trivial problem such as music composition. The design and implementation process of TamTam Listens, a voice-user interface for music composition, is described. This application was developed using open-source tools, like the speech recognition engine PocketSphinx, the acoustic models provided by Voxforge and the Sugar learning platform. Finally, the results of a usability test performed with TamTam Listens are exposed. These results allow drawing conclusions about speech recognition applicability to user interfaces, and offering recomendations for their design and implementation.
Keywords
acoustic signal processing; music; public domain software; speech recognition; speech-based user interfaces; PocketSphinx; Sugar learning platform; TamTam Listens; Voxforge; acoustic models; music composition application; open-source tools; speech recognition applicability; speech recognition engine; usability test; voice-user interface; Computational modeling; Electronic mail; Google; Hidden Markov models; Monitoring; Speech recognition; Sugar; Speech recognition; accessibility; e-Education; open-source; usability;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing Conference (CLEI), 2014 XL Latin American
Conference_Location
Montevideo
Type
conf
DOI
10.1109/CLEI.2014.6965110
Filename
6965110
Link To Document