DocumentCode :
162623
Title :
Design and implementation of a music composition application using speech recognition
Author :
Parra, Rodrigo ; Ramirez, J. ; Abente Lahaye, Martin
Author_Institution :
Fac. Politec., Univ. Nac. de Asuncion, San Lorenzo, Paraguay
fYear :
2014
fDate :
15-19 Sept. 2014
Firstpage :
1
Lastpage :
12
Abstract :
This paper examines speech recognition as a research area by studying its application to a non-trivial problem: music composition. It describes the design and implementation of TamTam Listens, a voice-user interface for music composition. The application was developed with open-source tools, including the speech recognition engine PocketSphinx, the acoustic models provided by Voxforge, and the Sugar learning platform. Finally, the results of a usability test performed with TamTam Listens are presented. These results support conclusions about the applicability of speech recognition to user interfaces and yield recommendations for their design and implementation.
Keywords :
acoustic signal processing; music; public domain software; speech recognition; speech-based user interfaces; PocketSphinx; Sugar learning platform; TamTam Listens; Voxforge; acoustic models; music composition application; open-source tools; speech recognition applicability; speech recognition engine; usability test; voice-user interface; Computational modeling; Electronic mail; Google; Hidden Markov models; Monitoring; Speech recognition; Sugar; Speech recognition; accessibility; e-Education; open-source; usability;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computing Conference (CLEI), 2014 XL Latin American
Conference_Location :
Montevideo
Type :
conf
DOI :
10.1109/CLEI.2014.6965110
Filename :
6965110