DocumentCode
548563
Title
LVCSR Speech Database - JURISDIC
Author
Demenko, Graiyna ; Grocholewski, Stefan ; Klessa, Katarzyna ; Ogorkiewicz, Jerzy ; Wagner, Agnieszka ; Lange, Marek ; Sledzinski, Daniel ; Cylwik, Natalia
Author_Institution
Inst. of Linguistics, Adam Mickiewicz Univ., Poznań, Poland
fYear
2008
fDate
25-27 Sept. 2008
Firstpage
67
Lastpage
72
Abstract
In the paper an overview of the Polish Speech Database for taking dictation of legal texts, created for the purpose of LVCSR system for Polish in the frame of Polish Platform for Homeland Security (PPBW) is presented. Basic information about the design of the database is provided as well as the applied method of the text corpora construction and the database structure. Fundamental details on the recording conditions and equipment are specified, followed by the description of the assessment methodology of recording quality, and the annotation specification and evaluation. Moreover, the paper contains the information about both the ongoing and planned stages of the database development process.
Keywords
speech recognition; JURISDIC; LVCSR speech database; PPBW; Polish Platform for Homeland Security; database development process; database structure; text corpora construction; Databases; Java; Programming; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Algorithms, Architectures, Arrangements, and Applications (SPA), 2008
Conference_Location
Poznan
Print_ISBN
978-1-4577-1660-7
Electronic_ISBN
978-83-62065-05-9
Type
conf
Filename
5967591
Link To Document