• DocumentCode
    548563
  • Title

    LVCSR Speech Database - JURISDIC

  • Author

    Demenko, Graiyna ; Grocholewski, Stefan ; Klessa, Katarzyna ; Ogorkiewicz, Jerzy ; Wagner, Agnieszka ; Lange, Marek ; Sledzinski, Daniel ; Cylwik, Natalia

  • Author_Institution
    Inst. of Linguistics, Adam Mickiewicz Univ., Poznań, Poland
  • fYear
    2008
  • fDate
    25-27 Sept. 2008
  • Firstpage
    67
  • Lastpage
    72
  • Abstract
    In the paper an overview of the Polish Speech Database for taking dictation of legal texts, created for the purpose of LVCSR system for Polish in the frame of Polish Platform for Homeland Security (PPBW) is presented. Basic information about the design of the database is provided as well as the applied method of the text corpora construction and the database structure. Fundamental details on the recording conditions and equipment are specified, followed by the description of the assessment methodology of recording quality, and the annotation specification and evaluation. Moreover, the paper contains the information about both the ongoing and planned stages of the database development process.
  • Keywords
    speech recognition; JURISDIC; LVCSR speech database; PPBW; Polish Platform for Homeland Security; database development process; database structure; text corpora construction; Databases; Java; Programming; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Algorithms, Architectures, Arrangements, and Applications (SPA), 2008
  • Conference_Location
    Poznan
  • Print_ISBN
    978-1-4577-1660-7
  • Electronic_ISBN
    978-83-62065-05-9
  • Type

    conf

  • Filename
    5967591