• DocumentCode
    2126102
  • Title

    Polish N-Grams and Their Correction Process

  • Author

    Ziólko, Bartosz ; Skurzok, Dawid ; Michalska, Malgorzata

  • Author_Institution
    Dept. of Electron., AGH Univ. of Sci. & Technol., Kraków, Poland
  • fYear
    2010
  • fDate
    11-13 Aug. 2010
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Word n-gram statistics collected from over 1 300 000 000 words are presented. Eventhough they were collected from various good sources, they contain several types of errors. The paper focuses on the process of partly supervised correction of the n- grams. Types of errors are described as well as our software allowing efficient and fast corrections.
  • Keywords
    software engineering; speech recognition; Polish language; supervised correction; word n-gram statistic; Dictionaries; Electronic publishing; Encyclopedias; Internet; Software; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Ubiquitous Engineering (MUE), 2010 4th International Conference on
  • Conference_Location
    Cebu
  • Print_ISBN
    978-1-4244-7563-6
  • Type

    conf

  • DOI
    10.1109/MUE.2010.5575068
  • Filename
    5575068