DocumentCode
2126102
Title
Polish N-Grams and Their Correction Process
Author
Ziólko, Bartosz ; Skurzok, Dawid ; Michalska, Malgorzata
Author_Institution
Dept. of Electron., AGH Univ. of Sci. & Technol., Kraków, Poland
fYear
2010
fDate
11-13 Aug. 2010
Firstpage
1
Lastpage
5
Abstract
Word n-gram statistics collected from over 1 300 000 000 words are presented. Even though they were collected from various good sources, they contain several types of errors. The paper focuses on the process of partly supervised correction of the n-grams. The types of errors are described, as well as our software, which allows efficient and fast corrections.
Keywords
software engineering; speech recognition; Polish language; supervised correction; word n-gram statistic; Dictionaries; Electronic publishing; Encyclopedias; Internet; Software; Speech recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia and Ubiquitous Engineering (MUE), 2010 4th International Conference on
Conference_Location
Cebu
Print_ISBN
978-1-4244-7563-6
Type
conf
DOI
10.1109/MUE.2010.5575068
Filename
5575068