DocumentCode
24195
Title
Text Search of Surnames in Some Slavic and Other Morphologically Rich Languages Using Rule Based Phonetic Algorithms
Author
Zahoransky, Dusan ; Polasek, Ivan
Author_Institution
Software Eng., Slovak Univ. of Technol. (FIIT STU), Bratislava, Slovakia
Volume
23
Issue
3
fYear
2015
fDate
Mar-15
Firstpage
553
Lastpage
563
Abstract
Surnames play a key role as person natural identifiers, essentially in present information systems. This paper deals with the topic of optimizing a phonetic search algorithm as a string matching of surnames usable for communications service providers, person registries, social networks or genealogy databases. It describes a proposed solution for the phonetic searching of Slovak and (territorial) neighboring languages (Czech, Polish, Ukrainian, Russian, German, Hungarian, Jewish) surnames. This solution was designed to improve search precision and recall when searching for people by their surnames originating in these languages.
Keywords
information retrieval; natural language processing; Czech language; German language; Hungarian language; Jewish language; Polish language; Russian language; Slovak language; Ukrainian language; communications service providers; genealogy databases; information systems; morphologically rich language; natural identifiers; person registries; phonetic search algorithm; rule based phonetic algorithm; search precision; search recall; slavic language; social networks; surname string matching; surname text search; Algorithm design and analysis; Databases; Europe; IEEE transactions; Materials; Speech; Speech processing; Algorithms; information retrieval; natural languages;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher
ieee
ISSN
2329-9290
Type
jour
DOI
10.1109/TASLP.2015.2393393
Filename
7012074
Link To Document