DocumentCode :
24195
Title :
Text Search of Surnames in Some Slavic and Other Morphologically Rich Languages Using Rule Based Phonetic Algorithms
Author :
Zahoransky, Dusan ; Polasek, Ivan
Author_Institution :
Software Eng., Slovak Univ. of Technol. (FIIT STU), Bratislava, Slovakia
Volume :
23
Issue :
3
fYear :
2015
fDate :
Mar-15
Firstpage :
553
Lastpage :
563
Abstract :
Surnames play a key role as person natural identifiers, essentially in present information systems. This paper deals with the topic of optimizing a phonetic search algorithm as a string matching of surnames usable for communications service providers, person registries, social networks or genealogy databases. It describes a proposed solution for the phonetic searching of Slovak and (territorial) neighboring languages (Czech, Polish, Ukrainian, Russian, German, Hungarian, Jewish) surnames. This solution was designed to improve search precision and recall when searching for people by their surnames originating in these languages.
Keywords :
information retrieval; natural language processing; Czech language; German language; Hungarian language; Jewish language; Polish language; Russian language; Slovak language; Ukrainian language; communications service providers; genealogy databases; information systems; morphologically rich language; natural identifiers; person registries; phonetic search algorithm; rule based phonetic algorithm; search precision; search recall; slavic language; social networks; surname string matching; surname text search; Algorithm design and analysis; Databases; Europe; IEEE transactions; Materials; Speech; Speech processing; Algorithms; information retrieval; natural languages;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2015.2393393
Filename :
7012074
Link To Document :
بازگشت