• DocumentCode
    24195
  • Title
    Text Search of Surnames in Some Slavic and Other Morphologically Rich Languages Using Rule Based Phonetic Algorithms
  • Author
    Zahoransky, Dusan; Polasek, Ivan
  • Author_Institution
    Software Eng., Slovak Univ. of Technol. (FIIT STU), Bratislava, Slovakia
  • Volume
    23
  • Issue
    3
  • fYear
    2015
  • fDate
    Mar-15
  • Firstpage
    553
  • Lastpage
    563
  • Abstract
    Surnames play a key role as natural identifiers of persons, especially in present-day information systems. This paper addresses the optimization of a phonetic search algorithm for string matching of surnames, usable by communications service providers, person registries, social networks, and genealogy databases. It describes a proposed solution for the phonetic search of surnames from Slovak and geographically neighboring languages (Czech, Polish, Ukrainian, Russian, German, Hungarian, Jewish). The solution was designed to improve search precision and recall when searching for people by surnames originating in these languages.
  • Keywords
    information retrieval; natural language processing; Czech language; German language; Hungarian language; Jewish language; Polish language; Russian language; Slovak language; Ukrainian language; communications service providers; genealogy databases; information systems; morphologically rich language; natural identifiers; person registries; phonetic search algorithm; rule based phonetic algorithm; search precision; search recall; Slavic language; social networks; surname string matching; surname text search; Algorithm design and analysis; Databases; Europe; IEEE transactions; Materials; Speech; Speech processing; Algorithms; natural languages
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    IEEE
  • ISSN
    2329-9290
  • Type
    jour
  • DOI
    10.1109/TASLP.2015.2393393
  • Filename
    7012074