• DocumentCode
    2867207
  • Title
    A Nominal Filter for Web Search Snippets: Using the Web to Identify Members of Latin America's Highly Qualified Diaspora

  • Author
    García-Flores, Jorge ; Turner, William

  • Author_Institution
    LIMSI, Orsay, France
  • fYear
    2011
  • fDate
    Nov. 26 2011-Dec. 4 2011
  • Firstpage
    45
  • Lastpage
    50
  • Abstract
    This paper presents efforts aimed at using Natural Language Engineering (NLE) techniques for evaluating the impact of talent mobility on the development of three Latin American countries: Argentina, Colombia and Uruguay. We explain the different steps of a research program aimed at carrying out what we call a Mobility Trace Extraction Task. The first step enriches traditional person name disambiguation queries with Social Identity Markers (SIMs) extracted from the bibliographic records of the Web of Science database. The coherence of the snippets retrieved using this SIM-enriched Web search strategy is automatically verified by applying nominal filters based on a context-free grammar to eliminate those snippets which do not respect valid variations of personal names. Finally, the filtered results are ordered and presented to social scientists in a way which allows them to decide whether to contact and interview a person. Our goal is to produce a computer-supported infrastructure for doing this type of sociological research using data from the Web.
  • Keywords
    Web sites; bibliographies; context-free grammars; information filters; natural language processing; social sciences computing; Argentina; Colombia; Latin America highly qualified Diaspora; Latin American countries; SIM-enriched Web search strategy; Uruguay; Web search snippets; bibliographic records; context-free grammar; disambiguation queries; member identification; mobility trace extraction task; natural language engineering technique; nominal filter; social identity marker identification; Coherence; Context; Data mining; Databases; Grammar; Organizations; Web search; computer assisted sociology; highly skilled diaspora; mobility; person disambiguation; semantic filtering; web people search;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Artificial Intelligence (MICAI), 2011 10th Mexican International Conference on
  • Conference_Location
    Puebla
  • Print_ISBN
    978-1-4577-2173-1
  • Type
    conf
  • DOI
    10.1109/MICAI.2011.24
  • Filename
    6118984