• DocumentCode
    2013281
  • Title

    Bibliographic Meta-Data Extraction Using Probabilistic Finite State Transducers

  • Author

    Krämer, Martin ; Kaprykowsky, Hagen ; Keysers, Daniel ; Breuel, Thomas

  • Author_Institution
    German Res. Center for Artificial Intelligence, Kaiserslautern
  • Volume
    2
  • fYear
    2007
  • fDate
    23-26 Sept. 2007
  • Firstpage
    609
  • Lastpage
    613
  • Abstract
    We present the application of probabilistic finite state transducers to the task of bibliographic meta-data extraction from scientific references. By using the transducer approach, which is often applied successfully in computational linguistics, we obtain a trainable and modular framework. This results in simplicity, flexibility, and easy adaptability to changing requirements. An evaluation on the Cora dataset that serves as a common benchmark for accuracy measurements yields a word accuracy of 88.5%, afield accuracy of 82.6%, and an instance accuracy of 42.7%. Based on a comparison to other published results, we conclude that our system performs second best on the given data set using a conceptually simple approach and implementation.
  • Keywords
    bibliographic systems; computational linguistics; learning (artificial intelligence); meta data; Cora dataset; bibliographic meta-data extraction; computational linguistics; probabilistic finite state transducers; scientific references; Artificial intelligence; Computational linguistics; Data mining; Hidden Markov models; Knowledge based systems; Knowledge management; Machine learning; Project management; Search engines; Transducers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
  • Conference_Location
    Parana
  • ISSN
    1520-5363
  • Print_ISBN
    978-0-7695-2822-9
  • Type

    conf

  • DOI
    10.1109/ICDAR.2007.4376987
  • Filename
    4376987