• DocumentCode
    2144459
  • Title

    HAMEX - A Handwritten and Audio Dataset of Mathematical Expressions

  • Author

    Quiniou, Solen ; Mouchére, Harold ; Saldarriaga, Sebastián Peña ; Viard-Gaud, Christian ; Morin, Emmanuel ; Petitrenaud, Simon ; Medjkoune, Sofiane

  • Author_Institution
    LINA, Univ. de Nantes, Nantes, France
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    452
  • Lastpage
    456
  • Abstract
    In this paper, we present HAMEX, a new public dataset that contains mathematical expressions available in their on-line handwritten form and in their audio spoken form. We have designed this dataset so that, given a mathematical expression, its handwritten signal and its audio signal can be used jointly to design multimodal recognition systems. Here, we describe the different steps that allowed us to acquire this dataset, from the creation of the mathematical expression corpora (including expressions from Wikipedia pages) to the segmentation and the transcription of the collected data, via the data collection process itself. Currently, the dataset contains 4 350 on-line handwritten mathematical expressions written by 58 writers, and the corresponding audio expressions (in French) spoken by 58 speakers. The ground truth is also provided both for the handwritten expressions (as INKML files with the digital ink, the symbol segmentation, and the MATHML structure) and for the audio expressions (as XML files with the transcriptions of the spoken expressions).
  • Keywords
    Web sites; XML; audio signal processing; handwriting recognition; image segmentation; HAMEX; InkML files; MathML structure; Wikipedia pages; XML flies; audio dataset; audio signal; collected data segmentation; collected data transcription; digital ink; handwritten signal; mathematical expression corpora; multimodal recognition system; online handwritten dataset; online handwritten mathematical expression; public dataset; spoken expression; symbol segmentation; Calculators; Encyclopedias; Handwriting recognition; Ink; Internet; Speech; Vocabulary; dataset; handwriting recognition; mathematical expressions; multimodality; speech recognitio;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2011.97
  • Filename
    6065352