• DocumentCode
    3744913
  • Title

    The MGB challenge: Evaluating multi-genre broadcast media recognition

  • Author

    P Bell;M J F Gales;T Hain;J Kilgour;P Lanchantin;X Liu;A McParland;S Renals;O Saz;M Wester;P C Woodland

  • Author_Institution
    Centre for Speech Technology Research, University of Edinburgh, Edinburgh EH8 9AB, UK
  • fYear
    2015
  • Firstpage
    687
  • Lastpage
    693
  • Abstract
    This paper describes the Multi-Genre Broadcast (MGB) Challenge at ASRU 2015, an evaluation focused on speech recognition, speaker diarization, and "lightly supervised" alignment of BBC TV recordings. The challenge training data covered the whole range of seven weeks BBC TV output across four channels, resulting in about 1,600 hours of broadcast audio. In addition several hundred million words of BBC subtitle text was provided for language modelling. A novel aspect of the evaluation was the exploration of speech recognition and speaker diarization in a longitudinal setting - i.e. recognition of several episodes of the same show, and speaker diarization across these episodes, linking speakers. The longitudinal tasks also offered the opportunity for systems to make use of supplied metadata including show title, genre tag, and date/time of transmission. This paper describes the task data and evaluation process used in the MGB challenge, and summarises the results obtained.
  • Keywords
    "Training data","Speech recognition","Training","Metadata","Speech","TV"
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on
  • Type

    conf

  • DOI
    10.1109/ASRU.2015.7404863
  • Filename
    7404863