• DocumentCode
    1696329
  • Title

    Person name recognition in ASR outputs using continuous context models

  • Author

    Bigot, Benjamin ; Senay, Gregory ; Linares, Georges ; Fredouille, Corinne ; Dufour, Richard

  • Author_Institution
    LIA, Univ. of Avignon, Avignon, France
  • fYear
    2013
  • Firstpage
    8470
  • Lastpage
    8474
  • Abstract
    The detection and characterization, in audiovisual documents, of speech utterances where person names are pronounced, is an important cue for spoken content analysis. This paper tackles the problematic of retrieving spoken person names in the 1-Best ASR outputs of broadcast TV shows. Our assumption is that a person name is a latent variable produced by the lexical context it appears in. Thereby, a spoken name could be derived from ASR outputs even if it has not been proposed by the speech recognition system. A new context modelling is proposed in order to capture lexical and structural information surrounding a spoken name. The fundamental hypothesis of this study has been validated on broadcast TV documents available in the context of the REPERE challenge.
  • Keywords
    document handling; information retrieval; speech recognition; television broadcasting; 1-Best ASR outputs; REPERE challenge; audiovisual documents; broadcast TV documents; broadcast TV shows; context modelling; continuous context models; lexical context; person name recognition; speech recognition system; speech utterances; spoken content analysis; spoken person names retrieval; Context; Context modeling; Speech; Speech recognition; Support vector machines; TV; Vectors; lexical context representation; spoken document retrieval; spoken name detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639318
  • Filename
    6639318