• DocumentCode
    1770519
  • Title

    An Automatic Speech Recognition solution with speaker identification support

  • Author

    Buzo, Andi ; Cucu, H. ; Petrica, Lucian ; Burileanu, Dragos ; Burileanu, C.

  • Author_Institution
    Speech & Dialogue Res. Lab., Univ. Politeh. of Bucharest, Bucharest, Romania
  • fYear
    2014
  • fDate
    29-31 May 2014
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Automatic Speech Recognition may suffer in terms of intelligibility if the audio recording contains heterogeneous regions of multiple speakers, music or noise. Diarization is the process of segmenting an audio file into homogeneous regions and when used in conjunction with an Automatic Speech Recognition system, it filters out the non-speech audio regions and significantly improves the intelligibility of the recognition output. In this paper, we present an integrated diarization and transcription solution for the Romanian Language. The solution implements the diarization component as a processing stage in the speech recognizer front end. The integrated system is evaluated in terms of computation efficiency and transcription intelligibility.
  • Keywords
    natural language processing; speaker recognition; Romanian Language; audio file; audio recording; automatic speech recognition solution; diarization component; speaker identification support; speech recognizer; Filtering; Hidden Markov models; Real-time systems; Speaker recognition; Speech; Speech processing; Speech recognition; automatic speech recognition; diarization; speaker recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications (COMM), 2014 10th International Conference on
  • Conference_Location
    Bucharest
  • Type

    conf

  • DOI
    10.1109/ICComm.2014.6866674
  • Filename
    6866674