• DocumentCode
    2881058
  • Title

    A new speaker change detection method for two-speaker segmentation

  • Author

    Adam, Andre G. ; Kajarekar, Sachin S. ; Hermansky, Hynek

  • Author_Institution
    OGI School of Science and Engineering, Oregon Health and Science University, Portland, USA
  • Volume
    4
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    In absence of prior information about speakers, an important step in speaker segmentation is to obtain initial estimates for training speaker models. In this paper, we present a new method for obtaining these estimates. The method assumes that a conversation must be initiated by one of the speakers. Thus one speaker model is estimated from the small segment at the beginning of the conversation and the segment that has the largest distance from the initial segment is used to train second speaker model. We describe a system based on this method and evaluate it on two different tasks: a controlled task with variations in the duration of the initial speaker segment and amount of overlapped speech and 2001 NIST Speaker Recognition Evaluation task that contains natural conversations. This system shows significant improvements over the conventional system in absence of overlapped speech on the controlled task.
  • Keywords
    Bayesian methods; Cepstral analysis; Computational modeling; Databases; NIST; Silicon; Switches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5745511
  • Filename
    5745511