• DocumentCode
    3521806
  • Title

    Unsupervised speaker adaptation method based on hierarchical spectral clustering

  • Author

    Furui, Sadaoki

  • Author_Institution
    Human Interface Lab., NTT, Tokyo, Japan
  • fYear
    1989
  • fDate
    23-26 May 1989
  • Firstpage
    286
  • Abstract
    An automatic speaker adaptation method is proposed for speech recognition in which a small amount of training material of unspecified text can be used. This method is easily applicable to vector-quantization-based speech recognition systems where each word is represented as multiple sequences of codebook entries. In the adaptation algorithm, either the codebook is modified for each new speaker or input speech spectra are adapted to the codebook, thereby using codebook sequences universally for all speakers. The important feature of this algorithm is that a set of spectra in training frames and the codebook entries are clustered hierarchically. Based on the deviation vectors between centroids of the training frame clusters and the corresponding codebook clusters, adaptation is performed hierarchically from small to large numbers of clusters. Results of recognition experiments indicate that the proposed adaptation method is highly effective. Possible variations using this method are presented
  • Keywords
    encoding; speech recognition; adaptation algorithm; automatic speaker adaptation method; centroids; codebook entries; deviation vectors; hierarchical spectral clustering; input speech spectra; recognition experiments; training material; unsupervised speaker adaptation method; vector-quantization-based speech recognition systems; Automatic speech recognition; Clustering algorithms; Dictionaries; Hidden Markov models; Humans; Laboratories; Loudspeakers; Microphones; Speech recognition; Telephone sets;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
  • Conference_Location
    Glasgow
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1989.266421
  • Filename
    266421