• DocumentCode
    3335267
  • Title

    A Smoothing Method for a Statistical String Similarity

  • Author

    Takasu, Atsuhiro ; Aihara, Kenro ; Yamada, Taizo

  • Author_Institution
    Nat. Inst. of Inf., Tokyo
  • fYear
    2007
  • fDate
    13-15 Aug. 2007
  • Firstpage
    624
  • Lastpage
    629
  • Abstract
    We often need to measure similarity between objective information when integrating information. We propose an algorithm in this paper for the Bayesian estimation of the parameters of a statistical string similarity model. To do this we need a smoothing technique for the parameter estimation, because a string similarity model usually contains many parameters. We can make a parameter estimation for the statistical similarity model by introducing a Dirichlet prior. The experimental results show that the proposed method is effective enough for approximate matching.
  • Keywords
    Bayes methods; smoothing methods; statistical analysis; Bayesian estimation; Dirichlet prior; parameter estimation; smoothing method; statistical string similarity; Bayesian methods; Character recognition; Costs; Couplings; Hidden Markov models; Optical character recognition software; Parameter estimation; Probability; Smoothing methods; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
  • Conference_Location
    Las Vegas, IL
  • Print_ISBN
    1-4244-1500-4
  • Electronic_ISBN
    1-4244-1500-4
  • Type

    conf

  • DOI
    10.1109/IRI.2007.4296690
  • Filename
    4296690