• DocumentCode
    3428296
  • Title

    Deploying GOOG-411: Early lessons in data, measurement, and testing

  • Author

    Bacchiani, Michiel ; Beaufays, Françoise ; Schalkwyk, Johan ; Schuster, Mike ; Strope, Brian

  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    5260
  • Lastpage
    5263
  • Abstract
    We describe our early experience building and optimizing GOOG-411, a fully automated, voice-enabled, business finder. We show how taking an iterative approach to system development allows us to optimize the various components of the system, thereby progressively improving user-facing metrics. We show the contributions of different data sources to recognition accuracy. For business listing language models, we see a nearly linear performance increase with the logarithm of the amount of training data. To date, we have improved our correct accept rate by 25% absolute, and increased our transfer rate by 35% absolute.
  • Keywords
    business data processing; search engines; speech recognition; GOOG-411 fully automated voice-enabled business finder; Web-based business search engine; business listing language models; iterative approach; speech-based system; Cities and towns; Condition monitoring; Iterative methods; Network servers; Pipelines; Redundancy; Speech recognition; Speech synthesis; Testing; Training data; business finder; directory assistance; speech recognition; voice search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518846
  • Filename
    4518846