• DocumentCode
    3141575
  • Title

    Samsa: a speech analysis, mining and summary application for outbound telephone calls

  • Author

    Cooper, James W. ; Viswanathan, M. ; Kazi, Z.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2001
  • fDate
    6-6 Jan. 2001
  • Abstract
    The authors applied speech recognition and text mining technologies to a set of 522 recorded outbound marketing calls and analyzed the results. Since speaker-independent speech recognition technology results in a significantly lower recognition rate than that found when the recognizer is trained for a particular speaker, we applied a number of post processing algorithms to the output of the recognizer to render it suitable for the Textract text mining system. We indexed the call transcripts using a search engine and used Textract and associated Java technologies to place the relevant terms for each document in a relational database. Following a search query, we generated a thumbnail display of the results of each call with the salient terms highlighted. We illustrate these results and discuss their utility. We describe a distinct document genre based on the notetaking concept of document content, and propose a significant new method for measuring speech recognition accuracy.
  • Keywords
    Java; computer telephony integration; data mining; marketing data processing; relational databases; search engines; speech processing; speech recognition; text analysis; Java technologies; Samsa; Textract text mining system; call transcripts; document content; document genre; notetaking concept; outbound telephone calls; post processing algorithms; recognition rate; recorded outbound marketing calls; relational database; search engine; search query; speaker-independent speech recognition technology; speech analysis; speech recognition; speech recognition accuracy measurement; summary application; text mining technologies; thumbnail display; Data mining; Displays; Information retrieval; Search engines; Speech analysis; Speech recognition; Telephony; Text analysis; Text mining; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Sciences, 2001. Proceedings of the 34th Annual Hawaii International Conference on
  • Conference_Location
    Maui, HI, USA
  • Print_ISBN
    0-7695-0981-9
  • Type

    conf

  • DOI
    10.1109/HICSS.2001.926472
  • Filename
    926472