Haute couture for transcription´s next top model: text processing technologies on the catwalk

Author

Bellegarda, Jerome

fYear

2005

fDate

27-27 Nov. 2005

Firstpage

8

Lastpage

9

Abstract

Summary form only given. The top model metaphor refers to forward-looking applications expected to come of age in the next few years. They are likely to involve the large-scale automatic transcription of lectures, meetings, podcasts, oral histories, etc. In the realm of podcasting alone, free-form content searching could quickly become critical at the current growth rate of the medium. Such applications entail many speech recognition challenges, which John Hansen comprehensively covers in the first talk of the forum. But they also raise a number of issues which are somewhat orthogonal to the process of transcription itself. This talk, accordingly, serves as a catwalk presentation of the various component technologies involved. It points to recent work in each area, highlights major bottlenecks, covers promising solutions, suggests some new areas of research, and offers some perspectives regarding comparative merits and associated trade-offs

Keywords

speech recognition; text analysis; free-form content searching; large-scale automatic transcription; speech recognition; text processing technologies; Application software; Broadcast technology; Data mining; Digital audio broadcasting; Event detection; History; Large-scale systems; Speech recognition; Telephony; Text processing;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on

Conference_Location

San Juan

Print_ISBN

0-7803-9478-X

Electronic_ISBN

0-7803-9479-8

Type

conf

DOI

10.1109/ASRU.2005.1566464

Filename

1566464