• DocumentCode
    2009291
  • Title

    A survey on recent progress in the ASAT/SIRKUS paradigm

  • Author

    Siniscalchi, Sabato Marco ; Svendsen, Torbjørn ; Lee, Chin-Hui

  • Author_Institution
    Dept. of Telematics, Univ. of Enna Kore, Enna, Italy
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    465
  • Lastpage
    470
  • Abstract
    Automatic Speech Attribute Transcription (ASAT), an ITR project sponsored under the NSF grant (IIS-04-27113), and Spoken Information Retrieval by Knowledge Utilization in Statistical Speech Processing (SIRKUS), a project funded by the VERDIKT programme at the Research Council of Norway, are two research projects carried out at Georgia Institute of Technology and at Norwegian University of Science and Technology, respectively, with the purpose of investigating and developing new paradigms for speech recognition that have the capability of bridging the gap between machine and human performance. These projects approach speech recognition from a more linguistic perspective: unlike traditional ASR systems, humans detect acoustic and auditory cues, weigh and combine them to form theories, and then process these cognitive hypotheses until linguistically and pragmatically consistent speech understanding is achieved. A major goal of the ASAT/SIRKUS paradigms is to develop a detection-based approach to automatic speech recognition (ASR) based on attribute detection and knowledge integration. We report on progress of these two projects on two different tasks, namely the cross-language and language universal attribute/phone recognition task, and the language identification (LID) task.
  • Keywords
    cognitive systems; information retrieval; natural language processing; speech processing; speech recognition; statistical analysis; ASAT-SIRKUS Paradigm; Georgia Institute of Technology; ITR project; NSF grant; Norwegian University of Science and Technology; VERDIKT programme; attribute detection; automatic speech attribute transcription; automatic speech recognition; cognitive hypotheses; cross-language recognition task; knowledge integration; knowledge utilization; language identification task; language universal attribute-phone recognition task; speech recognition; spoken information retrieval; statistical speech processing; Accuracy; Acoustics; Detectors; Hidden Markov models; Speech; Speech recognition; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684480
  • Filename
    5684480