• DocumentCode
    2576524
  • Title

    A comprehensive audio-visual corpus for teaching sound Persian phoneme articulation

  • Author

    Bastanfard, Azam ; Fazel, Maryam ; Kelishami, Alireza Abdi ; Aghaahmadi, Mohammad

  • Author_Institution
    IRIB Univ., Tehran, Iran
  • fYear
    2009
  • fDate
    11-14 Oct. 2009
  • Firstpage
    169
  • Lastpage
    174
  • Abstract
    Building an audio-visual data corpus is one significant step in audio-visual research. One of the most challenging tasks in computer science is computer-aided speech therapy and language learning. Developing computer applications for training and rehabilitation of the handicapped and helping the hearing and speaking-impaired by facial speech synthesis are among the most helpful, state-of-the-art roles of computer technology in today´s human-machine interacting systems. To date, there have been no audio-visual corpora in Persian language, in that it makes it difficult or even impossible for researchers to carry out studies in the area. This paper gives an indication of the collected Persian audio-visual data corpus. AVA is a comprehensive, systematic collection of both continuous speech and isolated spoken utterances in Persian language. The goal of this project is to facilitate audio-visual research in the language through this data corpus which is available upon request.
  • Keywords
    audio-visual systems; computer based training; educational aids; face recognition; gesture recognition; handicapped aids; human computer interaction; linguistics; patient rehabilitation; speech synthesis; teaching; AVA; comprehensive audio-visual data corpus research; computer application; computer science technology; computer-aided speech therapy; continuous speech utterance; facial speech synthesis; handicapped rehabilitation; handicapped training; hearing-impaired support; human-machine interaction system; isolated spoken utterance; language learning; lip movement identification; sound Persian phoneme articulation teaching; speaking-impaired support; Acoustical engineering; Auditory system; Computer science; Data engineering; Databases; Deafness; Education; Medical treatment; Natural languages; Speech; Audio visual data; Corpus design; Speech therapy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
  • Conference_Location
    San Antonio, TX
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-2793-2
  • Electronic_ISBN
    1062-922X
  • Type

    conf

  • DOI
    10.1109/ICSMC.2009.5346591
  • Filename
    5346591