• DocumentCode
    3738572
  • Title
    Shared speech attribute augmentation for English-Tibetan cross-language phone recognition
  • Author
    Yue Zhao;Nan Zhou;Libing Zhang;Licheng Wu;Rui Zheng;Xiaoyang Wang;Qiang Ji
  • Author_Institution
    Department of Automation, Minzu University of China, Beijing
  • fYear
    2015
  • Firstpage
    539
  • Lastpage
    543
  • Abstract
    Finding a universal set of speech attributes shared among a large number of languages is a challenging research topic for detection-based, bottom-up cross-language speech recognition. In some recent work, articulatory features are used as a universal set of speech attributes shared across many different languages. Because they are defined by humans as a set of semantic articulatory descriptions of phones, these manually specified attributes capture the articulation information of all languages incompletely and are not distinctive enough for accurate phoneme recognition in cross-language transfer. In this paper, we address the problem of obtaining a more complete articulatory feature representation by using a sparse coding method. We learn augmented articulatory attributes that sparsely represent more of the speech articulation information shared between the source and target languages. In our experiments on English-Tibetan cross-language phone recognition, the augmented attributes achieved better accuracy than the semantic attributes.
  • Keywords
    "Semantics","Speech recognition","Speech","Encoding","Hidden Markov models","Dictionaries","Feature extraction"
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing and Information Technology (ISSPIT), 2015 IEEE International Symposium on
  • Type
    conf
  • DOI
    10.1109/ISSPIT.2015.7394395
  • Filename
    7394395