مرکز منطقه ای اطلاع رساني علوم و فناوري - Bootstrapping a spoken language identification system using unsupervised integrated sensing and processing decision trees

DocumentCode :

3485647

Title :

Bootstrapping a spoken language identification system using unsupervised integrated sensing and processing decision trees

Author :

Huang, Shuai ; Karakos, Damianos ; Coppersmith, Glen A. ; Church, Kenneth W. ; Siniscalchi, Sabato Marco

Author_Institution :

Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA

fYear :

2011

fDate :

11-15 Dec. 2011

Firstpage :

342

Lastpage :

347

Abstract :

In many inference and learning tasks, collecting large amounts of labeled training data is time consuming and expensive, and oftentimes impractical. Thus, being able to efficiently use small amounts of labeled data with an abundance of unlabeled data-the topic of semi-supervised learning (SSL) [1]-has garnered much attention. In this paper, we look at the problem of choosing these small amounts of labeled data, the first step in a bootstrapping paradigm. Contrary to traditional active learning where an initial trained model is employed to select the unlabeled data points which would be most informative if labeled, our selection has to be done in an unsupervised way, as we do not even have labeled data to train an initial model. We propose using unsupervised clustering algorithms, in particular integrated sensing and processing decision trees (ISPDTs) [2], to select small amounts of data to label and subsequently use in SSL (e.g. transductive SVMs). In a language identification task on the CallFriend1 and 2003 NIST Language Recognition Evaluation corpora [3], we demonstrate that the proposed method results in significantly improved performance over random selection of equivalently sized training data.

Keywords :

bootstrapping; decision trees; learning (artificial intelligence); bootstrapping; integrated sensing and processing decision trees; semisupervised learning; spoken language identification system; unsupervised clustering algorithms; unsupervised integrated sensing; Clustering algorithms; Error analysis; Feature extraction; NIST; Speech; Training data; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on

Conference_Location :

Waikoloa, HI

Print_ISBN :

978-1-4673-0365-1

Electronic_ISBN :

978-1-4673-0366-8

Type :

conf

DOI :

10.1109/ASRU.2011.6163955

Filename :

6163955

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3485647