Title :
ALISP-Based Data Compression for Generic Audio Indexing
Author :
Khemiri, Houssemeddine ; Delacretaz, Dijana Petrovska ; Chollet, Gerard
Author_Institution :
Inst. Mines-Telecom, Telecom SudParis, Evry, France
Abstract :
In this paper we propose a generic framework to index and retrieve audio. In this framework, audio data is transformed into a sequence of symbols using the ALISP tools. In such a way the audio data is represented in a compact way. Then an approximate matching algorithm inspired from the BLAST technique is exploited to retrieve the majority of audio items that could be present in radio stream. The evaluations of the proposed systems are done on a private radio broadcast database provided by YACAST and other publicly available corpora. The experimental results show an excellent performance in audio identification (for advertisement and songs), audio motif discovery (for advertisement and songs), speaker diarization and laughter detection. Moreover, the ALISP-based system has obtained the best results in ETAPE 2011 (Evaluations en Treatment Automatique de la Parole) evaluation campaign for the speaker diarization task.
Keywords :
data compression; indexing; speaker recognition; ALISP tools; BLAST technique; ETAPE 2011 evaluation campaign; Evaluations en Treatment Automatique de la Parole evaluation campaign; YACAST; advertisement; approximate matching algorithm; audio data; audio identification; audio items; audio motif discovery; audio retrieval; automatic language independent speech processing approach; data compression; generic audio indexing; laughter detection; private radio broadcast database; radio stream; songs; speaker diarization; speaker diarization task; Approximation algorithms; Data models; Hidden Markov models; Indexing; Stability analysis; Vectors; ALISP units; audio indexing; data compression; data-driven audio sequencing;
Conference_Titel :
Data Compression Conference (DCC), 2014
Conference_Location :
Snowbird, UT
DOI :
10.1109/DCC.2014.81