DocumentCode
417707
Title
Exploratory analysis and visualization of speech and music by locally linear embedding
Author
Jain, Viren ; Saul, Lawrence K.
Author_Institution
Dept. of Comput. & Inf. Sci., Univ. of Pennsylvania, Philadelphia, PA, USA
Volume
3
fYear
2004
fDate
17-21 May 2004
Abstract
Many problems in voice recognition and audio processing involve feature extraction from raw waveforms. The goal of feature extraction is to reduce the dimensionality of the audio signal while preserving the informative signatures that, for example, distinguish different phonemes in speech or identify particular instruments in music. If the acoustic variability of a data set is described by a small number of continuous features, then we can imagine the data as lying on a low dimensional manifold in the high dimensional space of all possible waveforms. Locally linear embedding (LLE) is an unsupervised learning algorithm for feature extraction in this setting. In this paper, we present results from the exploratory analysis and visualization of speech and music by LLE.
Keywords
audio signal processing; data visualisation; feature extraction; music; reduced order systems; speech processing; unsupervised learning; LLE unsupervised learning algorithm; audio processing; audio signal informative signatures; continuous features; data set acoustic variability; dimensionality reduction; high dimensional data representation; locally linear embedding; low dimensional manifold; music visualization; musical instruments; raw waveform feature extraction; speech analysis; speech phonemes; speech visualization; voice recognition; waveform high dimensional space; Data visualization; Embedded computing; Feature extraction; Instruments; Multiple signal classification; Principal component analysis; Signal processing; Signal processing algorithms; Speech analysis; Unsupervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326712
Filename
1326712
Link To Document