مرکز منطقه ای اطلاع رساني علوم و فناوري - Semantic Learning for Audio Applications: A Computer Vision Approach

DocumentCode :

2076164

Title :

Semantic Learning for Audio Applications: A Computer Vision Approach

Author :

Sukthankar, Rahul ; Ke, Yan ; Hoiem, Derek

Author_Institution :

Intel Research Pittsburgh, Carnegie Mellon

fYear :

2006

fDate :

17-22 June 2006

Firstpage :

112

Lastpage :

112

Abstract :

Recent work in machine learning has significantly benefited semantic extraction tasks in computer vision, particularly for object recognition and image retrieval. We argue that the computer vision techniques that have been successfully applied in those settings can effectively be translated to other domains, such as audio. This claim is supported by recent results in music vs. speech classification, structure from sound, robust music identification and sound object recognition. This paper focuses on two such audio applications and demonstrates how ideas from computer vision map naturally to these problems.

Keywords :

Acoustic noise; Application software; Computer vision; Image analysis; Machine learning; Music information retrieval; Object detection; Object recognition; Robustness; Speech analysis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition Workshop, 2006. CVPRW '06. Conference on

Print_ISBN :

0-7695-2646-2

Type :

conf

DOI :

10.1109/CVPRW.2006.191

Filename :

1640555

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2076164