DocumentCode
243163
Title
Modelling and characterization of an artificial neural network for infant cry recognition using mel-frequency cepstral coefficients
Author
Bandala, Argel A. ; Lim, Allimzon M. ; Cai, Mark Anthony D. ; Bacar, Allan Jeffrey C. ; Manosca, Aynna Claudine G.
Author_Institution
Dept. of Electron. & Commun. Eng., De La Salle Univ., Manila, Philippines
fYear
2014
fDate
22-25 Oct. 2014
Firstpage
1
Lastpage
6
Abstract
This paper is about the creation of an artificial neural network (ANN) in MATLAB to analyze the features extracted from calculating the mel-frequency cepstral coefficients (MFCC) of the raw audio data. The paper explains basic concepts about the ANN, as well as the MFCC and other relevant theories. Regarding the design of the ANN, it uses multiple infant crying sounds, as well as non-crying sounds, to create a sample training set with a corresponding target that determines whether the sound is a cry or not. The paper uses relevant concepts heavily utilized in speech recognition for the design of the infant cry recognition, modifies them, and adds a few more calculations to fit the desired application to compensate for the differences present in a cry from human speech.
Keywords
audio signal processing; cepstral analysis; feature extraction; neural nets; speech recognition; ANN; MATLAB; MFCC; artificial neural network; feature extraction; infant cry recognition; mel-frequency cepstral coefficients; raw audio data; speech recognition; Artificial neural networks; Feature extraction; Mel frequency cepstral coefficient; Neurons; Pediatrics; Speech recognition; Training; MFCC; activation function; cepstrum; mel scale; neural network; neuron;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON 2014 - 2014 IEEE Region 10 Conference
Conference_Location
Bangkok
ISSN
2159-3442
Print_ISBN
978-1-4799-4076-9
Type
conf
DOI
10.1109/TENCON.2014.7022407
Filename
7022407
Link To Document