Author/Authors :
Jahanirad، نويسنده , , Mehdi and Wahab، نويسنده , , Ainuddin Wahid Abdul and Anuar، نويسنده , , Nor Badrul and Idris، نويسنده , , Mohd Yamani Idna and Ayub، نويسنده , , Mohd Nizam، نويسنده ,
Abstract :
Mel-frequency cepstrum coefficients (MFCCs) extracted from speech recordings has been proven to be the most effective feature set to capture the frequency spectra produced by a recording device. This paper claims that audio evidence such as a recorded call contains intrinsic artifacts at both transmitting and receiving ends. These artifacts allow recognition of the source mobile device on the other end through recording the call. However, MFCC features are contextualized by the speech contents, speaker׳s characteristics and environments. Thus, a device-based technique needs to consider the identification of source transmission devices and improve the robustness of MFCCs. This paper aims to investigate the use of entropy of Mel-cepstrum coefficients to extract intrinsic mobile device features from near-silent segments, where it remains robust to the characteristics of different speakers. The proposed features are compared with five different combinations of statistical moments of MFCCs, including the mean, standard deviation, variance, skewness, and kurtosis of MFCCs. All feature sets are analyzed by using five supervised learning techniques, namely, support vector machine, naïve Bayesian, neural network, linear logistic regression, and rotation forest classifier, as well as two unsupervised learning techniques known as probabilistic-based and nearest-neighbor-based algorithms. The experimental results show that the best performance was achieved with entropy–MFCC features that use the naïve Bayesian classifier, which resulted in an average accuracy of 99.99% among 21 mobile devices.
Keywords :
Mel-frequency cepstrum coefficient , entropy , Device-based detection technique , Pattern recognition