Research Article Open Access

SPEECH/MUSIC CLASSIFICATION USING WAVELET BASED FEATURE EXTRACTION TECHNIQUES

Thiruvengatanadhan Ramalingam1 and P. Dhanalakshmi1
  • 1 Annamalai University, India

Abstract

Audio classification serves as the fundamental step towards the rapid growth in audio data volume. Due to the increasing size of the multimedia sources speech and music classification is one of the most important issues for multimedia information retrieval. In this work a speech/music discrimination system is developed which utilizes the Discrete Wavelet Transform (DWT) as the acoustic feature. Multi resolution analysis is the most significant statistical way to extract the features from the input signal and in this study, a method is deployed to model the extracted wavelet feature. Support Vector Machines (SVM) are based on the principle of structural risk minimization. SVM is applied to classify audio into their classes namely speech and music, by learning from training data. Then the proposed method extends the application of Gaussian Mixture Models (GMM) to estimate the probability density function using maximum likelihood decision methods. The system shows significant results with an accuracy of 94.5%.

Journal of Computer Science
Volume 10 No. 1, 2014, 34-44

DOI: https://doi.org/10.3844/jcssp.2014.34.44

Submitted On: 2 October 2013 Published On: 8 November 2013

How to Cite: Ramalingam, T. & Dhanalakshmi, P. (2014). SPEECH/MUSIC CLASSIFICATION USING WAVELET BASED FEATURE EXTRACTION TECHNIQUES. Journal of Computer Science, 10(1), 34-44. https://doi.org/10.3844/jcssp.2014.34.44

  • 3,193 Views
  • 4,377 Downloads
  • 19 Citations

Download

Keywords

  • Audio Classification
  • Feature Extraction
  • Wavelet Transform
  • Support Vector Machine (SVM)
  • Gaussian Mixture Model (GMM)