HOME JOURNALS CONTACT

Journal of Applied Sciences

Year: 2013 | Volume: 13 | Issue: 22 | Page No.: 5349-5355
DOI: 10.3923/jas.2013.5349.5355
Practical Speech Emotion Recognition Based on Im-BFOA
Bao Yongqiang, Xi Ji and Xu Haiyan

Abstract: For Support Vector Machines (SVM) parameter optimization problem, we propose an improved bacterial foraging algorithm (Im-BFOA) and increase its learning ability in the practical speech emotion recognition. Firstly, we introduce simulated annealing (SA),Gaussian mutation and chaotic disturbance operator into BFOA to balance the efficiency of search and the diversity of population. Secondly, use Im-BFOA to optimize SVM parameters and propose a Im-BFOA-SVM method; Thirdly, based on prosodic features, quality features and chaotic features of speech, build a 144-dimension emotional feature vector and use FDR to dimension reduction to 5 dimensions; Finally, test the algorithm performance on the practical speech emotion database and compare the proposed algorithm Particle Swarm Optimization(PSO) algorithm to optimize the parameters of SVM (PSO-SVM method) with basic SVM methods and Back-Propagation (BP) neural network method. Experimental results show that average recognition rate of the Im-BFOA-SVM method reached 78.1%, respectively, higher than PSO-SVM method, SVM methods and BP neural network method of 3.7, 5.4 and 9.8%, indicating that Im-BFOA is a kind of effective SVM parameter selection method which can significantly improve practical speech emotion recognition rate.

Fulltext PDF

How to cite this article
Bao Yongqiang, Xi Ji and Xu Haiyan, 2013. Practical Speech Emotion Recognition Based on Im-BFOA. Journal of Applied Sciences, 13: 5349-5355.

Keywords: Im-BFOA, SVM, speech emotion recognition and neural network

REFERENCES

  • Yeh, J.H., T.L. Pao, C.Y. Lin, Y.W. Tsai and Y.T. Chen, 2011. Segment-based emotion recognition from continuous Mandarin Chinese speech. Comput. Hum. Behav., 27: 1545-1552.
    CrossRef    


  • Zhao, Y., L. Zhao, C. Zou and Y. Yu, 2008. Speech emotion recognition using modified quadratic discrimination function. J. Electron., 25: 840-844.
    CrossRef    


  • Nicholson, J., K. Takahashi and R. Nakatsu, 2000. Emotion recognition in speech using neural networks. Neural Comput. Appl., 9: 290-296.
    CrossRef    


  • Nwe, T.L., S.W. Foo and L.C. De Silva, 2003. Speech emotion recognition using hidden Markov models. Speech Commun., 41: 603-623.
    CrossRef    Direct Link    


  • Huang, C.W., Y. Zhao, Y. Jin, Y.H. Yu and L. Zhao, 2011. A study on feature analysis and recognition of practical speech emotion. J. Electron. Inform. Technol., 33: 112-116.
    CrossRef    Direct Link    


  • Cortes, C. and V. Vapnik, 1995. Support-vector networks. Mach. Learn., 20: 273-297.
    CrossRef    Direct Link    


  • Shao, X.G., H.Z. Yang and G. Chen, 2006. Parameters selection and application of support vector machines based on particle swarm optimization algorithm. Control Theory Appl., 23: 740-743.
    Direct Link    


  • Chen, P.W., J.Y. Wang and H.M. Lee, 2004. Model selection of SVMs using GA approach. Proceedings of the IEEE International Joint Conference on Neural Networks, Volume 3, July 25-29, 2004, Budapest, Hungary, pp: 2035-2040.


  • Passino, K.M., 2002. Biomimicry of bacterial foraging for distributed optimization and control. IEEE Control Syst., 22: 52-67.
    CrossRef    


  • Mishra, S., 2005. A hybrid least square-fuzzy bacterial foraging strategy for harmonic estimation. IEEE Trans. Evol. Comput., 9: 61-73.
    CrossRef    Direct Link    


  • Abraham, A., A. Biswas, S. Dasgupta and S. Das, 2008. Analysis of reproduction operator in bacterial foraging optimization algorithm. Proceedings of the IEEE Congress on Evolutionary Computation, June 1-6, 2008, Hong Kong, pp: 1476-1483.


  • Brooks, S.P. and B.J.T. Morgan, 1995. Optimization using simulated annealing. J. R. Stat. Soc. Ser. D (Statistician), 44: 241-257.
    Direct Link    


  • Amir, N., S. Ron and N. Laor, 2000. Analysis of an emotional speech corpus in Hebrew based on objective criteria. Proceedings of the Tutorial and Research Workshop on Speech and Emotion, September 5-7, 2000, Newcastle, Northern Ireland, UK., pp: 29-33.


  • Paeschke, A. and W.F. Sendlmeier, 2000. Prosodic characteristics of emotional speech: Measurements of fundamental frequency movements. Proceedings of the Tutorial and Research Workshop on Speech and Emotion, September 5-7, 2000, Newcastle, Northern Ireland, UK., pp: 75-80.


  • Ververidis, D., C. Kotropoulos and I. Pitas, 2004. Automatic emotional speech classification. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Volume 1, May 17-21, 2004, Montreal, Canada, pp: 593-596.

  • © Science Alert. All Rights Reserved