Bao Yongqiang
School of Communication Engineering, Nanjing Institute of Technology, 211167, Nanjing, China
Xi Ji
School of IOT Engineering, Hohai University, 213022, Changzhou, China
Xu Haiyan
School of IOT Engineering, Hohai University, 213022, Changzhou, China
ABSTRACT
For Support Vector Machines (SVM) parameter optimization problem, we propose an improved bacterial foraging algorithm (Im-BFOA) and increase its learning ability in the practical speech emotion recognition. Firstly, we introduce simulated annealing (SA),Gaussian mutation and chaotic disturbance operator into BFOA to balance the efficiency of search and the diversity of population. Secondly, use Im-BFOA to optimize SVM parameters and propose a Im-BFOA-SVM method; Thirdly, based on prosodic features, quality features and chaotic features of speech, build a 144-dimension emotional feature vector and use FDR to dimension reduction to 5 dimensions; Finally, test the algorithm performance on the practical speech emotion database and compare the proposed algorithm Particle Swarm Optimization(PSO) algorithm to optimize the parameters of SVM (PSO-SVM method) with basic SVM methods and Back-Propagation (BP) neural network method. Experimental results show that average recognition rate of the Im-BFOA-SVM method reached 78.1%, respectively, higher than PSO-SVM method, SVM methods and BP neural network method of 3.7, 5.4 and 9.8%, indicating that Im-BFOA is a kind of effective SVM parameter selection method which can significantly improve practical speech emotion recognition rate.
PDF References Citation
Received: June 03, 2013;
Accepted: October 08, 2013;
Published: November 13, 2013
How to cite this article
Bao Yongqiang, Xi Ji and Xu Haiyan, 2013. Practical Speech Emotion Recognition Based on Im-BFOA. Journal of Applied Sciences, 13: 5349-5355.
DOI: 10.3923/jas.2013.5349.5355
URL: https://scialert.net/abstract/?doi=jas.2013.5349.5355
DOI: 10.3923/jas.2013.5349.5355
URL: https://scialert.net/abstract/?doi=jas.2013.5349.5355
REFERENCES
- Yeh, J.H., T.L. Pao, C.Y. Lin, Y.W. Tsai and Y.T. Chen, 2011. Segment-based emotion recognition from continuous Mandarin Chinese speech. Comput. Hum. Behav., 27: 1545-1552.
CrossRef - Zhao, Y., L. Zhao, C. Zou and Y. Yu, 2008. Speech emotion recognition using modified quadratic discrimination function. J. Electron., 25: 840-844.
CrossRef - Nicholson, J., K. Takahashi and R. Nakatsu, 2000. Emotion recognition in speech using neural networks. Neural Comput. Appl., 9: 290-296.
CrossRef - Nwe, T.L., S.W. Foo and L.C. De Silva, 2003. Speech emotion recognition using hidden Markov models. Speech Commun., 41: 603-623.
CrossRefDirect Link - Huang, C.W., Y. Zhao, Y. Jin, Y.H. Yu and L. Zhao, 2011. A study on feature analysis and recognition of practical speech emotion. J. Electron. Inform. Technol., 33: 112-116.
CrossRefDirect Link - Cortes, C. and V. Vapnik, 1995. Support-vector networks. Mach. Learn., 20: 273-297.
CrossRefDirect Link - Shao, X.G., H.Z. Yang and G. Chen, 2006. Parameters selection and application of support vector machines based on particle swarm optimization algorithm. Control Theory Appl., 23: 740-743.
Direct Link - Chen, P.W., J.Y. Wang and H.M. Lee, 2004. Model selection of SVMs using GA approach. Proceedings of the IEEE International Joint Conference on Neural Networks, Volume 3, July 25-29, 2004, Budapest, Hungary, pp: 2035-2040.
CrossRef - Passino, K.M., 2002. Biomimicry of bacterial foraging for distributed optimization and control. IEEE Control Syst., 22: 52-67.
CrossRef - Mishra, S., 2005. A hybrid least square-fuzzy bacterial foraging strategy for harmonic estimation. IEEE Trans. Evol. Comput., 9: 61-73.
CrossRefDirect Link - Abraham, A., A. Biswas, S. Dasgupta and S. Das, 2008. Analysis of reproduction operator in bacterial foraging optimization algorithm. Proceedings of the IEEE Congress on Evolutionary Computation, June 1-6, 2008, Hong Kong, pp: 1476-1483.
CrossRef - Brooks, S.P. and B.J.T. Morgan, 1995. Optimization using simulated annealing. J. R. Stat. Soc. Ser. D (Statistician), 44: 241-257.
Direct Link - Paeschke, A. and W.F. Sendlmeier, 2000. Prosodic characteristics of emotional speech: Measurements of fundamental frequency movements. Proceedings of the Tutorial and Research Workshop on Speech and Emotion, September 5-7, 2000, Newcastle, Northern Ireland, UK., pp: 75-80.
Direct Link - Ververidis, D., C. Kotropoulos and I. Pitas, 2004. Automatic emotional speech classification. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Volume 1, May 17-21, 2004, Montreal, Canada, pp: 593-596.
CrossRef