Wen Li
Information Engineering School, Nanchang University, 999Xuefu Road, Honggutan New District, Nanchang, 330031, China
Weili Wang
Information Engineering School, Nanchang University, 999Xuefu Road, Honggutan New District, Nanchang, 330031, China
Ling Chai
Information Engineering School, Nanchang University, 999Xuefu Road, Honggutan New District, Nanchang, 330031, China
ABSTRACT
As one of the important techniques in large-scale data organizing, text categorization has been widely investigated. But the existing hierarchical classification methods often suffer from inter-level error transmission, namely blocking. In this paper, blocking distribution based topology reconstruction method was proposed for hierarchical text categorization problem. Firstly, blocking distribution recognition technique is put forward to mining out the serious high-level misclassification class. Subsequently, original hierarchical structure are reconstructed using blocking direction information obtained ahead, which increasing the path for the blocking instance to the correct subclass. Experimental studies on Chinese text classification benchmark Tan Corp, demonstrate that the proposed algorithm performs better than the traditional hierarchical and state-of-the-art flat classification strategies.
PDF References Citation
How to cite this article
Wen Li, Weili Wang and Ling Chai, 2013. Blocking Distribution Based Hierarchical Reconstruction for Text Categorization. Journal of Applied Sciences, 13: 2123-2126.
DOI: 10.3923/jas.2013.2123.2126
URL: https://scialert.net/abstract/?doi=jas.2013.2123.2126
DOI: 10.3923/jas.2013.2123.2126
URL: https://scialert.net/abstract/?doi=jas.2013.2123.2126
REFERENCES
- Ceci, M. and D. Malerba, 2007. Classifying web documents in a hierarchy of categories: A comprehensive study. J. Intell. Inform. Syst., 28: 37-78.
CrossRefDirect Link - Sun, A., E.P. Lim and W.K. Ng, 2003. Performance measurement framework for hierarchical text classification. J. Am. Soc. Inform. Sci. Technol., 54: 1014-1028.
CrossRefDirect Link - Li, W., D.Q. Miao, W. Wang and N. Zhang, 2010. Hierarchical rough decision theoretic framework for text classification. Proceedings of the 9th IEEE International Conference on Cognitive Informatics, July 7-9, 2010, Beijing, China, pp: 484-489.
CrossRef - Sun, A., E.P. Lim, W.K. Ng and J. Srivastava, 2004. Blocking reduction strategies in hierarchical text classification. IEEE Trans. Knowl. Data Eng., 16: 1305-1308.
CrossRefDirect Link - Tan, S., 2006. An effective refinement strategy for KNN Text classifier. Expert Syst. Applic., 30: 290-298.
CrossRef - Joachims, T., 1998. Text categorization with support vector machines: Learning with many relevant features. Proceedings of the 10th European Conference on Machine Learning, Chemnitz, Germany, April 21-23, 1998, Springer, Berlin, Heidelberg, pp: 137-142.
CrossRefDirect Link