
Information Technology Journal

Year: 2012 | Volume: 11 | Issue: 3 | Page No.: 319-323
DOI: 10.3923/itj.2012.319.323
Kernel Sparse Feature Selection Based on Semantics in Text Classification
Zhantao Deng, Guyu Hu, Zhisong Pan and Yanyan Zhang

Abstract: Sparse representation, which originates from compressed sensing theory in signal processing, has attracted increasing interest in the computer vision research community. In this paper, we present a novel non-parametric feature selection method for text classification based on sparse representation. To address the problem of polysemy and synonymy in the vector space model (VSM), we construct a semantic structure that represents documents using probabilistic latent semantic analysis (PLSA). Motivated by the fact that the kernel trick can capture nonlinear similarity between features, which may reduce the feature quantization error, we propose Empirical Kernel Sparse Representation (EKSR). We apply EKSR to reconstruct the weight vectors between samples and then design an evaluation mechanism, the Kernel Sparsity Score (KSS), to select a good feature subset. Owing to the natural discriminative power of EKSR, KSS can find "good" features that preserve the original structure with less information loss. Experimental results on both English and Chinese datasets demonstrate the effectiveness of the proposed method.
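
The abstract does not give the formulas for EKSR or KSS, so the following is only an illustrative sketch of the general pipeline it describes, not the authors' implementation. It assumes an RBF kernel, an L1-regularized reconstruction of each sample's empirical kernel map from the other samples (solved with plain ISTA), and a variance-normalized reconstruction residual as the per-feature score. All function names, the parameters gamma and lam, and the scoring formula are assumptions made for illustration; in the paper's setting X would be the PLSA-based document representation.

```python
import numpy as np

def rbf_kernel(X, gamma=1.0):
    """Gram matrix K[i, j] = exp(-gamma * ||x_i - x_j||^2) (assumed kernel)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.exp(-gamma * np.maximum(d2, 0.0))

def lasso_ista(D, y, lam=0.05, n_iter=300):
    """Minimize 0.5 * ||y - D w||^2 + lam * ||w||_1 with ISTA."""
    L = np.linalg.norm(D, 2) ** 2 + 1e-12  # Lipschitz constant of the gradient
    w = np.zeros(D.shape[1])
    for _ in range(n_iter):
        z = w - D.T @ (D @ w - y) / L                       # gradient step
        w = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)  # soft threshold
    return w

def sparse_weights(K, lam=0.05):
    """Row i holds sparse weights reconstructing sample i from the others.

    Each sample is represented by its empirical kernel map, i.e. column i of K.
    """
    n = K.shape[0]
    S = np.zeros((n, n))
    for i in range(n):
        others = np.delete(np.arange(n), i)  # exclude the sample itself
        S[i, others] = lasso_ista(K[:, others], K[:, i], lam)
    return S

def sparsity_scores(X, S):
    """Hypothetical score: residual of each feature under S, over its variance.

    Lower values mean the feature better preserves the sparse structure.
    """
    resid = X - S @ X
    return (resid ** 2).sum(axis=0) / (X.var(axis=0) + 1e-12)

def select_features(X, k, gamma=1.0, lam=0.05):
    """Return the indices of the k features with the lowest scores."""
    S = sparse_weights(rbf_kernel(X, gamma), lam)
    return np.argsort(sparsity_scores(X, S))[:k]
```

Under these assumptions, select_features(X, k) takes a samples-by-features matrix and returns k column indices. A production version would likely replace the hand-written ISTA loop with a tuned Lasso solver and choose gamma and lam by validation.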


How to cite this article
Zhantao Deng, Guyu Hu, Zhisong Pan and Yanyan Zhang, 2012. Kernel Sparse Feature Selection Based on Semantics in Text Classification. Information Technology Journal, 11: 319-323.

© Science Alert. All Rights Reserved