Kernel Sparse Feature Selection Based on Semantics in Text Classification

Abstract: Sparse representation originating from signal compressed sensing theory has attracted increasing interest in computer vision research community. In this paper, we present a novel non-parametric feature selection method based on sparse representation in text classification. In order to solve the problem of polysems and synonyms in VSM, we construct semantic structure to represent document with PLSA. Motivated by the fact that kernel trick can capture the nonlinear similarity of features, which may reduce the feature quantization error, we propose Empirical Kernel Sparse Representation (EKSR). We apply EKSR to reconstruct weight vector between samples, then design evaluating mechanism CKernel Sparsity Score (KSS) to select excellent feature subset. As the natural discriminative power of EKSR, KSS can find Agood@ feature which preserves the original structure with less information loss. The results of experiment both on English and Chinese dataset demonstrate the effectiveness of the proposed method.

HOME JOURNALS CONTACT

Information Technology Journal

Year: 2012 | Volume: 11 | Issue: 3 | Page No.: 319-323
DOI: 10.3923/itj.2012.319.323

Kernel Sparse Feature Selection Based on Semantics in Text Classification

Zhantao Deng, Guyu Hu, Zhisong Pan and Yanyan Zhang

How to cite this article

Zhantao Deng, Guyu Hu, Zhisong Pan and Yanyan Zhang, 2012. Kernel Sparse Feature Selection Based on Semantics in Text Classification. Information Technology Journal, 11: 319-323.

Related Articles:

HOME JOURNALS CONTACT

Information Technology Journal

Year: 2012 | Volume: 11 | Issue: 3 | Page No.: 319-323 DOI: 10.3923/itj.2012.319.323

Kernel Sparse Feature Selection Based on Semantics in Text Classification

Zhantao Deng, Guyu Hu, Zhisong Pan and Yanyan Zhang

How to cite this article

Zhantao Deng, Guyu Hu, Zhisong Pan and Yanyan Zhang, 2012. Kernel Sparse Feature Selection Based on Semantics in Text Classification. Information Technology Journal, 11: 319-323.

Related Articles:

Year: 2012 | Volume: 11 | Issue: 3 | Page No.: 319-323
DOI: 10.3923/itj.2012.319.323