Subscribe Now Subscribe Today
Science Alert
 
FOLLOW US:     Facebook     Twitter
Blue
   
Curve Top
Asian Journal of Information Management
  Year: 2008 | Volume: 2 | Issue: 1 | Page No.: 14-22
DOI: 10.3923/ajim.2008.14.22
 
Facebook Twitter Digg Reddit Linkedin StumbleUpon E-mail
A Comprehensive Comparative Study Using Vector Space Model with K-Nearest Neighbor on Text Categorization Data
Wa`el Musa Hadi, Fadi Thabtah, Salahideen Mousa, Samer Al Hawari, Ghassan Kanaan and Jafar Ababnih

Abstract:
On 20 text categorization data sets, the research investigated different variations of VSM using KNN algorithm and different term weighting approaches compared in term of F1 measure. The experimental results provide evidence that Dice and Jaccard Coefficient outperformed the Cosine Coefficient approach with regards to F1 results and the Dice-based TF. IDF achieved the highest average scores.
PDF Fulltext XML References Citation Report Citation
How to cite this article:

Wa`el Musa Hadi, Fadi Thabtah, Salahideen Mousa, Samer Al Hawari, Ghassan Kanaan and Jafar Ababnih, 2008. A Comprehensive Comparative Study Using Vector Space Model with K-Nearest Neighbor on Text Categorization Data. Asian Journal of Information Management, 2: 14-22.

DOI: 10.3923/ajim.2008.14.22

URL: https://scialert.net/abstract/?doi=ajim.2008.14.22

 
COMMENT ON THIS PAPER
 
 
 

 

 
 
 
 
 
 
 
 
 

 
 
 
 
 

       

       

Curve Bottom