Subscribe Now Subscribe Today
Research Article
 

A Fast Classification Algorithm for Big Data Based on KNN



Kun Niu, Fang Zhao and Shubo Zhang
 
Facebook Twitter Digg Reddit Linkedin StumbleUpon E-mail
ABSTRACT

As massive data acquisition and storage becomes increasing affordable, a wide variety of researchers are employing methods to engage in sophisticated data mining. This study focuses on fast classification for big data based on a traditional classification method KNN (K-Nearest Neighbor). We reform the standard KNN algorithm and present a new algorithm named NFC (Neighbor Filter Classification). The NFC algorithm firstly computes the class distribution in each attribute of original dataset and sorts attributes by classification contribution. Secondly, NFC gets the model of the KNN result on training set to estimate the finite scope of the k-nearest neighbor. Then NFC uses test set to get the proper parameters and updates model regularly to make it efficient. Experimental results show the excellent ability of classification and low computation cost of NFC.

Services
Related Articles in ASCI
Similar Articles in this Journal
Search in Google Scholar
View Citation
Report Citation

 
  How to cite this article:

Kun Niu, Fang Zhao and Shubo Zhang, 2013. A Fast Classification Algorithm for Big Data Based on KNN. Journal of Applied Sciences, 13: 2208-2212.

DOI: 10.3923/jas.2013.2208.2212

URL: https://scialert.net/abstract/?doi=jas.2013.2208.2212
 

REFERENCES
Cohen, J., B. Dolan, M. Dunlap, J.M. Hellerstein and C. Welton, 2009. MAD skills: New analysis practices for big data. VLDB, Lyon, France, August 24-28, 2009. VLDB Endowment Inc., pp: 1481-1492.

Qin, X.P., H.J. Wang, X.Y. Du and S. Wang, 2012. Big data analysis-competition and symbiosis of RDBMS and MapReduce. J. Software, 23: 32-45.
CrossRef  |  Direct Link  |  

Shen, D.R., G. Yu, X.T. Wang, T.Z. Nie and Y. Kou, 2013. Survey on NoSQL for management of big data. Chin. J. Soft., 24: 1786-1803.

Wang, S., H.J. Wang, X.P. Qin and X. Zhou, 2011. Architecting big data: Challenges, studies and forecasts. Chin. J. Comp., 34: 1741-1752.
CrossRef  |  

©  2020 Science Alert. All Rights Reserved