Dongmei Li
School of Information Science and Technology, Beijing Forestry University, 100083, Beijing, China
Di Zhang
School of Information Science and Technology, Beijing Forestry University, 100083, Beijing, China
Zhifang Wei
Department of School of Economics and Management, Tsinghua University, 100083, Beijing, China
Jianxin Wang
School of Information Science and Technology, Beijing Forestry University, 100083, Beijing, China
ABSTRACT
Program similarity measure technology is to detect the similarity among the programs by certain means. It is widely used in teaching and protection of intellectual property rights. Most current program similarity measure technologies suffer from low accuracy. Based on previous studies of program similarity measure method, this study proposes a method based on tree structure and eigenvector. Firstly, the actual frequency of keywords in the program is counted through employing hierarchical tree structure. Sencondly, the frequency is applied to generate eigenvector of program and the traditional method based on vector is improved. Finally, a program similarity measure system named Cplag is implemented which can be used to measure C language program similarity. Experimental results indicate that CPlag has apparent advantages in some aspects compared with famous Jplag.
PDF References Citation
How to cite this article
Dongmei Li, Di Zhang, Zhifang Wei and Jianxin Wang, 2013. Programs Similarity Measure Based on Tree Structure and Eigenvector. Journal of Applied Sciences, 13: 2843-2847.
DOI: 10.3923/jas.2013.2843.2847
URL: https://scialert.net/abstract/?doi=jas.2013.2843.2847
DOI: 10.3923/jas.2013.2843.2847
URL: https://scialert.net/abstract/?doi=jas.2013.2843.2847
REFERENCES
- Faidhi, J.A.W. and S.K. Robinson, 1987. An empirical approach for detecting program similarity and plagiarism within a university programming environment. Comput. Edu., 11: 11-19.
CrossRef - Grier, S., 1981. A tool that detects plagiarism in Pascal programs. Proceedings of the 12th SIGCSE Symposium on Computer Science Education, February 4-6, 1981, New York, USA., pp: 15-20.
CrossRef - Prechelt, L., G. Malpohl and M. Philippsen, 2002. Finding plagiarisms among a set of programs with JPlag. J. Univ. Comput. Sci., 8: 1016-1038.
Direct Link - Schleimer, S., D.S. Wilkerson and A. Aiken, 2003. Winnowing: Local algorithms for document fingerprinting. Proceedings of the ACM SIGMOD International Conference on Management of Data, June 9-12, 2003, San Diego, California, USA., pp: 76-85.
CrossRef - Wise, M.J., 1992. Detection of similarities in student program: YAP'ing may be preferable to Plague'ing. Proceedings of the 23rd SIGCSE Technical Symposium on Computer Science Education, March 5-6, 1992, Kansas City, Missouri, USA., pp: 268-271.
CrossRef - Wise, M.J., 1996. YAP3: Improved detection of similarities in computer program and other texts. Proceedings of the 27th SIGCSE technical symposium on Computer Science Education, March 10-12, 1996, New York, NY, USA., pp: 130-134.
CrossRef - Zhao, C.H., H.H. Yan and M.Z. Jin, 2008. Approach based on compiling optimization and disassembling to detect program similarity. J. Beijing Univ. Aeronautics Astronautics, 34: 711-715.
Direct Link