Search. Read. Cite.

Easy to search. Easy to read. Easy to cite with credible sources.

Journal of Applied Sciences

Year: 2013  |  Volume: 13  |  Issue: 14  |  Page No.: 2843 - 2847

Programs Similarity Measure Based on Tree Structure and Eigenvector

Dongmei Li, Di Zhang, Zhifang Wei and Jianxin Wang

Abstract

Program similarity measure technology is to detect the similarity among the programs by certain means. It is widely used in teaching and protection of intellectual property rights. Most current program similarity measure technologies suffer from low accuracy. Based on previous studies of program similarity measure method, this study proposes a method based on tree structure and eigenvector. Firstly, the actual frequency of keywords in the program is counted through employing hierarchical tree structure. Sencondly, the frequency is applied to generate eigenvector of program and the traditional method based on vector is improved. Finally, a program similarity measure system named Cplag is implemented which can be used to measure C language program similarity. Experimental results indicate that CPlag has apparent advantages in some aspects compared with famous Jplag.

Cited References Fulltext