Publications

Detailed Information

Measuring Source Code Similarity by Finding Similar Subgraph with an Incremental Genetic algorithm

DC Field Value Language
dc.contributor.authorKim, Jinhyunen
dc.contributor.authorChoi, HyukGeun-
dc.contributor.authorYun, Hansang-
dc.contributor.authorMoon, Byung-Ro-
dc.date.accessioned2017-04-19T00:20:08Z-
dc.date.available2017-12-01T16:09:14Z-
dc.date.issued2016-07-
dc.identifier.citationProceeding GECCO '16 Proceedings of the Genetic and Evolutionary Computation Conference 2016, pp. 925-932-
dc.identifier.isbn978-1-4503-4206-3-
dc.identifier.urihttps://hdl.handle.net/10371/116915-
dc.description.abstractMeasuring similarity between source codes has lots of applications, such as code plagiarism detection, code clone detection, and malware detection. A variety of methods for the measurement have been developed and program-dependence-graph based methods are known to be well working against disguise techniques. But these methods usually rely on solving NP-hard problems which cause a scalability issue. In this paper, we propose a genetic algorithm to measure the similarity between two codes by solving an error correcting subgraph isomorphism problem on dependence graphs. We propose a new cost function for this problem, which reflects the characteristic of the source codes. An incremental genetic algorithm is used to solve the problem. The size of the graph to be searched is gradually increasing during the evolutionary process. We developed new operators for the algorithm, and the overall system is tested on some real world data. Experimental results showed that the system successfully works on code plagiarism detection and malware detection. The similarity computed by the system turned out to reflect the similarity between the codes properly.-
dc.language.isoen-
dc.publisherACM/IEEEen
dc.subjectMeasuring Source Code Similarity by Finding Similar Subgraph with an Incremental Genetic algorithmen
dc.subject복합학en
dc.subjectCode similarity-
dc.subjectsubgraph isomorphism problem-
dc.subjectincremental genetic algorithm-
dc.subjectprogram dependence graph-
dc.titleMeasuring Source Code Similarity by Finding Similar Subgraph with an Incremental Genetic algorithmen
dc.typeArticleen
dc.contributor.AlternativeAuthor김진현-
dc.contributor.AlternativeAuthor최혁근-
dc.contributor.AlternativeAuthor윤한상-
dc.contributor.AlternativeAuthor문병로-
dc.identifier.doi10.1145/2908812.2908870-
dc.description.srndOAIID:RECH_ACHV_DSTSH_NO:T201700181-
dc.description.srndRECH_ACHV_FG:RR00200001-
dc.description.srndADJUST_YN:-
dc.description.srndEMP_ID:A001584-
dc.description.srndCITE_RATE:0-
dc.description.srndDEPT_NM:컴퓨터공학부-
dc.description.srndEMAIL:moon@snu.ac.kr-
dc.description.srndSCOPUS_YN:Y-
dc.description.srndCONFIRM:Y-
dc.identifier.srndT201700181-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share