Result: Survey of Binary Code Similarity Detection Method

Title:
Survey of Binary Code Similarity Detection Method
Source:
Jisuanji kexue, Vol 52, Iss 6, Pp 365-380 (2025)
Publisher Information:
Editorial office of Computer Science, 2025.
Publication Year:
2025
Collection:
LCC:Computer software
LCC:Technology (General)
Document Type:
Academic journal article
File Description:
electronic resource
Language:
Chinese
ISSN:
1002-137X
DOI:
10.11896/jsjkx.240400003
Accession Number:
edsdoj.8b8c251e65da471d85c3a69b76187b8c
Database:
Directory of Open Access Journals

Further Information

Code similarity detection can be divided into two types according to the research object: source code similarity detection and binary code similarity detection. Both are commonly used in scenarios such as malicious code identification, vulnerability search, and copyright protection. In the current domestic Internet environment, programs are usually released as binary files, and the source code of most programs cannot be obtained directly. In software security research, binary code similarity detection therefore has a comparatively wider scope of application. Starting from the definition and implementation process of binary code similarity detection, this survey divides existing methods into three categories according to the form of code representation: text character-based, code embedding-based, and graph embedding-based. Classic binary code similarity detection methods are compared with research from the past five years. A total of 19 papers on new methods are reviewed, and the methods are analyzed and summarized in terms of multi-architecture support, baselines, benchmark datasets, and detection performance. Finally, current problems and possible future research directions are analyzed based on the development of the new methods.