Treffer: A systematic literature review on the applications of recurrent neural networks in code clone research.

Title:
A systematic literature review on the applications of recurrent neural networks in code clone research.
Authors:
Quradaa FH; Department of Computer Science, University of Peshawar, Peshawar, Pakistan.; Department of Computer Science, Aden Community College, Aden, Yemen., Shahzad S; Department of Computer Science, University of Peshawar, Peshawar, Pakistan., Almoqbily RS; Department of Computer Science, University of Peshawar, Peshawar, Pakistan.; Department of Computer Science, Aden Community College, Aden, Yemen.
Source:
PloS one [PLoS One] 2024 Feb 02; Vol. 19 (2), pp. e0296858. Date of Electronic Publication: 2024 Feb 02 (Print Publication: 2024).
Publication Type:
Systematic Review; Journal Article
Language:
English
Journal Info:
Publisher: Public Library of Science Country of Publication: United States NLM ID: 101285081 Publication Model: eCollection Cited Medium: Internet ISSN: 1932-6203 (Electronic) Linking ISSN: 19326203 NLM ISO Abbreviation: PLoS One
Imprint Name(s):
Original Publication: San Francisco, CA : Public Library of Science
References:
BMC Genomics. 2020 Jan 2;21(1):6. (PMID: 31898477)
Neural Comput. 1997 Nov 15;9(8):1735-80. (PMID: 9377276)
Neural Comput. 2000 Oct;12(10):2451-71. (PMID: 11032042)
Curr Top Med Chem. 2008;8(18):1691-709. (PMID: 19075775)
IEEE Trans Neural Netw. 1994;5(2):157-66. (PMID: 18267787)
Neural Netw. 2005 Jun-Jul;18(5-6):602-10. (PMID: 16112549)
J Big Data. 2021;8(1):53. (PMID: 33816053)
Entry Date(s):
Date Created: 20240202 Date Completed: 20240807 Latest Revision: 20240807
Update Code:
20250114
PubMed Central ID:
PMC10836701
DOI:
10.1371/journal.pone.0296858
PMID:
38306372
Database:
MEDLINE

Weitere Informationen

Code clones, referring to code fragments that are either similar or identical and are copied and pasted within software systems, have negative effects on both software quality and maintenance. The objective of this work is to systematically review and analyze recurrent neural network techniques used to detect code clones to shed light on the current techniques and offer valuable knowledge to the research community. Upon applying the review protocol, we have successfully identified 20 primary studies within this field from a total of 2099 studies. A deep investigation of these studies reveals that nine recurrent neural network techniques have been utilized for code clone detection, with a notable preference for LSTM techniques. These techniques have demonstrated their efficacy in detecting both syntactic and semantic clones, often utilizing abstract syntax trees for source code representation. Moreover, we observed that most studies applied evaluation metrics like F-score, precision, and recall. Additionally, these studies frequently utilized datasets extracted from open-source systems coded in Java and C programming languages. Notably, the Graph-LSTM technique exhibited superior performance. PyTorch and TensorFlow emerged as popular tools for implementing RNN models. To advance code clone detection research, further exploration of techniques like parallel LSTM, sentence-level LSTM, and Tree-Structured GRU is imperative. In addition, more research is needed to investigate the capabilities of the recurrent neural network techniques for identifying semantic clones across different programming languages and binary codes. The development of standardized benchmarks for languages like Python, Scratch, and C#, along with cross-language comparisons, is essential. Therefore, the utilization of recurrent neural network techniques for clone identification is a promising area that demands further research.
(Copyright: © 2024 Quradaa et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)

The authors have declared that no competing interests exist.