
Code similarity and its applications to Internet-scale Code Cloning
My research focuses on scalability of code similarity measurement (i.e. code clone detection).
Scalable code similarity detection is increasingly important nowadays due to popularity of
sharing code on online sources such as Stack Overflow or GitHub. There are studies showing
that reusing code snippets from Stack Overflow can be harmful by not only introducing vulnerabilities to
a developer's software but also causing software licensing conflicts.
I am now working on building Internet-scale Similar Code Search (ISiCS) tool that can query for
similar code snippets from online sources in instant time. ISiCS will benefit developers during
code review time by checking if there is any code cloned from the Internet, are they harmful,
and do they introduce licensing conflicts to the software.
Publications
C. Ragkhitwetsagul, J. Krinke, R. Oliveto (2017). Awareness and Experience of Developers to Outdated and License-Violating Code on Stack Overflow: An Online Survey. UCL Computer Science Research Note (RN/17/10), 2017.
Download: Here
C. Ragkhitwetsagul, J. Krinke, D.Clark (2017). A Comparison of Code Similarity Analysers. Empirical Software Engineering, To Appear, 2017.
Download: Preprint DOI: 10.1007/s10664-017-9564-7
Slideshow: Slideshare
M. Paixao, J. Krinke, D. Han, C. Ragkhitwetsagul and M. Harman (2017). Are Developers Aware of the Architectural Impact of Their Changes?. In the 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE 2017), Illinois, USA <To Appear>
Download: Preprint
C. Ragkhitwetsagul, J. Krinke (2017). Using Compilation/Decompilation to Enhance Clone Detection. In 11th International Workshop on Software Clones, 2017. Klagenfurt, Austria <To Appear> -- Won the People's Choice Award!
Download: Preprint Slideshow: SlideShare
C. Ragkhitwetsagul, J. Krinke, D. Clark (2016). Similarity of Source Code in the Presence of Pervasive Modifications. In 16th IEEE International Working Conference on Source Code Analysis and Manipulation (SCAM), 2016. North Carolina, USA.
Download: Preprint DOI: 10.1109/SCAM.2016.13 Slideshow: SlideShare
C. Ragkhitwetsagul (2016). Measuring Code Similarity in Large-scaled Code Corpora. In 32nd International Conference on Software Maintenance and Evolution (ICSME): Doctoral Symposium, 2016. North Carolina, USA.
Download: Preprint DOI: 10.1109/ICSME.2016.18
C. Ragkhitwetsagul, M. Paixao, M. Adham, S. Busari, J. Krinke, and J.H. Drake (2016). Searching for Configurations in Clone Evalution: A Replication Study. In 8th International Symposium on Search-based Software Engineering (SSBSE): Challenge Track, 2016. North Carolina, USA.
Download: Preprint DOI: 10.1007/978-3-319-47106-8_20 Slideshow: SlideShare
Below are publications of undergraduate students that I advised when I was at the faculty of ICT, Mahidol University:
P. Janviriya, T. Ongarjithichai, P. Numruktrakul, C. Ragkhitwetsagul (2014). CloudyDays : Cloud Storage Integration System. In Student Project Conference (ICT-ISPC), 2014 Third ICT International (pp. 125–128). Nakhonpathom, Thailand.
Download: DOI: 10.1109/ICT-ISPC.2014.6923233
P. Hathaiwichian, L. Siriwittayacharoen, A. Wongwachirawanich, C. Ragkhitwetsagul (2014). Android Application for Event Management and Information Propagation. In Student Project Conference (ICT-ISPC), 2014 Third ICT International (pp. 139–142). Nakhonpathom, Thailand.
Download: DOI: 10.1109/ICT-ISPC.2014.6923236
Presentations
21/2/2017: Using Compilation/Decompilation to Enhance Clone Detection: The slides of my talk at IWSC '17.
3/10/2016: Similarity of Source Code in the Presence of Pervasive Modifications: The slides of my talk at SCAM '16.
9/10/2016: Searching for Configurations in Clone Evalution: A Replication Study: The slides of my talk at SSBSE '16 (Challenge Track).
15/06/2016: Similarity of Source Code in the Presence of Pervasive Modifications: The slides of my talk at the 12th International Summer School on Software Engineering (Student Talk) covering the complete results of CloPlag experiment.
01/06/2015: CloPlag: A Study of Effects of Code Obfuscation to Similarity Detection Tools: Latest update of CloPlag study with more results! It was given at COW 42 Annual Research Review of CREST.
06/02/2015: CloPlag: A Study of Effects of Code Obfuscation to Clone/Plagiarism Detection Tools: A presentation of intial results from the experiment on effects of code obfuscation to current similarity detection tools. It was given at a CREST Monthly Meeting.
Worth Reading
Interesting papers or articles regarding doing a PhD and conducting reserach in general.
- Schwartz, M. A., THE IMPORTANCE OF STUPIDITY IN SCIENTIFIC RESEARCH. Journal of Cell Science. 2008; 121, 1771.
- Harman, M., Draft Guidelines for My Students on Writing Software Engineering Research Papers.
- Jones, S. P., How to write a great research paper. There is also his talk at Cambridge: here
- Peyton Jones, S. L., Hughes, J., & Launchbury, J. How to give a good research talk. 1993.
- Jeff Offutt, Editorial: Standards for reviewing papers. Softw. Test. Verif. Reliab. 2007; 17:135–136