danhper / bigcode-tools

License: MIT
Set of tools to help working with "Big Code"

Programming Languages

Python, Java, Scala, JavaScript

bigcode-tools

A set of tools to help working with "Big Code".

This repository contains multiple tools to fetch source code, transform source code into ASTs, visualize the generated ASTs, and learn embeddings for AST nodes.
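To give a rough idea of what "transforming source code into an AST" means, here is a minimal sketch using only Python's standard `ast` module. It does not use the bigcode-tools API and is not how this project generates its ASTs. The function name `node_type_counts` is hypothetical and exists only for this illustration.

```python
import ast
from collections import Counter


def node_type_counts(source: str) -> Counter:
    """Parse Python source and count how often each AST node type occurs.

    Illustration only: this relies on the standard library, not on bigcode-tools.
    """
    tree = ast.parse(source)
    # ast.walk visits every node in the tree, in no guaranteed order
    return Counter(type(node).__name__ for node in ast.walk(tree))


if __name__ == "__main__":
    snippet = "def add(a, b):\n    return a + b\n"
    for name, count in node_type_counts(snippet).most_common():
        print(name, count)
```

Node types such as `FunctionDef`, `BinOp`, or `Name` are the kind of vocabulary over which embeddings for AST nodes could be learned.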

The repository is currently composed of several subprojects.

Take a look at the tutorial to get started.

If you are using this for academic work, we would be thankful if you could cite the following paper.

@inproceedings{Perez:2019:CCD:3341883.3341965,
 author = {Perez, Daniel and Chiba, Shigeru},
 title = {Cross-language Clone Detection by Learning over Abstract Syntax Trees},
 booktitle = {Proceedings of the 16th International Conference on Mining Software Repositories},
 series = {MSR '19},
 year = {2019},
 location = {Montreal, Quebec, Canada},
 pages = {518--528},
 numpages = {11},
 url = {https://doi.org/10.1109/MSR.2019.00078},
 doi = {10.1109/MSR.2019.00078},
 acmid = {3341965},
 publisher = {IEEE Press},
 address = {Piscataway, NJ, USA},
 keywords = {clone detection, machine learning, source code representation},
}