All Projects → chnsh → deep-semantic-code-search

chnsh / deep-semantic-code-search

Licence: MIT license
Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search application

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to deep-semantic-code-search

Hierarchical Attention Networks Pytorch
Hierarchical Attention Networks for document classification
Stars: ✭ 239 (+279.37%)
Mutual labels:  nlp-machine-learning
cross target stance classification
Keras implementation for CrossNet in paper "Cross-Target Stance Classification with Self-Attention Networks"
Stars: ✭ 21 (-66.67%)
Mutual labels:  nlp-machine-learning
topic modelling financial news
Topic modelling on financial news with Natural Language Processing
Stars: ✭ 51 (-19.05%)
Mutual labels:  nlp-machine-learning
presidential-rnn
Project 4 for Metis bootcamp. Objective was generation of character-level RNN trained on Donald Trump's statements using Keras. Also generated Markov chains, and quick pyTorch RNN as baseline. Attempted semi-supervised GAN, but was unable to test in time.
Stars: ✭ 26 (-58.73%)
Mutual labels:  nlp-machine-learning
query-wellformedness
25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.
Stars: ✭ 80 (+26.98%)
Mutual labels:  nlp-machine-learning
NLP-Flask-Website
A simple Flask website for all NLP tasks which includes Text Preprocessing, Keyword Extraction, Text Summarization etc. Created Date: 30 Jan 2019
Stars: ✭ 43 (-31.75%)
Mutual labels:  nlp-machine-learning
Machine Learning Resources
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (+258.73%)
Mutual labels:  nlp-machine-learning
DeepLearningReading
Deep Learning and Machine Learning mini-projects. Current Project: Deepmind Attentive Reader (rc-data)
Stars: ✭ 78 (+23.81%)
Mutual labels:  nlp-machine-learning
awesome-yoruba-nlp
📖 A curated list of resources dedicated to Natural Language Processing (NLP) in the Yoruba Language.
Stars: ✭ 21 (-66.67%)
Mutual labels:  nlp-machine-learning
Sumrized
Automatic Text Summarization (English/Arabic).
Stars: ✭ 37 (-41.27%)
Mutual labels:  nlp-machine-learning
pfootprint
Political Discourse Analysis Using Pre-Trained Word Vectors.
Stars: ✭ 20 (-68.25%)
Mutual labels:  nlp-machine-learning
merkalysis
A marketing tool that helps you to market your products using organic marketing. This tool can potentially save you 1000s of dollars every year. The tool predicts the reach of your posts on social media and also suggests you hashtags for captions in such a way that it increases your reach.
Stars: ✭ 28 (-55.56%)
Mutual labels:  nlp-machine-learning
fake-news
This is a further development of the kdnuggets article on fake news classification by George McIntyre
Stars: ✭ 15 (-76.19%)
Mutual labels:  nlp-machine-learning
Paribhasha
paribhasha.herokuapp.com/
Stars: ✭ 21 (-66.67%)
Mutual labels:  nlp-machine-learning
empythy
Automated NLP sentiment predictions- batteries included, or use your own data
Stars: ✭ 17 (-73.02%)
Mutual labels:  nlp-machine-learning
Nemo
NeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+5749.21%)
Mutual labels:  nlp-machine-learning
SENet-for-Weakly-Supervised-Relation-Extraction
No description or website provided.
Stars: ✭ 39 (-38.1%)
Mutual labels:  nlp-machine-learning
Machine-Learning-Models
In This repository I made some simple to complex methods in machine learning. Here I try to build template style code.
Stars: ✭ 30 (-52.38%)
Mutual labels:  nlp-machine-learning
Entity Embedding
Reference implementation of the paper "Word Embeddings for Entity-annotated Texts"
Stars: ✭ 19 (-69.84%)
Mutual labels:  nlp-machine-learning
scicle-stopclickbait
Userscript that changes Clickbait headlines by headlines more honest to the news it links to.
Stars: ✭ 16 (-74.6%)
Mutual labels:  nlp-machine-learning

Deep Semantic Code Search

Code for Paper: Paper

Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search application.

In these experiments, there are 2 parts:

  1. The first one uses an approach suggested in [1] and we train their architecture on our own python dataset.
  2. The second approach expands on the first one through methodology suggested in [2] and we achieve reasonably good results.

We can observe that some sort of semantic information is captured the results:

Query Results

Instructions on reproducing our results

Implementation of [1] is within Joint Training Model and [2] is within Code Summarization Transfer Learning

Dataset

For [1], our dataset is provided within Joint Training Model. For [2], the full dataset is available on Google Cloud Platform.

For how to access data on GCP, please follow this link https://cloud.google.com/storage/docs/access-public-data

References:

[1] https://guxd.github.io/papers/deepcs.pdf

[2] https://towardsdatascience.com/semantic-code-search-3cd6d244a39c

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].