Top 1176 natural-language-processing open source projects

Claf
CLaF: Open-Source Clova Language Framework
Pyhanlp
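Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and Simplified/Traditional Chinese conversion, natural language processing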
Decanlp
The Natural Language Decathlon: A Multitask Challenge for NLP
Nlvr
Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.
Parallax
Tool for interactive embeddings visualization
Pyss3
A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Displacy Ent
💥 displaCy-ent.js: An open-source named entity visualiser for the modern web
Delbot
It understands your voice commands, searches news and knowledge sources, and summarizes and reads out content to you.
Dostoevsky
Sentiment analysis library for the Russian language
Phrasal
A large-scale statistical machine translation system written in Java.
Arxivnotes
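Summaries of NLP (natural language processing) papers I have read, written up in Issues. They are rough. The 🚧 mark indicates papers still being edited (many are effectively abandoned). The 🍡 mark indicates entries with only an overview (a dango, in the sense of something quick to look at).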
Germanwordembeddings
Toolkit to obtain and preprocess German corpora, train models using word2vec (gensim), and evaluate them with generated test sets
Hunspell Dict Ko
Korean spellchecking dictionary for Hunspell
Deep Survey Text Classification
The project surveys 16+ Natural Language Processing (NLP) research papers that propose novel Deep Neural Network Models for Text Classification, based on Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN). It also implements each of the models using Tensorflow and Keras.
Bert Vocab Builder
Builds a WordPiece (subword) vocabulary compatible with Google Research's BERT
Neuralqa
NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT
Id Nlp Resource
A list of Indonesian NLP resources.
Glad
Global-Locally Self-Attentive Dialogue State Tracker
Dkpro Core
Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
Texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Recurrent Convolutional Neural Network Text Classifier
My (slightly modified) Keras implementation of the Recurrent Convolutional Neural Network (RCNN) described here: http://www.aaai.org/ocs/index.php/AAAI/AAAI15/paper/view/9745.
Sentence Similarity
This repository contains various ways to calculate sentence vector similarity using NLP models
Deeptoxic
Top 1% solution to the Toxic Comment Classification Challenge on Kaggle.
Nlp profiler
A simple NLP library that allows profiling datasets with one or more text columns. When given a dataset and the name of a column containing text data, NLP Profiler returns either high-level insights or low-level, granular statistical information about the text in that column.
Stopwords
Default English stopword lists from many different sources
Cookiecutter Spacy Fastapi
Cookiecutter API for creating Custom Skills for Azure Search using Python and Docker
Cs224n 2019
My completed implementation solutions for CS224N 2019
Cleannlp
R package providing annotators and a normalized data model for natural language processing
Deep Math Machine Learning.ai
A blog about machine learning and deep learning algorithms and the math behind them, with machine learning algorithms written from scratch.
Knockknock
🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code
Dive Into Dl Pytorch
This project converts the MXNet implementations in the original book 《动手学深度学习》 (Dive into Deep Learning) to PyTorch.
Syfertext
A privacy preserving NLP framework
Efaqa Corpus Zh
❤️ Emotional First Aid Dataset: a corpus for psychological counseling Q&A and chatbots
Open Sesame
A frame-semantic parsing system based on a softmax-margin SegRNN.
Ernie
Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.
Acl Anthology
Data and software for building the ACL Anthology.
Textvec
Text vectorization tool to outperform TFIDF for classification tasks
Lineflow
⚡️A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
Question generation
It is a question-generator model. It takes text and an answer as input and outputs a question.
Turkish Stemmer Python
🐍 Turkish Language Stemmer for Python
Fixy
Our goal is to build an open source writing assistant and spell checker that tackles many different problems in the Turkish NLP literature together, proposes original approaches, and fills the gaps in existing work. It aims to correct spelling errors in users' texts with a deep learning approach while also performing semantic analysis of those texts, so that it can detect and correct the errors that arise in that context.
Newsrecommender
A news recommendation system tailored for user communities