Chars2vecCharacter-based word embeddings model based on RNN for handling real world texts
Stars: ✭ 130 (-32.29%)
Wikipedia2vecA tool for learning vector representations of words and entities from Wikipedia
Stars: ✭ 655 (+241.15%)
Awesome Persian Nlp IrCurated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+139.58%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+220.31%)
BpembPre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
Stars: ✭ 909 (+373.44%)
MagnitudeA fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+626.04%)
Catalyst🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the box support for training word and document embeddings, and flexible entity recognition models.
Stars: ✭ 224 (+16.67%)
Ner LstmNamed Entity Recognition using multilayered bidirectional LSTM
Stars: ✭ 532 (+177.08%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (-2.08%)
Awesome Embedding ModelsA curated list of awesome embedding models tutorials, projects and communities.
Stars: ✭ 1,486 (+673.96%)
Pytorch NlpBasic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (+939.58%)
CleannlpR package providing annotators and a normalized data model for natural language processing
Stars: ✭ 174 (-9.37%)
Transformers.jlJulia Implementation of Transformer models
Stars: ✭ 173 (-9.9%)
NotebooksJupyter Notebooks with Deep Learning Tutorials
Stars: ✭ 188 (-2.08%)
Dkpro CoreCollection of software components for natural language processing (NLP) based on the Apache UIMA framework.
Stars: ✭ 184 (-4.17%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1211.46%)
Dive Into Dl Pytorch本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
Stars: ✭ 14,234 (+7313.54%)
TexarToolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 2,236 (+1064.58%)
Efaqa Corpus Zh❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库
Stars: ✭ 170 (-11.46%)
Open SesameA frame-semantic parsing system based on a softmax-margin SegRNN.
Stars: ✭ 170 (-11.46%)
PhrasalA large-scale statistical machine translation system written in Java.
Stars: ✭ 190 (-1.04%)
Deep Survey Text ClassificationThe project surveys 16+ Natural Language Processing (NLP) research papers that propose novel Deep Neural Network Models for Text Classification, based on Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN). It also implements each of the models using Tensorflow and Keras.
Stars: ✭ 187 (-2.6%)
Bert Sklearna sklearn wrapper for Google's BERT model
Stars: ✭ 182 (-5.21%)
ErnieSimple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.
Stars: ✭ 170 (-11.46%)
FastnlpfastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Stars: ✭ 2,441 (+1171.35%)
Datastories Semeval2017 Task4Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
Stars: ✭ 184 (-4.17%)
TextvecText vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (-13.02%)
Deep Math Machine Learning.aiA blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.
Stars: ✭ 173 (-9.9%)
GladGlobal-Locally Self-Attentive Dialogue State Tracker
Stars: ✭ 185 (-3.65%)
Knockknock🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code
Stars: ✭ 2,304 (+1100%)
DostoevskySentiment analysis library for russian language
Stars: ✭ 191 (-0.52%)
JodieA PyTorch implementation of ACM SIGKDD 2019 paper "Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks"
Stars: ✭ 172 (-10.42%)
HntitlenatorTest your HN title against a neural network
Stars: ✭ 184 (-4.17%)
SyfertextA privacy preserving NLP framework
Stars: ✭ 170 (-11.46%)
Hunspell Dict KoKorean spellchecking dictionary for Hunspell
Stars: ✭ 187 (-2.6%)
VntkVietnamese NLP Toolkit for Node
Stars: ✭ 170 (-11.46%)
Data Science ToolkitCollection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-11.98%)
Displacy Ent💥 displaCy-ent.js: An open-source named entity visualiser for the modern web
Stars: ✭ 191 (-0.52%)
Acl AnthologyData and software for building the ACL Anthology.
Stars: ✭ 168 (-12.5%)
Sentence SimilarityThis repository contains various ways to calculate sentence vector similarity using NLP models
Stars: ✭ 182 (-5.21%)
Bert Vocab BuilderBuilds wordpiece(subword) vocabulary compatible for Google Research's BERT
Stars: ✭ 187 (-2.6%)
Lineflow⚡️A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
Stars: ✭ 168 (-12.5%)
Kb InfobotA dialogue bot for information access
Stars: ✭ 181 (-5.73%)
Question generationIt is a question-generator model. It takes text and an answer as input and outputs a question.
Stars: ✭ 166 (-13.54%)
Deeptoxictop 1% solution to toxic comment classification challenge on Kaggle.
Stars: ✭ 180 (-6.25%)
FixyAmacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
Stars: ✭ 165 (-14.06%)
NewsrecommenderA news recommendation system tailored for user communities
Stars: ✭ 164 (-14.58%)
ArxivnotesIssuesにNLP(自然言語処理)に関連するの論文を読んだまとめを書いています.雑です.🚧 マークは編集中の論文です(事実上放置のものも多いです).🍡 マークは概要のみ書いてます(早く見れる的な意味で団子).
Stars: ✭ 190 (-1.04%)
Nlp profilerA simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (-5.73%)
Cx db8a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-14.58%)
Nlp4rec PapersPaper list of NLP for recommender systems
Stars: ✭ 162 (-15.62%)
StopwordsDefault English stopword lists from many different sources
Stars: ✭ 179 (-6.77%)
LazynlpLibrary to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+933.85%)
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (-16.67%)