Awesome Persian Nlp IrCurated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+139.58%)
CoarijCorpus of Annual Reports in Japan
Stars: ✭ 55 (-71.35%)
Typing AssistantTyping Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.
Stars: ✭ 32 (-83.33%)
Ja.text8Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-58.85%)
FakenewscorpusA dataset of millions of news articles scraped from a curated list of data sources.
Stars: ✭ 255 (+32.81%)
Efaqa Corpus Zh❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库
Stars: ✭ 170 (-11.46%)
QuantedaAn R package for the Quantitative Analysis of Textual Data
Stars: ✭ 647 (+236.98%)
Nlp bahasa resourcesA Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-17.71%)
Ua GecUA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Stars: ✭ 108 (-43.75%)
ProsodyHelsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-27.6%)
CleannlpR package providing annotators and a normalized data model for natural language processing
Stars: ✭ 174 (-9.37%)
Transformers.jlJulia Implementation of Transformer models
Stars: ✭ 173 (-9.9%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (-2.08%)
Dkpro CoreCollection of software components for natural language processing (NLP) based on the Apache UIMA framework.
Stars: ✭ 184 (-4.17%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+1211.46%)
SyfertextA privacy preserving NLP framework
Stars: ✭ 170 (-11.46%)
TexarToolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 2,236 (+1064.58%)
VntkVietnamese NLP Toolkit for Node
Stars: ✭ 170 (-11.46%)
Data Science ToolkitCollection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-11.98%)
DostoevskySentiment analysis library for russian language
Stars: ✭ 191 (-0.52%)
Hunspell Dict KoKorean spellchecking dictionary for Hunspell
Stars: ✭ 187 (-2.6%)
Bert Sklearna sklearn wrapper for Google's BERT model
Stars: ✭ 182 (-5.21%)
Acl AnthologyData and software for building the ACL Anthology.
Stars: ✭ 168 (-12.5%)
FastnlpfastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Stars: ✭ 2,441 (+1171.35%)
NeuralqaNeuralQA: A Usable Library for Question Answering on Large Datasets with BERT
Stars: ✭ 185 (-3.65%)
GermanwordembeddingsToolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (-1.56%)
Deep Math Machine Learning.aiA blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.
Stars: ✭ 173 (-9.9%)
GladGlobal-Locally Self-Attentive Dialogue State Tracker
Stars: ✭ 185 (-3.65%)
Knockknock🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code
Stars: ✭ 2,304 (+1100%)
DelbotIt understands your voice commands, searches news and knowledge sources, and summarizes and reads out content to you.
Stars: ✭ 191 (-0.52%)
Dive Into Dl Pytorch本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
Stars: ✭ 14,234 (+7313.54%)
HntitlenatorTest your HN title against a neural network
Stars: ✭ 184 (-4.17%)
NotebooksJupyter Notebooks with Deep Learning Tutorials
Stars: ✭ 188 (-2.08%)
Open SesameA frame-semantic parsing system based on a softmax-margin SegRNN.
Stars: ✭ 170 (-11.46%)
ErnieSimple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.
Stars: ✭ 170 (-11.46%)
Pyss3A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (-0.52%)
TextvecText vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (-13.02%)
Sentence SimilarityThis repository contains various ways to calculate sentence vector similarity using NLP models
Stars: ✭ 182 (-5.21%)
Lineflow⚡️A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
Stars: ✭ 168 (-12.5%)
Question generationIt is a question-generator model. It takes text and an answer as input and outputs a question.
Stars: ✭ 166 (-13.54%)
Deep Survey Text ClassificationThe project surveys 16+ Natural Language Processing (NLP) research papers that propose novel Deep Neural Network Models for Text Classification, based on Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN). It also implements each of the models using Tensorflow and Keras.
Stars: ✭ 187 (-2.6%)
Kb InfobotA dialogue bot for information access
Stars: ✭ 181 (-5.73%)
FixyAmacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
Stars: ✭ 165 (-14.06%)
Deeptoxictop 1% solution to toxic comment classification challenge on Kaggle.
Stars: ✭ 180 (-6.25%)
NewsrecommenderA news recommendation system tailored for user communities
Stars: ✭ 164 (-14.58%)
Nlp4rec PapersPaper list of NLP for recommender systems
Stars: ✭ 162 (-15.62%)
PhrasalA large-scale statistical machine translation system written in Java.
Stars: ✭ 190 (-1.04%)
Bert Vocab BuilderBuilds wordpiece(subword) vocabulary compatible for Google Research's BERT
Stars: ✭ 187 (-2.6%)
Nlp profilerA simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (-5.73%)
LazynlpLibrary to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+933.85%)
UdpipeR package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (-16.67%)
StopwordsDefault English stopword lists from many different sources
Stars: ✭ 179 (-6.77%)