Pytorch Transformers Classification
Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks. Contains code to easily train BERT, XLNet, RoBERTa, and XLM models for text classification.
Stars: ✭ 229 (-5.76%)
Bert Vocab Builder
Builds a WordPiece (subword) vocabulary compatible with Google Research's BERT
Stars: ✭ 187 (-23.05%)
Neat Vision
Neat (Neural Attention) Vision is a framework-agnostic visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tasks.
Stars: ✭ 213 (-12.35%)
Pykakasi
NLP: Converts Japanese Kana-kanji sentences into Kana-Roman using a simple algorithm.
Stars: ✭ 238 (-2.06%)
Shifterator
Interpretable data visualizations for understanding how texts differ at the word level
Stars: ✭ 209 (-13.99%)
Dkpro Core
Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
Stars: ✭ 184 (-24.28%)
Catalyst
🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the-box support for training word and document embeddings, and flexible entity recognition models.
Stars: ✭ 224 (-7.82%)
Texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 2,236 (+820.16%)
Kagnet
Knowledge-Aware Graph Networks for Commonsense Reasoning (EMNLP-IJCNLP 19)
Stars: ✭ 205 (-15.64%)
Bert Sklearn
A sklearn wrapper for Google's BERT model
Stars: ✭ 182 (-25.1%)
Kb Infobot
A dialogue bot for information access
Stars: ✭ 181 (-25.51%)
Hardware Aware Transformers
[ACL 2020] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
Stars: ✭ 206 (-15.23%)
Nlp profiler
A simple NLP library that allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (-25.51%)
Catalyst
Accelerated deep learning R&D
Stars: ✭ 2,804 (+1053.91%)
Cookiecutter Spacy Fastapi
Cookiecutter API for creating Custom Skills for Azure Search using Python and Docker
Stars: ✭ 179 (-26.34%)
Nel
Entity linking framework
Stars: ✭ 176 (-27.57%)
Pytorch Bert Crf Ner
Korean named entity recognizer built with KoBERT and CRF (BERT+CRF based Named Entity Recognition model for Korean)
Stars: ✭ 236 (-2.88%)
Cleannlp
R package providing annotators and a normalized data model for natural language processing
Stars: ✭ 174 (-28.4%)
Stringi
THE String Processing Package for R (with ICU)
Stars: ✭ 204 (-16.05%)
Transformers.jl
Julia Implementation of Transformer models
Stars: ✭ 173 (-28.81%)
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+936.21%)
Jack
Jack the Reader
Stars: ✭ 242 (-0.41%)
Syfertext
A privacy-preserving NLP framework
Stars: ✭ 170 (-30.04%)
Gluon Nlp
NLP made easy
Stars: ✭ 2,344 (+864.61%)
Vntk
Vietnamese NLP Toolkit for Node
Stars: ✭ 170 (-30.04%)
Bert4doc Classification
Code and source for the paper "How to Fine-Tune BERT for Text Classification?"
Stars: ✭ 220 (-9.47%)
Data Science Toolkit
Collection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-30.45%)
Claf
CLaF: Open-Source Clova Language Framework
Stars: ✭ 196 (-19.34%)
Acl Anthology
Data and software for building the ACL Anthology.
Stars: ✭ 168 (-30.86%)
Deepnlp Models Pytorch
PyTorch implementations of various Deep NLP models in CS-224n (Stanford Univ.)
Stars: ✭ 2,760 (+1035.8%)
Lineflow
⚡️ A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
Stars: ✭ 168 (-30.86%)
Polyai Models
Neural Models for Conversational AI
Stars: ✭ 195 (-19.75%)
Newsrecommender
A news recommendation system tailored for user communities
Stars: ✭ 164 (-32.51%)
Decanlp
The Natural Language Decathlon: A Multitask Challenge for NLP
Stars: ✭ 2,255 (+827.98%)
Lazynlp
Library to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+716.87%)
Cmrc2018
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
Stars: ✭ 238 (-2.06%)
Covid Papers Browser
Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠 📖
Stars: ✭ 161 (-33.74%)
Parallax
Tool for interactive embeddings visualization
Stars: ✭ 192 (-20.99%)
Nlp bahasa resources
A Curated List of Datasets and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-34.98%)
Aidl kb
A Knowledge Base for the FB Group Artificial Intelligence and Deep Learning (AIDL)
Stars: ✭ 219 (-9.88%)
Pytorch Nlp
Basic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (+721.4%)
Displacy Ent
💥 displaCy-ent.js: An open-source named entity visualiser for the modern web
Stars: ✭ 191 (-21.4%)
Mishkal
Mishkal is Arabic text vocalization software
Stars: ✭ 158 (-34.98%)
Prodigy Recipes
🍳 Recipes for Prodigy, our fully scriptable annotation tool
Stars: ✭ 229 (-5.76%)
Dostoevsky
Sentiment analysis library for the Russian language
Stars: ✭ 191 (-21.4%)
Bertviz
Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert, XLNet, RoBERTa, CTRL, etc.)
Stars: ✭ 3,443 (+1316.87%)
Malaya
Natural Language Toolkit for Bahasa Malaysia, https://malaya.readthedocs.io/
Stars: ✭ 239 (-1.65%)
Wordgcn
ACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
Stars: ✭ 230 (-5.35%)
Lit
The Language Interpretability Tool: Interactively analyze NLP models for model understanding in an extensible and framework-agnostic interface.
Stars: ✭ 2,721 (+1019.75%)
Germanwordembeddings
Toolkit to obtain and preprocess German corpora, train models using word2vec (gensim), and evaluate them with generated test sets
Stars: ✭ 189 (-22.22%)