All Projects → Dont Stop Pretraining → Similar Projects or Alternatives

659 Open source projects that are alternatives of or similar to Dont Stop Pretraining

Pytorch Transformers Classification
Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks. Contains code to easily train BERT, XLNet, RoBERTa, and XLM models for text classification.
Stars: ✭ 229 (-5.76%)
Bert Vocab Builder
Builds wordpiece(subword) vocabulary compatible for Google Research's BERT
Stars: ✭ 187 (-23.05%)
Neat Vision
Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tasks. (framework-agnostic)
Stars: ✭ 213 (-12.35%)
Deep Generative Models For Natural Language Processing
DGMs for NLP. A roadmap.
Stars: ✭ 185 (-23.87%)
Pykakasi
NLP: Convert Japanese Kana-kanji sentences into Kana-Roman in simple algorithm.
Stars: ✭ 238 (-2.06%)
Id Nlp Resource
A list of Indonesian NLP resources.
Stars: ✭ 185 (-23.87%)
Shifterator
Interpretable data visualizations for understanding how texts differ at the word level
Stars: ✭ 209 (-13.99%)
Dkpro Core
Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
Stars: ✭ 184 (-24.28%)
Catalyst
🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the box support for training word and document embeddings, and flexible entity recognition models.
Stars: ✭ 224 (-7.82%)
Texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 2,236 (+820.16%)
Kagnet
Knowledge-Aware Graph Networks for Commonsense Reasoning (EMNLP-IJCNLP 19)
Stars: ✭ 205 (-15.64%)
Bert Sklearn
a sklearn wrapper for Google's BERT model
Stars: ✭ 182 (-25.1%)
Tensorflow qrnn
QRNN implementation for TensorFlow
Stars: ✭ 241 (-0.82%)
Kb Infobot
A dialogue bot for information access
Stars: ✭ 181 (-25.51%)
Hardware Aware Transformers
[ACL 2020] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
Stars: ✭ 206 (-15.23%)
Nlp profiler
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (-25.51%)
Catalyst
Accelerated deep learning R&D
Stars: ✭ 2,804 (+1053.91%)
Cookiecutter Spacy Fastapi
Cookiecutter API for creating Custom Skills for Azure Search using Python and Docker
Stars: ✭ 179 (-26.34%)
Pytorch Beam Search Decoding
PyTorch implementation of beam search decoding for seq2seq models
Stars: ✭ 204 (-16.05%)
Nel
Entity linking framework
Stars: ✭ 176 (-27.57%)
Pytorch Bert Crf Ner
KoBERT와 CRF로 만든 한국어 개체명인식기 (BERT+CRF based Named Entity Recognition model for Korean)
Stars: ✭ 236 (-2.88%)
Cleannlp
R package providing annotators and a normalized data model for natural language processing
Stars: ✭ 174 (-28.4%)
Stringi
THE String Processing Package for R (with ICU)
Stars: ✭ 204 (-16.05%)
Transformers.jl
Julia Implementation of Transformer models
Stars: ✭ 173 (-28.81%)
Machine Learning Notebooks
Machine Learning notebooks for refreshing concepts.
Stars: ✭ 222 (-8.64%)
Multimodal Sentiment Analysis
Attention-based multimodal fusion for sentiment analysis
Stars: ✭ 172 (-29.22%)
Pytorch graph Rel
A PyTorch implementation of GraphRel
Stars: ✭ 204 (-16.05%)
Spark Nlp
State of the Art Natural Language Processing
Stars: ✭ 2,518 (+936.21%)
Jack
Jack the Reader
Stars: ✭ 242 (-0.41%)
Syfertext
A privacy preserving NLP framework
Stars: ✭ 170 (-30.04%)
Gluon Nlp
NLP made easy
Stars: ✭ 2,344 (+864.61%)
Vntk
Vietnamese NLP Toolkit for Node
Stars: ✭ 170 (-30.04%)
Bert4doc Classification
Code and source for paper ``How to Fine-Tune BERT for Text Classification?``
Stars: ✭ 220 (-9.47%)
Data Science Toolkit
Collection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-30.45%)
Claf
CLaF: Open-Source Clova Language Framework
Stars: ✭ 196 (-19.34%)
Acl Anthology
Data and software for building the ACL Anthology.
Stars: ✭ 168 (-30.86%)
Deepnlp Models Pytorch
Pytorch implementations of various Deep NLP models in cs-224n(Stanford Univ)
Stars: ✭ 2,760 (+1035.8%)
Lineflow
⚡️A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
Stars: ✭ 168 (-30.86%)
Polyai Models
Neural Models for Conversational AI
Stars: ✭ 195 (-19.75%)
Turkish Stemmer Python
🐍 Turkish Language Stemmer for Python
Stars: ✭ 165 (-32.1%)
Ai Job Resume
AI 算法岗简历模板
Stars: ✭ 219 (-9.88%)
Newsrecommender
A news recommendation system tailored for user communities
Stars: ✭ 164 (-32.51%)
Decanlp
The Natural Language Decathlon: A Multitask Challenge for NLP
Stars: ✭ 2,255 (+827.98%)
Lazynlp
Library to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+716.87%)
Cmrc2018
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
Stars: ✭ 238 (-2.06%)
Covid Papers Browser
Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠 📖
Stars: ✭ 161 (-33.74%)
Parallax
Tool for interactive embeddings visualization
Stars: ✭ 192 (-20.99%)
Nlp bahasa resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-34.98%)
Aidl kb
A Knowledge Base for the FB Group Artificial Intelligence and Deep Learning (AIDL)
Stars: ✭ 219 (-9.88%)
Pytorch Nlp
Basic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (+721.4%)
Displacy Ent
💥 displaCy-ent.js: An open-source named entity visualiser for the modern web
Stars: ✭ 191 (-21.4%)
Mishkal
Mishkal is an arabic text vocalization software
Stars: ✭ 158 (-34.98%)
Prodigy Recipes
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
Stars: ✭ 229 (-5.76%)
Dostoevsky
Sentiment analysis library for russian language
Stars: ✭ 191 (-21.4%)
Bertviz
Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert, XLNet, RoBERTa, CTRL, etc.)
Stars: ✭ 3,443 (+1316.87%)
Summarization Papers
Summarization Papers
Stars: ✭ 238 (-2.06%)
Malaya
Natural Language Toolkit for bahasa Malaysia, https://malaya.readthedocs.io/
Stars: ✭ 239 (-1.65%)
Wordgcn
ACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
Stars: ✭ 230 (-5.35%)
Lit
The Language Interpretability Tool: Interactively analyze NLP models for model understanding in an extensible and framework agnostic interface.
Stars: ✭ 2,721 (+1019.75%)
Germanwordembeddings
Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (-22.22%)
61-120 of 659 similar projects