All Projects → Tokenizers → Similar Projects or Alternatives

1136 Open source projects that are alternatives of or similar to Tokenizers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Stars: ✭ 55,742 (+997.93%)

Mutual labels: natural-language-processing, language-model, natural-language-understanding, bert

Implementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with TensorFlow 2.0 and Keras.

Stars: ✭ 203 (-96%)

Mutual labels: natural-language-processing, language-model, natural-language-understanding

wechsel

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.

Stars: ✭ 39 (-99.23%)

Mutual labels: transformers, language-model, bert

backprop

Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

Stars: ✭ 229 (-95.49%)

Mutual labels: transformers, language-model, bert

label-studio-transformers

Label data using HuggingFace's transformers and automatically get a prediction service

Stars: ✭ 117 (-97.7%)

Mutual labels: transformers, bert, natural-language-understanding

classy

classy is a simple-to-use library for building high-performance Machine Learning models in NLP.

Stars: ✭ 61 (-98.8%)

Mutual labels: transformers, bert, natural-language-understanding

Easy Bert

A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)

Stars: ✭ 106 (-97.91%)

Mutual labels: natural-language-processing, language-model, natural-language-understanding

Clue

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Stars: ✭ 2,425 (-52.24%)

Mutual labels: language-model, transformers, bert

text2class

Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT

Stars: ✭ 15 (-99.7%)

Mutual labels: transformers, bert, natural-language-understanding

Spacy Transformers

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

Stars: ✭ 919 (-81.9%)

Mutual labels: natural-language-processing, language-model, natural-language-understanding

COCO-LM

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

Stars: ✭ 109 (-97.85%)

Mutual labels: transformers, language-model, natural-language-understanding

Chars2vec

Character-based word embeddings model based on RNN for handling real world texts

Stars: ✭ 130 (-97.44%)

Mutual labels: natural-language-processing, language-model, natural-language-understanding

Bert As Service

Mapping a variable-length sentence to a fixed-length vector using BERT model

Stars: ✭ 9,779 (+92.61%)

Mutual labels: natural-language-processing, natural-language-understanding, bert

Haystack

🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.

Stars: ✭ 3,409 (-32.85%)

Mutual labels: language-model, transformers, bert

Spark Nlp

State of the Art Natural Language Processing

Stars: ✭ 2,518 (-50.4%)

Mutual labels: natural-language-processing, transformers, bert

Pytorch Sentiment Analysis

Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

Stars: ✭ 3,209 (-36.79%)

Mutual labels: natural-language-processing, transformers, bert

Bert Pytorch

Google AI 2018 BERT pytorch implementation

Stars: ✭ 4,642 (-8.57%)

Mutual labels: language-model, bert

TabFormer

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)

Stars: ✭ 209 (-95.88%)

Mutual labels: gpt, bert

question generator

An NLP system for generating reading comprehension questions

Stars: ✭ 188 (-96.3%)

Mutual labels: transformers, bert

FasterTransformer

Transformer related optimization, including BERT, GPT

Stars: ✭ 1,571 (-69.06%)

Mutual labels: gpt, bert

oreilly-bert-nlp

This repository contains code for the O'Reilly Live Online Training for BERT

Stars: ✭ 19 (-99.63%)

Mutual labels: transformers, bert

Romanian-Transformers

This repo is the home of Romanian Transformers.

Stars: ✭ 60 (-98.82%)

Mutual labels: language-model, bert

Ernie

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Stars: ✭ 4,659 (-8.23%)

Mutual labels: natural-language-processing, bert

Tf Seq2seq

Sequence to sequence learning using TensorFlow.

Stars: ✭ 387 (-92.38%)

Mutual labels: natural-language-processing, natural-language-understanding

anonymisation

Anonymization of legal cases (Fr) based on Flair embeddings

Stars: ✭ 85 (-98.33%)

Mutual labels: transformers, bert

Text-Summarization

Abstractive and Extractive Text summarization using Transformers.

Stars: ✭ 38 (-99.25%)

Mutual labels: transformers, bert

bert extension tf

BERT Extension in TensorFlow

Stars: ✭ 29 (-99.43%)

Mutual labels: bert, natural-language-understanding

Fill-the-GAP

[ACL-WS] 4th place solution to gendered pronoun resolution challenge on Kaggle

Stars: ✭ 13 (-99.74%)

Mutual labels: bert, natural-language-understanding

Basic-UI-for-GPT-J-6B-with-low-vram

A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model loading takes 12gb free ram.

Stars: ✭ 90 (-98.23%)

Mutual labels: transformers, gpt

KB-ALBERT

KB국민은행에서 제공하는 경제/금융 도메인에 특화된 한국어 ALBERT 모델

Stars: ✭ 215 (-95.77%)

Mutual labels: transformers, language-model

bert-movie-reviews-sentiment-classifier

Build a Movie Reviews Sentiment Classifier with Google's BERT Language Model

Stars: ✭ 12 (-99.76%)

Mutual labels: language-model, bert

DiscEval

Discourse Based Evaluation of Language Understanding

Stars: ✭ 18 (-99.65%)

Mutual labels: bert, natural-language-understanding

Practical Nlp

Official Repository for 'Practical Natural Language Processing' by O'Reilly Media

Stars: ✭ 452 (-91.1%)

Mutual labels: natural-language-processing, natural-language-understanding

gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

Stars: ✭ 216 (-95.75%)

Mutual labels: transformers, bert

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Stars: ✭ 2,828 (-44.3%)

Mutual labels: transformers, bert

minGPT-TF

A minimal TF2 re-implementation of the OpenAI GPT training

Stars: ✭ 36 (-99.29%)

Mutual labels: gpt, language-model

NLP-paper

🎨 🎨NLP 自然语言处理教程 🎨🎨 https://dataxujing.github.io/NLP-paper/

Stars: ✭ 23 (-99.55%)

Mutual labels: gpt, bert

Pytorch-NLU

Pytorch-NLU，一个中文文本分类、序列标注工具包，支持中文长文本、短文本的多类、多标签分类任务，支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…

Stars: ✭ 151 (-97.03%)

Mutual labels: transformers, bert

gpt-j-api

API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend

Stars: ✭ 248 (-95.12%)

Mutual labels: gpt, language-model

language-planner

Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"

Stars: ✭ 84 (-98.35%)

Mutual labels: transformers, language-model

GoEmotions-pytorch

Pytorch Implementation of GoEmotions 😍😢😱

Stars: ✭ 95 (-98.13%)

Mutual labels: transformers, bert

COVID-19-Tweet-Classification-using-Roberta-and-Bert-Simple-Transformers

Rank 1 / 216

Stars: ✭ 24 (-99.53%)

Mutual labels: transformers, bert

bert-squeeze

🛠️ Tools for Transformers compression using PyTorch Lightning ⚡

Stars: ✭ 56 (-98.9%)

Mutual labels: transformers, bert

TorchBlocks

A PyTorch-based toolkit for natural language processing

Stars: ✭ 85 (-98.33%)

Mutual labels: transformers, bert

robo-vln

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

Stars: ✭ 34 (-99.33%)

Mutual labels: transformers, bert

Botlibre

An open platform for artificial intelligence, chat bots, virtual agents, social media automation, and live chat automation.

Stars: ✭ 412 (-91.88%)

Mutual labels: natural-language-processing, natural-language-understanding

WSDM-Cup-2019

[ACM-WSDM] 3rd place solution at WSDM Cup 2019, Fake News Classification on Kaggle.

Stars: ✭ 62 (-98.78%)

Mutual labels: bert, natural-language-understanding

minicons

Utility for analyzing Transformer based representations of language.

Stars: ✭ 28 (-99.45%)

Mutual labels: transformers, language-model

Text and Audio classification with Bert

Text Classification in Turkish Texts with Bert

Stars: ✭ 34 (-99.33%)

Mutual labels: transformers, bert

OpenDialog

An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统，一键部署微信闲聊机器人)

Stars: ✭ 94 (-98.15%)

Mutual labels: transformers, bert

bangla-bert

Bangla-Bert is a pretrained bert model for Bengali language

Stars: ✭ 41 (-99.19%)

Mutual labels: transformers, bert

HugsVision

HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision

Stars: ✭ 154 (-96.97%)

Mutual labels: transformers, bert

golgotha

Contextualised Embeddings and Language Modelling using BERT and Friends using R

Stars: ✭ 39 (-99.23%)

Mutual labels: transformers, bert

text2text

Text2Text: Cross-lingual natural language processing and generation toolkit

Stars: ✭ 188 (-96.3%)

Mutual labels: transformers, bert

erc

Emotion recognition in conversation

Stars: ✭ 34 (-99.33%)

Mutual labels: transformers, bert

policy-data-analyzer

Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.

Stars: ✭ 22 (-99.57%)

Mutual labels: transformers, bert

Bluebert

BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).

Stars: ✭ 273 (-94.62%)

Mutual labels: natural-language-processing, language-model

Autonlp

🤗 AutoNLP: train state-of-the-art natural language processing models and deploy them in a scalable environment automatically

Stars: ✭ 263 (-94.82%)

Mutual labels: natural-language-processing, natural-language-understanding

Oie Resources

A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.