1136 open source projects that are alternatives to or similar to Tokenizers

xpandas
Universal 1d/2d data containers with Transformers functionality for data analysis.
Stars: ✭ 25 (-99.51%)
Mutual labels:  transformers
Vntk
Vietnamese NLP Toolkit for Node
Stars: ✭ 170 (-96.65%)
iamQA
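Chinese Wikipedia QA reading-comprehension question-answering system, using an NER model trained on CCKS2016 data and a CMRC2018 reading-comprehension model, plus W2V word-vector search, deployed with torchserve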
Stars: ✭ 46 (-99.09%)
Mutual labels:  bert
Data Science Toolkit
Collection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-96.67%)
open clip
An open source implementation of CLIP.
Stars: ✭ 1,534 (-69.79%)
Mutual labels:  language-model
Acl Anthology
Data and software for building the ACL Anthology.
Stars: ✭ 168 (-96.69%)
Chakin
Simple downloader for pre-trained word vectors
Stars: ✭ 323 (-93.64%)
Lineflow
⚡️A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
Stars: ✭ 168 (-96.69%)
mongolian-nlp
Useful resources for Mongolian NLP
Stars: ✭ 119 (-97.66%)
Mutual labels:  language-model
Turkish Stemmer Python
🐍 Turkish Language Stemmer for Python
Stars: ✭ 165 (-96.75%)
TEXTOIR
TEXTOIR is a flexible toolkit for open intent detection and discovery. (ACL 2021)
Stars: ✭ 31 (-99.39%)
Mutual labels:  bert
Newsrecommender
A news recommendation system tailored for user communities
Stars: ✭ 164 (-96.77%)
cscg
Code Generation as a Dual Task of Code Summarization.
Stars: ✭ 28 (-99.45%)
Mutual labels:  language-model
py-lingualytics
A text analytics library with support for codemixed data
Stars: ✭ 36 (-99.29%)
Mutual labels:  bert
Covid Papers Browser
Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠 📖
Stars: ✭ 161 (-96.83%)
eve-bot
EVE bot, a customer service chatbot to enhance virtual engagement for Twitter Apple Support
Stars: ✭ 31 (-99.39%)
Mutual labels:  transformers
bert-sentiment
Fine-grained Sentiment Classification Using BERT
Stars: ✭ 49 (-99.03%)
Mutual labels:  bert
Nlp bahasa resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-96.89%)
text-generation-transformer
text generation based on transformer
Stars: ✭ 36 (-99.29%)
Mutual labels:  bert
Pytorch Nlp
Basic Utilities for PyTorch Natural Language Processing (NLP)
Stars: ✭ 1,996 (-60.69%)
CoLAKE
COLING'2020: CoLAKE: Contextualized Language and Knowledge Embedding
Stars: ✭ 86 (-98.31%)
Mutual labels:  language-model
Mishkal
Mishkal is Arabic text vocalization software
Stars: ✭ 158 (-96.89%)
Ai Deadlines
⏰ AI conference deadline countdowns
Stars: ✭ 3,852 (-24.13%)
Gensim
Topic Modelling for Humans
Stars: ✭ 12,763 (+151.39%)
rasa milktea chatbot
Chatbot with a Chinese BERT model, based on the Rasa framework (Chinese chatbot combining BERT intent analysis, built on the Rasa framework)
Stars: ✭ 97 (-98.09%)
Mutual labels:  bert
kwx
BERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (-99.35%)
Mutual labels:  bert
Visdial Rl
PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning
Stars: ✭ 157 (-96.91%)
Awesome Persian Nlp Ir
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (-90.94%)
Cs224n 2019 Solutions
Complete solutions for Stanford CS224n, winter, 2019
Stars: ✭ 436 (-91.41%)
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-91.96%)
Mutual labels:  language-model
Question generation
Neural question generation using transformers
Stars: ✭ 356 (-92.99%)
Vaaku2Vec
Language Modeling and Text Classification in Malayalam Language using ULMFiT
Stars: ✭ 68 (-98.66%)
Mutual labels:  language-model
pyVHDLParser
Streaming based VHDL parser.
Stars: ✭ 51 (-99%)
Mutual labels:  language-model
vietnamese-roberta
A Robustly Optimized BERT Pretraining Approach for Vietnamese
Stars: ✭ 22 (-99.57%)
Mutual labels:  bert
Rnn lstm from scratch
How to build RNNs and LSTMs from scratch with NumPy.
Stars: ✭ 156 (-96.93%)
alpine-linux-scripts
Alpine Linux Setup Scripts
Stars: ✭ 38 (-99.25%)
Mutual labels:  gpt
Deeplearning nlp
A deep-learning-based natural language processing library
Stars: ✭ 154 (-96.97%)
KoBERT-nsmc
Naver movie review sentiment classification with KoBERT
Stars: ✭ 57 (-98.88%)
Mutual labels:  bert
Paraphrase identification
Examine two sentences and determine whether they have the same meaning.
Stars: ✭ 154 (-96.97%)
Albert zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale Chinese pre-trained ALBERT models
Stars: ✭ 3,500 (-31.06%)
Mutual labels:  bert
Postagga
A Library to parse natural language in pure Clojure and ClojureScript
Stars: ✭ 152 (-97.01%)
A-Personal-Arch-Installation-Guide
A Personal Arch Installation Guide In Case of Amnesia
Stars: ✭ 58 (-98.86%)
Mutual labels:  gpt
Chineseblue
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Stars: ✭ 149 (-97.07%)
transganformer
Implementation of TransGanFormer, an all-attention GAN that combines the findings from the recent GanFormer and TransGan papers
Stars: ✭ 137 (-97.3%)
Mutual labels:  transformers
Spacymoji
💙 Emoji handling and meta data for spaCy with custom extension attributes
Stars: ✭ 151 (-97.03%)
LM-CNLC
Chinese Natural Language Correction via Language Model
Stars: ✭ 15 (-99.7%)
Mutual labels:  language-model
My Cs Degree
A CS degree with a focus on full-stack ML engineering, 2020
Stars: ✭ 391 (-92.3%)
Pycantonese
Cantonese Linguistics and NLP in Python
Stars: ✭ 147 (-97.1%)
long-short-transformer
Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch
Stars: ✭ 103 (-97.97%)
Mutual labels:  transformers
Tree Transformer
Implementation of the paper Tree Transformer
Stars: ✭ 148 (-97.08%)
BangalASR
Transformer based Bangla Speech Recognition
Stars: ✭ 20 (-99.61%)
Mutual labels:  transformers
rnn-theano
RNN(LSTM, GRU) in Theano with mini-batch training; character-level language models in Theano
Stars: ✭ 68 (-98.66%)
Mutual labels:  language-model
transformer-models
Deep Learning Transformer models in MATLAB
Stars: ✭ 90 (-98.23%)
Mutual labels:  bert
Transformer-Implementations
Library - Vanilla, ViT, DeiT, BERT, GPT
Stars: ✭ 34 (-99.33%)
Mutual labels:  transformers
Fengshenbang-LM
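Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.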
Stars: ✭ 1,813 (-64.29%)
Mutual labels:  transformers
Recurrent Entity Networks
TensorFlow implementation of "Tracking the World State with Recurrent Entity Networks".
Stars: ✭ 276 (-94.56%)
MinTL
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Stars: ✭ 61 (-98.8%)
Mutual labels:  language-model
pd3f
🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
Stars: ✭ 132 (-97.4%)
Mutual labels:  language-model
TradeTheEvent
Implementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading." In Findings of ACL2021
Stars: ✭ 64 (-98.74%)
Mutual labels:  bert
KoBERT-Transformers
KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed)
Stars: ✭ 162 (-96.81%)
Mutual labels:  transformers