BrikerMan / Kashgari

Licence: apache-2.0

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Programming Languages

python

139335 projects - #7 most used programming language

shell

77523 projects

Projects that are alternatives of or similar to Kashgari

Cluener2020

CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition

Stars: ✭ 689 (-69.17%)

Mutual labels: named-entity-recognition, seq2seq, ner, sequence-labeling

Pytorch-NLU

Pytorch-NLU，一个中文文本分类、序列标注工具包，支持中文长文本、短文本的多类、多标签分类任务，支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…

Stars: ✭ 151 (-93.24%)

Mutual labels: text-classification, named-entity-recognition, bert, sequence-labeling

Spark Nlp

State of the Art Natural Language Processing

Stars: ✭ 2,518 (+12.66%)

Mutual labels: named-entity-recognition, seq2seq, text-classification, bert

Autoner

Learning Named Entity Tagger from Domain-Specific Dictionary

Stars: ✭ 357 (-84.03%)

Mutual labels: named-entity-recognition, ner, sequence-labeling

Bert seq2seq

pytorch实现bert做seq2seq任务，使用unilm方案,现在也可以做自动摘要，文本分类，情感分析，NER，词性标注等任务,支持GPT2进行文章续写。

Stars: ✭ 298 (-86.67%)

Mutual labels: text-classification, seq2seq, ner

Snips Nlu

Snips Python library to extract meaning from text

Stars: ✭ 3,583 (+60.31%)

Mutual labels: text-classification, named-entity-recognition, ner

PIE

Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models for Local Sequence Transduction": www.aclweb.org/anthology/D19-1435.pdf (EMNLP-IJCNLP 2019)

Stars: ✭ 164 (-92.66%)

Mutual labels: bert, sequence-labeling, bert-model

Bert Multitask Learning

BERT for Multitask Learning

Stars: ✭ 380 (-83%)

Mutual labels: text-classification, named-entity-recognition, ner

Spacy Streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

Stars: ✭ 360 (-83.89%)

Mutual labels: text-classification, named-entity-recognition, ner

Dan Jurafsky Chris Manning Nlp

My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.

Stars: ✭ 124 (-94.45%)

Mutual labels: text-classification, named-entity-recognition, ner

Ld Net

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling

Stars: ✭ 148 (-93.38%)

Mutual labels: named-entity-recognition, ner, sequence-labeling

Delft

a Deep Learning Framework for Text

Stars: ✭ 289 (-87.07%)

Mutual labels: text-classification, ner, sequence-labeling

Bertweet

BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)

Stars: ✭ 282 (-87.38%)

Mutual labels: text-classification, named-entity-recognition, ner

Bert Bilstm Crf Ner

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services

Stars: ✭ 3,838 (+71.72%)

Mutual labels: named-entity-recognition, ner, bert

SIGIR2021 Conure

One Person, One Model, One World: Learning Continual User Representation without Forgetting

Stars: ✭ 23 (-98.97%)

Mutual labels: transfer-learning, bert, bert-model

Ncrfpp

NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.

Stars: ✭ 1,767 (-20.94%)

Mutual labels: named-entity-recognition, ner, sequence-labeling

Macadam

Macadam是一个以Tensorflow(Keras)和bert4keras为基础，专注于文本分类、序列标注和关系抽取的自然语言处理工具包。支持RANDOM、WORD2VEC、FASTTEXT、BERT、ALBERT、ROBERTA、NEZHA、XLNET、ELECTRA、GPT-2等EMBEDDING嵌入; 支持FineTune、FastText、TextCNN、CharCNN、BiRNN、RCNN、DCNN、CRNN、DeepMoji、SelfAttention、HAN、Capsule等文本分类算法; 支持CRF、Bi-LSTM-CRF、CNN-LSTM、DGCNN、Bi-LSTM-LAN、Lattice-LSTM-Batch、MRC等序列标注算法。

Stars: ✭ 149 (-93.33%)

Mutual labels: text-classification, ner, sequence-labeling

TorchBlocks

A PyTorch-based toolkit for natural language processing

Stars: ✭ 85 (-96.2%)

Mutual labels: text-classification, named-entity-recognition, bert

Filipino-Text-Benchmarks

Open-source benchmark datasets and pretrained transformer models in the Filipino language.

Stars: ✭ 22 (-99.02%)

Mutual labels: text-classification, transfer-learning, bert

Nlp Experiments In Pytorch

PyTorch repository for text categorization and NER experiments in Turkish and English.

Stars: ✭ 35 (-98.43%)

Mutual labels: text-classification, named-entity-recognition, ner

View All Similar Projects ➔

Kashgari

Overview | Performance | Installation | Documentation | Contributing

🎉🎉🎉 We released the 2.0.0 version with TF2 Support. 🎉🎉🎉

If you use this project for your research, please cite:

@misc{Kashgari
  author = {Eliyar Eziz},
  title = {Kashgari},
  year = {2019},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/BrikerMan/Kashgari}}
}

Overview

Kashgari is a simple and powerful NLP Transfer learning framework, build a state-of-art model in 5 minutes for named entity recognition (NER), part-of-speech tagging (PoS), and text classification tasks.

Human-friendly. Kashgari's code is straightforward, well documented and tested, which makes it very easy to understand and modify.
Powerful and simple. Kashgari allows you to apply state-of-the-art natural language processing (NLP) models to your text, such as named entity recognition (NER), part-of-speech tagging (PoS) and classification.
Built-in transfer learning. Kashgari built-in pre-trained BERT and Word2vec embedding models, which makes it very simple to transfer learning to train your model.
Fully scalable. Kashgari provides a simple, fast, and scalable environment for fast experimentation, train your models and experiment with new approaches using different embeddings and model structure.
Production Ready. Kashgari could export model with SavedModel format for tensorflow serving, you could directly deploy it on the cloud.

Our Goal

Academic users Easier experimentation to prove their hypothesis without coding from scratch.
NLP beginners Learn how to build an NLP project with production level code quality.
NLP developers Build a production level classification/labeling model within minutes.

Performance

Welcome to add performance report.

Task	Language	Dataset	Score
Named Entity Recognition	Chinese	People's Daily Ner Corpus	95.57
Text Classification	Chinese	SMP2018ECDTCorpus	94.57

Installation

The project is based on Python 3.6+, because it is 2019 and type hinting is cool.

Backend	kashgari version	desc
TensorFlow 2.2+	`pip install 'kashgari>=2.0.2'`	TF2.10+ with tf.keras
TensorFlow 1.14+	`pip install 'kashgari>=1.0.0,<2.0.0'`	TF1.14+ with tf.keras
Keras	`pip install 'kashgari<1.0.0'`	keras version

You also need to install tensorflow_addons with TensorFlow.

TensorFlow Version	tensorflow_addons version
TensorFlow 2.1	`pip install tensorflow_addons==0.9.1`
TensorFlow 2.2	`pip install tensorflow_addons==0.11.2`
TensorFlow 2.3, 2.4, 2.5	`pip install tensorflow_addons==0.13.0`

Tutorials

Here is a set of quick tutorials to get you started with the library:

There are also articles and posts that illustrate how to use Kashgari:

Examples:

Neural machine translation with Seq2Seq

Contributors ✨

Thanks goes to these wonderful people. And there are many ways to get involved. Start with the contributor guidelines and then check these open issues for specific tasks.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

BrikerMan / Kashgari

Programming Languages

Labels

Projects that are alternatives of or similar to Kashgari

Kashgari

Overview | Performance | Installation | Documentation | Contributing

Overview

Our Goal

Performance

Installation

Tutorials

Contributors ✨