DocSum: A tool to automatically summarize documents abstractively using the BART or PreSumm machine learning models.
Stars: ✭ 58 (+52.63%)
Transformer-QG-on-SQuAD: Implements a question generator with SOTA pre-trained language models (RoBERTa, BERT, GPT, BART, T5, etc.)
Stars: ✭ 28 (-26.32%)
Spark Nlp: State-of-the-art Natural Language Processing
Stars: ✭ 2,518 (+6526.32%)
erc: Emotion recognition in conversation
Stars: ✭ 34 (-10.53%)
Bertviz: Tool for visualizing attention in the Transformer model (BERT, GPT-2, ALBERT, XLNet, RoBERTa, CTRL, etc.)
Stars: ✭ 3,443 (+8960.53%)
Roberta zh: Chinese pre-trained RoBERTa models (RoBERTa for Chinese)
Stars: ✭ 1,953 (+5039.47%)
CLUE pytorch: The PyTorch version of the CLUE baselines
Stars: ✭ 72 (+89.47%)
Albert zh: A Lite BERT for Self-Supervised Learning of Language Representations, with large-scale pre-trained Chinese ALBERT models
Stars: ✭ 3,500 (+9110.53%)
Clue: Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+6281.58%)
OpenDialog: An open-source package for a Chinese open-domain conversational chatbot (a Chinese chit-chat dialogue system; deploy a WeChat chatbot with one click)
Stars: ✭ 94 (+147.37%)
question generator: An NLP system for generating reading comprehension questions
Stars: ✭ 188 (+394.74%)
Texar: Toolkit for machine learning, natural language processing, and text generation in TensorFlow. Part of the CASL project: http://casl-project.ai/
Stars: ✭ 2,236 (+5784.21%)
PlanSum: [AAAI2021] Unsupervised Opinion Summarization with Content Planning
Stars: ✭ 25 (-34.21%)
xl-sum: Code, data, and models for the paper "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages", published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
Stars: ✭ 160 (+321.05%)
NLP-paper: 🎨 Natural language processing tutorials: https://dataxujing.github.io/NLP-paper/
Stars: ✭ 23 (-39.47%)
Entity2Topic: [NAACL2018] Entity Commonsense Representation for Neural Abstractive Summarization
Stars: ✭ 20 (-47.37%)
label-studio-transformers: Label data using HuggingFace's transformers and automatically get a prediction service
Stars: ✭ 117 (+207.89%)
bert-squeeze: 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
Stars: ✭ 56 (+47.37%)
ParsBigBird: Persian BERT for long-range sequences
Stars: ✭ 58 (+52.63%)
text2keywords: Trained T5 and T5-large models for creating keywords from text
Stars: ✭ 53 (+39.47%)
roberta-wwm-base-distill: A RoBERTa-wwm-base model distilled from RoBERTa-wwm-large
Stars: ✭ 61 (+60.53%)
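As background on what such a distillation produces: the student model is trained to match the teacher's temperature-softened output distribution. A minimal sketch of the classic soft-label distillation loss in plain Python (function names and values are illustrative, not taken from the project above):

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature-scaled softmax; a higher temperature yields a softer
    # (more uniform) distribution, exposing the teacher's "dark knowledge".
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    # Cross-entropy between the teacher's softened distribution and the
    # student's softened distribution: the soft-label term of KD training.
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q))
```

The loss is smallest when the student's logits reproduce the teacher's distribution; real distillation setups usually combine this term with a hard-label cross-entropy on the ground truth.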
text2text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+394.74%)
ttt: A package for fine-tuning Transformers with TPUs, written in TensorFlow 2.0+
Stars: ✭ 35 (-7.89%)
text2class: Multi-class text categorization using state-of-the-art pre-trained contextualized language models, e.g. BERT
Stars: ✭ 15 (-60.53%)
wechsel: Code for "WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models"
Stars: ✭ 39 (+2.63%)
Awesome Bert: BERT NLP papers, applications, and GitHub resources, including the newest XLNet (papers and GitHub projects related to BERT and XLNet)
Stars: ✭ 1,732 (+4457.89%)
Chinese Bert Wwm: Pre-Training with Whole Word Masking for Chinese BERT (the Chinese BERT-wwm model series)
Stars: ✭ 6,357 (+16628.95%)
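For context, whole word masking (wwm) masks every subword piece of a selected word rather than masking pieces independently. A minimal illustrative sketch in plain Python, assuming the WordPiece convention that a `##` prefix marks a continuation piece (the tokens and masking choices are hypothetical, not this project's actual pipeline):

```python
import random

def whole_word_mask(tokens, mask_ratio=0.15, mask_token="[MASK]", seed=0):
    # Group WordPiece tokens into whole words: a piece starting with "##"
    # continues the previous word.
    words = []
    for i, tok in enumerate(tokens):
        if tok.startswith("##") and words:
            words[-1].append(i)
        else:
            words.append([i])

    # Sample whole words to mask, then mask every piece of each chosen word.
    rng = random.Random(seed)
    n_mask = max(1, round(len(words) * mask_ratio))
    masked = list(tokens)
    for word in rng.sample(words, n_mask):
        for i in word:
            masked[i] = mask_token
    return masked
```

For example, given `["play", "##ing", "chess"]`, choosing the first word masks both `"play"` and `"##ing"` together, which is the point of the technique: the model must predict the whole word, not just one easy-to-guess piece.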
gazeta: Gazeta, a dataset for automatic summarization of Russian news
Stars: ✭ 25 (-34.21%)
policy-data-analyzer: Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US, and India, bringing NLP to policy analysis through an extensible framework that includes scraping, preprocessing, active learning, and text-analysis pipelines
Stars: ✭ 22 (-42.11%)
Haystack: 🔍 An open-source NLP framework that leverages Transformer models, enabling developers to implement production-ready neural search, question answering, semantic document search, and summarization for a wide range of applications
Stars: ✭ 3,409 (+8871.05%)
golgotha: Contextualised embeddings and language modelling in R using BERT and friends
Stars: ✭ 39 (+2.63%)
Pytorch-NLU: A Chinese text classification and sequence labeling toolkit; supports multi-class and multi-label classification of Chinese long and short texts, and sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, and word segmentation
Stars: ✭ 151 (+297.37%)
TorchBlocks: A PyTorch-based toolkit for natural language processing
Stars: ✭ 85 (+123.68%)
robo-vln: PyTorch code for the ICRA'21 paper "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
Stars: ✭ 34 (-10.53%)
Pytorch Sentiment Analysis: Tutorials on getting started with PyTorch and TorchText for sentiment analysis
Stars: ✭ 3,209 (+8344.74%)
Fast Bert: A super-easy library for BERT-based NLP models
Stars: ✭ 1,678 (+4315.79%)
Nlp Architect: A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
Stars: ✭ 2,768 (+7184.21%)
Transformers-Tutorials: Demos built by the author with HuggingFace's Transformers library
Stars: ✭ 2,828 (+7342.11%)
HugsVision: An easy-to-use HuggingFace wrapper for state-of-the-art computer vision
Stars: ✭ 154 (+305.26%)
Tokenizers: 💥 Fast state-of-the-art tokenizers optimized for research and production
Stars: ✭ 5,077 (+13260.53%)
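To illustrate the kind of segmentation such tokenizer libraries compute, here is a minimal greedy longest-match-first (WordPiece-style) sketch in plain Python. The vocabulary here is hypothetical, and real implementations add normalization, special tokens, and much faster matching:

```python
def wordpiece_tokenize(word, vocab, unk="[UNK]"):
    # Greedy longest-match-first segmentation: repeatedly take the longest
    # vocabulary entry that prefixes the remaining text; pieces after the
    # first carry the "##" continuation prefix.
    pieces, start = [], 0
    while start < len(word):
        end = len(word)
        piece = None
        while end > start:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate
            if candidate in vocab:
                piece = candidate
                break
            end -= 1
        if piece is None:
            return [unk]  # no segmentation exists for this word
        pieces.append(piece)
        start = end
    return pieces
```

With a vocabulary containing `"un"`, `"##afford"`, and `"##able"`, the word `"unaffordable"` segments into `["un", "##afford", "##able"]`; a word with no valid segmentation maps to the unknown token.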
bangla-bert: A pretrained BERT model for the Bengali language
Stars: ✭ 41 (+7.89%)
backprop: Backprop makes it simple to use, fine-tune, and deploy state-of-the-art ML models
Stars: ✭ 229 (+502.63%)
streamlit-light-leaflet: A quick-and-dirty Streamlit Leaflet component that sends back coordinates on map click
Stars: ✭ 22 (-42.11%)
vietnamese-roberta: A Robustly Optimized BERT Pretraining Approach for Vietnamese
Stars: ✭ 22 (-42.11%)
gpl: A powerful unsupervised domain-adaptation method for dense retrieval; requires only an unlabeled corpus and yields large improvements. "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval", https://arxiv.org/abs/2112.07577
Stars: ✭ 216 (+468.42%)
KLUE: 📖 Korean NLU Benchmark
Stars: ✭ 420 (+1005.26%)
semantic-document-relations: Implementation, trained models, and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles"
Stars: ✭ 21 (-44.74%)
classy: A simple-to-use library for building high-performance machine learning models in NLP
Stars: ✭ 61 (+60.53%)
AiSpace: Better practices for deep learning model development and deployment, for TensorFlow 2.0
Stars: ✭ 28 (-26.32%)
oreilly-bert-nlp: Code for the O'Reilly Live Online Training for BERT
Stars: ✭ 19 (-50%)
anonymisation: Anonymization of French legal cases based on Flair embeddings
Stars: ✭ 85 (+123.68%)