Clue - Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+22.91%)
AiSpace - AiSpace: Better practices for deep learning model development and deployment for TensorFlow 2.0
Stars: ✭ 28 (-98.58%)
Transformers - 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+2725.24%)
Tokenizers - 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Stars: ✭ 5,077 (+157.32%)
text2text - Text2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (-90.47%)
Nlp Recipes - Natural Language Processing Best Practices & Examples
Stars: ✭ 5,783 (+193.11%)
sister - SImple SenTence EmbeddeR
Stars: ✭ 66 (-96.65%)
vietnamese-roberta - A Robustly Optimized BERT Pretraining Approach for Vietnamese
Stars: ✭ 22 (-98.88%)
PCPM - Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-98.94%)
Electra - Chinese pre-trained ELECTRA model: a Chinese model pretrained with adversarial learning
Stars: ✭ 132 (-93.31%)
Pytorch-NLU - Pytorch-NLU, a Chinese text classification and sequence labeling toolkit. It supports multi-class and multi-label classification of long and short Chinese text, and sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, and word segmentation.
Stars: ✭ 151 (-92.35%)
Nlp chinese corpus - Large Scale Chinese Corpus for NLP
Stars: ✭ 6,656 (+237.35%)
Discovery - Mining Discourse Markers for Unsupervised Sentence Representation Learning
Stars: ✭ 48 (-97.57%)
COCO-LM - [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Stars: ✭ 109 (-94.48%)
gap-text2sql - GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training
Stars: ✭ 83 (-95.79%)
muse-as-service - REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.
Stars: ✭ 45 (-97.72%)
word embedding - Sample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding.
Stars: ✭ 21 (-98.94%)
syntaxdot - Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.
Stars: ✭ 32 (-98.38%)
HugsVision - HugsVision is an easy-to-use Hugging Face wrapper for state-of-the-art computer vision
Stars: ✭ 154 (-92.19%)
Azureml Bert - End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
Stars: ✭ 342 (-82.67%)
mirror-bert - [EMNLP 2021] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.
Stars: ✭ 56 (-97.16%)
open clip - An open source implementation of CLIP.
Stars: ✭ 1,534 (-22.25%)
roberta-wwm-base-distill - A distilled RoBERTa-wwm base model, distilled from RoBERTa-wwm large
Stars: ✭ 61 (-96.91%)
Pytorch Sentiment Analysis - Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (+62.65%)
AnnA Anki neuronal Appendix - Using machine learning on your Anki collection to enhance the scheduling via semantic clustering and semantic similarity
Stars: ✭ 39 (-98.02%)
kwx - BERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (-98.33%)
wechsel - Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
Stars: ✭ 39 (-98.02%)
Text Summarizer - Python Framework for Extractive Text Summarization
Stars: ✭ 96 (-95.13%)
Haystack - 🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+72.78%)
Opencog - A framework for integrated Artificial Intelligence & Artificial General Intelligence (AGI)
Stars: ✭ 2,132 (+8.06%)
Keras Bert - Implementation of BERT that can load official pre-trained models for feature extraction and prediction
Stars: ✭ 2,264 (+14.75%)
Optimus - Optimus: the first large-scale pre-trained VAE language model
Stars: ✭ 180 (-90.88%)
backprop - Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
Stars: ✭ 229 (-88.39%)
Cross-Lingual-MRC - Cross-Lingual Machine Reading Comprehension (EMNLP 2019)
Stars: ✭ 66 (-96.65%)
Bert Pytorch - Google AI 2018 BERT PyTorch implementation
Stars: ✭ 4,642 (+135.28%)
Easy Bert - A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Stars: ✭ 106 (-94.63%)
Getlang - Natural language detection package in pure Go
Stars: ✭ 110 (-94.42%)
Flappy Es - Flappy Bird AI using Evolution Strategies
Stars: ✭ 140 (-92.9%)
And - Official PyTorch Implementation for ICML'19 paper: Unsupervised Deep Learning by Neighbourhood Discovery
Stars: ✭ 133 (-93.26%)
Awesome Bert - BERT NLP papers, applications, and GitHub resources, including the newest XLNet; papers and GitHub projects related to BERT and XLNet
Stars: ✭ 1,732 (-12.21%)
Lr Gan.pytorch - PyTorch code for our ICLR 2017 paper "Layered-Recursive GAN for image generation"
Stars: ✭ 145 (-92.65%)
Deepmapping - code/webpage for the DeepMapping project
Stars: ✭ 140 (-92.9%)
Roberta zh - RoBERTa Chinese pre-trained model: RoBERTa for Chinese
Stars: ✭ 1,953 (-1.01%)
Chars2vec - Character-based word embeddings model based on RNN for handling real world texts
Stars: ✭ 130 (-93.41%)
Unsupervised detection - An Unsupervised Learning Framework for Moving Object Detection From Videos
Stars: ✭ 139 (-92.95%)
Pretrained - Pretrained is the most complete and frequently updated list of pretrained top-performing models. TensorFlow, Theano and others. Want to add your model? File an issue, and we will add it.
Stars: ✭ 129 (-93.46%)
Stylisticpoetry - Code for Stylistic Chinese Poetry Generation via Unsupervised Style Disentanglement (EMNLP 2018)
Stars: ✭ 148 (-92.5%)
Efficientnet - Implementation of EfficientNet model. Keras and TensorFlow Keras.
Stars: ✭ 1,920 (-2.69%)
E3d lstm - e3d-lstm; Eidetic 3D LSTM: A Model for Video Prediction and Beyond
Stars: ✭ 129 (-93.46%)
Pretrained Language Model - Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
Stars: ✭ 2,033 (+3.04%)
Isolation Forest - A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Stars: ✭ 139 (-92.95%)