deepfrog: An NLP suite powered by deep learning
language-planner: Official code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
pytorch-vit: "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale" (a patch-embedding sketch appears after this list)
molecule-attention-transformer: PyTorch reimplementation of Molecule Attention Transformer, which uses a transformer to tackle the graph-like structure of molecules
converse: Conversational text analysis using various NLP techniques
modules: The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We develop a method for analyzing emerging functional modularity in neural networks based on differentiable weight masks and use it to point out important issues in current-day neural networks.
gnn-lspe: Source code for GNN-LSPE (Graph Neural Networks with Learnable Structural and Positional Representations), ICLR 2022
DocSum: A tool to automatically summarize documents abstractively using the BART or PreSumm machine learning models.
xpandas: Universal 1D/2D data containers with Transformers functionality for data analysis.
WellcomeML: Repository for machine learning utils at the Wellcome Trust
bert-squeeze: 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
long-short-transformer: Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in PyTorch
deepconsensus: DeepConsensus uses gap-aware sequence transformers to correct errors in Pacific Biosciences (PacBio) Circular Consensus Sequencing (CCS) data.
anonymisation: Anonymization of French legal cases based on Flair embeddings
wechsel: Code for "WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models"
transformers-interpret: Model explainability that works seamlessly with 🤗 Transformers; explain your model in just two lines of code (see the usage sketch after this list)
pysentimiento: A Python multilingual toolkit for sentiment analysis and social NLP tasks (see the usage sketch after this list)
transformer generalization: The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We significantly improve the systematic generalization of transformer models on a variety of datasets using simple tricks and careful considerations.
backprop: Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
RETRO-pytorch: Implementation of RETRO, DeepMind's retrieval-based attention net, in PyTorch
code-transformer: Implementation of the paper "Language-agnostic representation learning of source code from structure and context".
awesome-huggingface: 🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.
transformers-lightning: A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate PyTorch Lightning with Transformers.
X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text Classification
Ask2Transformers: A framework for textual-entailment-based zero-shot text classification (an illustrative sketch of the entailment approach appears after this list)
oreilly-bert-nlp: This repository contains code for the O'Reilly Live Online Training for BERT
nlp workshop odsc europe20: Extensive tutorials for the Advanced NLP Workshop at the Open Data Science Conference Europe 2020. We leverage machine learning, deep learning and deep transfer learning to solve popular NLP tasks including NER, classification, recommendation / information retrieval, summarization, language translation, Q&A and T…
Transformer-MM-Explainability: [ICCV 2021, Oral] Official PyTorch implementation of "Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers", a novel method to visualize any Transformer-based network; includes examples for DETR and VQA.
uniformer-pytorch: Implementation of Uniformer, a simple attention and 3D convolutional net that achieved SOTA on a number of video classification tasks; debuted at ICLR 2022
clip-italian: CLIP (Contrastive Language–Image Pre-training) for Italian
Basic-UI-for-GPT-J-6B-with-low-vram: A repository to run GPT-J-6B on low-VRAM machines (minimum 4.2 GB VRAM for a 2000-token context, 3.5 GB for a 1000-token context); model loading takes 12 GB of free RAM (see the loading sketch after this list)
text: Using Transformers from Hugging Face in R
jax-models: Unofficial JAX implementations of deep learning research papers
TransQuest: Transformer-based translation quality estimation
STAM-pytorch: Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification
naru: Neural Relation Understanding: neural cardinality estimators for tabular data
KB-ALBERT: A Korean ALBERT model specialized for the economics/finance domain, provided by KB Kookmin Bank
LIT: [AAAI 2022] The official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"
thermostat: Collection of NLP model explanations and accompanying analysis tools
SnowflakeNet: (TPAMI 2022) Snowflake Point Deconvolution for Point Cloud Completion and Generation with Skip-Transformer
gpl: A powerful unsupervised domain adaptation method for dense retrieval that requires only an unlabeled corpus and yields massive improvements: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
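
A few entries above advertise concrete usage patterns; the sketches below illustrate them. First, the core idea behind pytorch-vit's paper: cut the image into 16x16 patches and linearly project each patch into a token embedding before feeding the sequence to a standard transformer. This is a minimal sketch of the general technique, not the repository's actual code; all names and sizes are illustrative.

```python
import torch
import torch.nn as nn

class PatchEmbedding(nn.Module):
    """Split an image into 16x16 patches and project each to an embedding."""
    def __init__(self, img_size=224, patch_size=16, in_chans=3, dim=768):
        super().__init__()
        self.num_patches = (img_size // patch_size) ** 2
        # A strided convolution is equivalent to slicing out non-overlapping
        # patches and applying a shared linear projection to each one.
        self.proj = nn.Conv2d(in_chans, dim, kernel_size=patch_size, stride=patch_size)

    def forward(self, x):                    # x: (batch, 3, 224, 224)
        x = self.proj(x)                     # (batch, dim, 14, 14)
        return x.flatten(2).transpose(1, 2)  # (batch, 196, dim), a token sequence

tokens = PatchEmbedding()(torch.randn(1, 3, 224, 224))
print(tokens.shape)  # torch.Size([1, 196, 768])
```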
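For transformers-interpret, the "two lines of code" its description advertises look roughly like this. The class name SequenceClassificationExplainer matches the project's documentation, but treat the exact signature and the checkpoint name as assumptions.

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from transformers_interpret import SequenceClassificationExplainer

# Any fine-tuned sequence classifier works; this checkpoint is illustrative.
model_name = "distilbert-base-uncased-finetuned-sst-2-english"
model = AutoModelForSequenceClassification.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# The advertised two lines: build an explainer, then ask it for attributions.
explainer = SequenceClassificationExplainer(model, tokenizer)
word_attributions = explainer("I love this movie!")

print(word_attributions)  # list of (token, attribution score) pairs
```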
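For pysentimiento, usage centers on a single factory function. create_analyzer is the project's documented entry point, but the task/lang values and the output fields shown here should be treated as assumptions.

```python
from pysentimiento import create_analyzer

# Multilingual by design: lang selects the underlying pretrained model.
analyzer = create_analyzer(task="sentiment", lang="es")

result = analyzer.predict("Qué gran día!")
print(result.output)  # predicted label, e.g. "POS"
print(result.probas)  # per-class probabilities
```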
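The entailment-based zero-shot classification that Ask2Transformers builds on can be illustrated with the plain Hugging Face pipeline; this is the generic mechanism, not Ask2Transformers' own API. Each candidate label is turned into a hypothesis such as "This example is about {label}.", and an NLI model scores how strongly the input text entails it.

```python
from transformers import pipeline

# An NLI model repurposed for zero-shot classification; no task-specific
# training data is needed, only the candidate label names.
classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

result = classifier(
    "The new GPU doubles training throughput.",
    candidate_labels=["hardware", "politics", "cooking"],
)
print(result["labels"][0])  # highest-entailment label, likely "hardware"
```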
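Finally, the low-memory loading behind Basic-UI-for-GPT-J-6B-with-low-vram can be approximated with stock Hugging Face Transformers options; the repository ships its own scripts, so this is an assumption-laden sketch of the general approach rather than its code.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Half-precision weights roughly halve VRAM use, and low_cpu_mem_usage
# streams checkpoint shards instead of building a full fp32 copy in RAM.
model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-j-6B",
    revision="float16",          # fp16 weight branch published by EleutherAI
    torch_dtype=torch.float16,
    low_cpu_mem_usage=True,
)
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-j-6B")
```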