151. DetrEnd-to-End Object Detection with Transformers
152. PytextA natural language modeling framework based on PyTorch
155. CryptenA framework for Privacy Preserving Machine Learning
156. SvoiceWe provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
157. DenseposeA real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body
159. DetectronFAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
163. FvcoreCollection of common code that's shared among different research projects in FAIR computer vision team.
164. DeepsdfLearning Continuous Signed Distance Functions for Shape Representation
166. Kill The BitsCode for: "And the bit goes down: Revisiting the quantization of neural networks"
167. FastmriA large-scale dataset of both raw MRI measurements and clinical MRI images
168. Habitat LabA modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
169. QuaternetProposes neural networks that can generate animation of virtual characters for different actions.
175. FasttextLibrary for fast text representation and classification.
180. Pytorch3dPyTorch3D is FAIR's library of reusable components for deep learning with 3D data
181. Classifier BalancingThis repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020
185. DprDense Passage Retriever - is a set of tools and models for open domain Q&A task.
186. MmfA modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
187. KiltLibrary for Knowledge Intensive Language Tasks
189. D2goD2Go is a toolkit for efficient deep learning
190. DrqaReading Wikipedia to Answer Open-Domain Questions
191. HydraHydra is a framework for elegantly configuring complex applications
192. GtnAutomatic differentiation with weighted finite-state transducers.
193. DenoiserReal Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
194. Open lthA repository in preparation for open-sourcing lottery ticket hypothesis code.
195. TabertThis repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic parsing. TaBERT is pre-trained on a massive corpus of 26M Web tables and their associated natural language context, and could be used as a drop-in replacement of a semantic parsers original encoder to compute representations for utterances and table schemas (columns).
196. Music TranslationA UNIVERSAL MUSIC TRANSLATION NETWORK - a method for translating music across musical instruments and styles.
197. Adaptive SoftmaxImplements an efficient softmax approximation as described in the paper "Efficient softmax approximation for GPUs" (http://arxiv.org/abs/1609.04309)
198. NleThe NetHack Learning Environment
199. FairseqFacebook AI Research Sequence-to-Sequence Toolkit
200. PhyrePHYRE is a benchmark for physical reasoning.