SDLM-pytorchCode accompanying EMNLP 2018 paper Language Modeling with Sparse Product of Sememe Experts
Stars: ✭ 27 (-92.68%)
Reward Learning Rl[RSS 2019] End-to-End Robotic Reinforcement Learning without Reward Engineering
Stars: ✭ 310 (-15.99%)
ai-n-queensSolving and GUI demonstration of traditional N-Queens Problem using Hill Climbing, Simulated Annealing, Local Beam Search, and Genetic Algorithm.
Stars: ✭ 30 (-91.87%)
Pytorch DdpgImplementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
Stars: ✭ 272 (-26.29%)
miniconsUtility for analyzing Transformer based representations of language.
Stars: ✭ 28 (-92.41%)
Azureml BertEnd-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
Stars: ✭ 342 (-7.32%)
wolpertinger ddpgWolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatible.
Stars: ✭ 44 (-88.08%)
DrqDrQ: Data regularized Q
Stars: ✭ 268 (-27.37%)
Seq2seq chatbot基于seq2seq模型的简单对话系统的tf实现,具有embedding、attention、beam_search等功能,数据集是Cornell Movie Dialogs
Stars: ✭ 308 (-16.53%)
Object-Goal-NavigationPytorch code for NeurIPS-20 Paper "Object Goal Navigation using Goal-Oriented Semantic Exploration"
Stars: ✭ 107 (-71%)
AtariPersistent advantage learning dueling double DQN for the Arcade Learning Environment
Stars: ✭ 261 (-29.27%)
transformerNeutron: A pytorch based implementation of Transformer and its variants.
Stars: ✭ 60 (-83.74%)
CurlCURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
Stars: ✭ 346 (-6.23%)
rl pytorchDeep Reinforcement Learning Algorithms Implementation in PyTorch
Stars: ✭ 23 (-93.77%)
PlanetDeep Planning Network: Control from pixels by latent planning with learned dynamics
Stars: ✭ 257 (-30.35%)
DRL DeliveryDuelDeep Reinforcement Learning applied to a modern 3D video-game environment called Delivery Duel.
Stars: ✭ 30 (-91.87%)
Neural Symbolic MachinesNeural Symbolic Machines is a framework to integrate neural networks and symbolic representations using reinforcement learning, with applications in program synthesis and semantic parsing.
Stars: ✭ 305 (-17.34%)
dqn-lambdaNeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
Stars: ✭ 20 (-94.58%)
pysc2-rl-agentsStarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)
Stars: ✭ 124 (-66.4%)
Deep-RL-agentsNo description or website provided.
Stars: ✭ 27 (-92.68%)
Crypto RlDeep Reinforcement Learning toolkit: record and replay cryptocurrency limit order book data & train a DDQN agent
Stars: ✭ 328 (-11.11%)
MinTLMinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Stars: ✭ 61 (-83.47%)
Meta-SACAuto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
Stars: ✭ 19 (-94.85%)
MaRLEnEMachine- and Reinforcement Learning ExtensioN for (game) Engines
Stars: ✭ 47 (-87.26%)
Gpt NeoxAn implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.
Stars: ✭ 303 (-17.89%)
captioning chainerA fast implementation of Neural Image Caption by Chainer
Stars: ✭ 17 (-95.39%)
rl-medicalCommunicative Multiagent Deep Reinforcement Learning for Anatomical Landmark Detection using PyTorch.
Stars: ✭ 36 (-90.24%)
Lagomlagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (-1.36%)
robustnavEvaluating pre-trained navigation agents under corruptions
Stars: ✭ 18 (-95.12%)
CommNetan implementation of CommNet
Stars: ✭ 23 (-93.77%)
Deep-Quality-Value-FamilyOfficial implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning Algorithms": https://arxiv.org/abs/1909.01779 To appear at the next NeurIPS2019 DRL-Workshop
Stars: ✭ 12 (-96.75%)
Pytorch TrpoPyTorch implementation of Trust Region Policy Optimization
Stars: ✭ 303 (-17.89%)
Deep-Reinforcement-Learning-NotebooksThis Repository contains a series of google colab notebooks which I created to help people dive into deep reinforcement learning.This notebooks contain both theory and implementation of different algorithms.
Stars: ✭ 15 (-95.93%)
pytorch-hdqnHierarchical-DQN in pytorch (not actively maintained)
Stars: ✭ 36 (-90.24%)
Deeprl Tensorflow2🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Stars: ✭ 319 (-13.55%)
FNet-pytorchUnofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
Stars: ✭ 204 (-44.72%)
reinforce-js[INACTIVE] A collection of various machine learning solver. The library is an object-oriented approach (baked with Typescript) and tries to deliver simplified interfaces that make using the algorithms pretty simple.
Stars: ✭ 20 (-94.58%)
language-plannerOfficial Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
Stars: ✭ 84 (-77.24%)
Deep rlPyTorch implementations of Deep Reinforcement Learning algorithms (DQN, DDQN, A2C, VPG, TRPO, PPO, DDPG, TD3, SAC, SAC-AEA)
Stars: ✭ 291 (-21.14%)
AlphaNPIAdapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
Stars: ✭ 71 (-80.76%)
pytorchrlDeep Reinforcement Learning algorithms implemented in PyTorch
Stars: ✭ 47 (-87.26%)
Im2latexImage to LaTeX (Seq2seq + Attention with Beam Search) - Tensorflow
Stars: ✭ 342 (-7.32%)
semantic-guidanceCode for our CVPR-2021 paper on Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings.
Stars: ✭ 19 (-94.85%)
Transfer NlpNLP library designed for reproducible experimentation management
Stars: ✭ 287 (-22.22%)
AutoPentest-DRLAutoPentest-DRL: Automated Penetration Testing Using Deep Reinforcement Learning
Stars: ✭ 196 (-46.88%)
FinRLFinRL: The first open-source project for financial reinforcement learning. Please star. 🔥
Stars: ✭ 3,497 (+847.7%)
Black-Box-TuningICML'2022: Black-Box Tuning for Language-Model-as-a-Service
Stars: ✭ 99 (-73.17%)
Openai labAn experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Stars: ✭ 313 (-15.18%)
neural-chatAn AI chatbot using seq2seq
Stars: ✭ 30 (-91.87%)
rlflowA TensorFlow-based framework for learning about and experimenting with reinforcement learning algorithms
Stars: ✭ 20 (-94.58%)
Kogpt2Korean GPT-2 pretrained cased (KoGPT2)
Stars: ✭ 368 (-0.27%)
Tf2rlTensorFlow2 Reinforcement Learning
Stars: ✭ 353 (-4.34%)
RlzooA Comprehensive Reinforcement Learning Zoo for Simple Usage 🚀
Stars: ✭ 342 (-7.32%)
Cadrl rosROS package for dynamic obstacle avoidance for ground robots trained with deep RL
Stars: ✭ 309 (-16.26%)