Policy GradientMinimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
Stars: ✭ 135 (-18.67%)
Starcraft AiReinforcement Learning and Transfer Learning based StarCraft Micromanagement
Stars: ✭ 95 (-42.77%)
Data Science FreeFree Resources For Data Science created by Shubham Kumar
Stars: ✭ 232 (+39.76%)
ganbertEnhancing the BERT training with Semi-supervised Generative Adversarial Networks
Stars: ✭ 205 (+23.49%)
Warehouse Robot Path PlanningA multi agent path planning solution under a warehouse scenario using Q learning and transfer learning.🤖️
Stars: ✭ 59 (-64.46%)
CPCE-3DLow-dose CT via Transfer Learning from a 2D Trained Network, In IEEE TMI 2018
Stars: ✭ 40 (-75.9%)
SimPLECode for the paper: "SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification"
Stars: ✭ 50 (-69.88%)
dqn-lambdaNeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
Stars: ✭ 20 (-87.95%)
CS231nPyTorch/Tensorflow solutions for Stanford's CS231n: "CNNs for Visual Recognition"
Stars: ✭ 47 (-71.69%)
Samsung Drl CodeRepository for codes of Deep Reinforcement Learning (DRL) lectured at Samsung
Stars: ✭ 99 (-40.36%)
Pytorch sac aePyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
Stars: ✭ 94 (-43.37%)
Chemgan ChallengeCode for the paper: Benhenda, M. 2017. ChemGAN challenge for drug discovery: can AI reproduce natural chemical diversity? arXiv preprint arXiv:1708.08227.
Stars: ✭ 98 (-40.96%)
Reinforcement learning in pythonImplementing Reinforcement Learning, namely Q-learning and Sarsa algorithms, for global path planning of mobile robot in unknown environment with obstacles. Comparison analysis of Q-learning and Sarsa
Stars: ✭ 134 (-19.28%)
Gym Gazebo2gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo
Stars: ✭ 257 (+54.82%)
RlgraphRLgraph: Modular computation graphs for deep reinforcement learning
Stars: ✭ 272 (+63.86%)
deep-Q-networksImplementations of algorithms from the Q-learning family. Implementations inlcude: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51
Stars: ✭ 135 (-18.67%)
Pytorch DqnDeep Q-Learning Network in pytorch (not actively maintained)
Stars: ✭ 282 (+69.88%)
Deep rlPyTorch implementations of Deep Reinforcement Learning algorithms (DQN, DDQN, A2C, VPG, TRPO, PPO, DDPG, TD3, SAC, SAC-AEA)
Stars: ✭ 291 (+75.3%)
reinforce-js[INACTIVE] A collection of various machine learning solver. The library is an object-oriented approach (baked with Typescript) and tries to deliver simplified interfaces that make using the algorithms pretty simple.
Stars: ✭ 20 (-87.95%)
Reward Learning Rl[RSS 2019] End-to-End Robotic Reinforcement Learning without Reward Engineering
Stars: ✭ 310 (+86.75%)
Neural Symbolic MachinesNeural Symbolic Machines is a framework to integrate neural networks and symbolic representations using reinforcement learning, with applications in program synthesis and semantic parsing.
Stars: ✭ 305 (+83.73%)
Deeprl Tensorflow2🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Stars: ✭ 319 (+92.17%)
GdrlGrokking Deep Reinforcement Learning
Stars: ✭ 304 (+83.13%)
Tf RexPlay Google Chrome's T-rex game with TensorFlow
Stars: ✭ 345 (+107.83%)
Lagomlagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Stars: ✭ 364 (+119.28%)
Applied Reinforcement LearningReinforcement Learning and Decision Making tutorials explained at an intuitive level and with Jupyter Notebooks
Stars: ✭ 229 (+37.95%)
Tensorflow TutorialTensorflow tutorial from basic to hard, 莫烦Python 中文AI教学
Stars: ✭ 4,122 (+2383.13%)
Numpy MlMachine learning, in numpy
Stars: ✭ 11,100 (+6586.75%)
Rl BookSource codes for the book "Reinforcement Learning: Theory and Python Implementation"
Stars: ✭ 464 (+179.52%)
Tetris AiA deep reinforcement learning bot that plays tetris
Stars: ✭ 109 (-34.34%)
Video ClassificationTutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Stars: ✭ 543 (+227.11%)
Animalai OlympicsCode repository for the Animal AI Olympics competition
Stars: ✭ 544 (+227.71%)
Habitat LabA modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
Stars: ✭ 587 (+253.61%)
LeakganThe codes of paper "Long Text Generation via Adversarial Training with Leaked Information" on AAAI 2018. Text generation using GAN and Hierarchical Reinforcement Learning.
Stars: ✭ 533 (+221.08%)
ExermoteUsing Machine Learning to predict the type of exercise from movement data
Stars: ✭ 108 (-34.94%)
SoftlearningSoftlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Stars: ✭ 713 (+329.52%)
Ml AgentsUnity Machine Learning Agents Toolkit
Stars: ✭ 12,134 (+7209.64%)
Pytorch RlDeep Reinforcement Learning with pytorch & visdom
Stars: ✭ 745 (+348.8%)
GibsonenvGibson Environments: Real-World Perception for Embodied Agents
Stars: ✭ 666 (+301.2%)
MinimalrlImplementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Stars: ✭ 2,051 (+1135.54%)
Rl tradingAn environment to high-frequency trading agents under reinforcement learning
Stars: ✭ 205 (+23.49%)
Pytorch A2c Ppo Acktr GailPyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Stars: ✭ 2,632 (+1485.54%)
Paac.pytorchPytorch implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning https://arxiv.org/abs/1705.04862
Stars: ✭ 22 (-86.75%)
CausalworldCausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning
Stars: ✭ 76 (-54.22%)
TorchrlHighly Modular and Scalable Reinforcement Learning
Stars: ✭ 102 (-38.55%)
Ctc ExecutionerMaster Thesis: Limit order placement with Reinforcement Learning
Stars: ✭ 112 (-32.53%)