Pytorch Trpo
PyTorch Implementation of Trust Region Policy Optimization (TRPO)
Stars: ✭ 123 (-95.34%)
Deep rl
PyTorch implementations of Deep Reinforcement Learning algorithms (DQN, DDQN, A2C, VPG, TRPO, PPO, DDPG, TD3, SAC, SAC-AEA)
Stars: ✭ 291 (-88.98%)
Scalphagozero
An independent implementation of DeepMind's AlphaGoZero in Scala, using Deeplearning4J (DL4J)
Stars: ✭ 144 (-94.55%)
AutoPentest-DRL
AutoPentest-DRL: Automated Penetration Testing Using Deep Reinforcement Learning
Stars: ✭ 196 (-92.58%)
Pytorch Dqn
Deep Q-Learning Network in PyTorch (not actively maintained)
Stars: ✭ 282 (-89.32%)
godpaper
🐵 An AI chess board game framework, implemented in many programming languages.
Stars: ✭ 40 (-98.48%)
Pysc2 Examples
StarCraft II - pysc2 Deep Reinforcement Learning Examples
Stars: ✭ 722 (-72.65%)
Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
Stars: ✭ 1,148 (-56.52%)
AI booklet CE-AUT
Booklet and exam of the Artificial Intelligence Master's Degree at Amirkabir University of Technology.
Stars: ✭ 14 (-99.47%)
Softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Stars: ✭ 713 (-72.99%)
muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and pit both algorithms against each other, and investigate the reliability of learned MuZero MDP models.
Stars: ✭ 126 (-95.23%)
CrowdNav DSRNN
[ICRA 2021] Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning
Stars: ✭ 43 (-98.37%)
Gibsonenv
Gibson Environments: Real-World Perception for Embodied Agents
Stars: ✭ 666 (-74.77%)
Spinning Up Basic
Basic versions of agents from Spinning Up in Deep RL written in PyTorch
Stars: ✭ 155 (-94.13%)
Rl Quadcopter
Teach a Quadcopter How to Fly!
Stars: ✭ 124 (-95.3%)
Pytorch Nec
PyTorch Implementation of Neural Episodic Control (NEC)
Stars: ✭ 67 (-97.46%)
Tensorforce
Tensorforce: a TensorFlow library for applied reinforcement learning
Stars: ✭ 3,062 (+15.98%)
Carla-ppo
This repository hosts a customized PPO-based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement-learning-based agents, by wrapping Carla in a gym-like environment that can handle custom reward functions, custom debug output, etc.
Stars: ✭ 122 (-95.38%)
Top Deep Learning
Top 200 deep learning GitHub repositories sorted by the number of stars.
Stars: ✭ 1,365 (-48.3%)
pydata-london-2018
Slides and notebooks for my tutorial at PyData London 2018
Stars: ✭ 22 (-99.17%)
DeepLearningFlappyFrog
Flappy Frog hack using Deep Reinforcement Learning (Deep Q-learning). Brute-force "toad worshipping" is not advisable.
Stars: ✭ 16 (-99.39%)
Drl papernotes
Notes and comments about Deep Reinforcement Learning papers
Stars: ✭ 65 (-97.54%)
alpha sigma
A PyTorch-based Gomoku game model. Reinforcement learning and Monte Carlo Tree Search model based on the Alpha Zero algorithm.
Stars: ✭ 134 (-94.92%)
Deepdrive
Deepdrive is a simulator that allows anyone with a PC to push the state of the art in self-driving
Stars: ✭ 628 (-76.21%)
distributedRL
A framework for easy prototyping of distributed reinforcement learning algorithms
Stars: ✭ 93 (-96.48%)
Rl Medical
Deep Reinforcement Learning (DRL) agents applied to medical images
Stars: ✭ 123 (-95.34%)
On Policy
This is the official implementation of Multi-Agent PPO.
Stars: ✭ 63 (-97.61%)
Drq
DrQ: Data regularized Q
Stars: ✭ 268 (-89.85%)
D3rlpy
An offline deep reinforcement learning library
Stars: ✭ 139 (-94.73%)
Deep Trading Agent
Deep Reinforcement Learning based Trading Agent for Bitcoin
Stars: ✭ 573 (-78.3%)
Atari
Persistent advantage learning dueling double DQN for the Arcade Learning Environment
Stars: ✭ 261 (-90.11%)
FinRL Podracer
Cloud-native Financial Reinforcement Learning
Stars: ✭ 179 (-93.22%)
Trending Deep Learning
Top 100 trending deep learning repositories sorted by the number of stars gained on a specific day.
Stars: ✭ 543 (-79.43%)
Malmo Challenge
Malmo Collaborative AI Challenge - Team Pig Catcher
Stars: ✭ 64 (-97.58%)
Gym Gazebo2
gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo
Stars: ✭ 257 (-90.27%)
ddpg biped
Repository for a planar bipedal walking robot in the Gazebo environment, using Deep Deterministic Policy Gradient (DDPG) with TensorFlow.
Stars: ✭ 65 (-97.54%)
Ai Economist
Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).
Stars: ✭ 537 (-79.66%)
Agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Stars: ✭ 2,135 (-19.13%)
Snake Ai Reinforcement
AI for Snake game trained from pixels using Deep Reinforcement Learning (DQN).
Stars: ✭ 123 (-95.34%)
Max
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
Stars: ✭ 61 (-97.69%)
Planet
Deep Planning Network: Control from pixels by latent planning with learned dynamics
Stars: ✭ 257 (-90.27%)
Machine Learning And Data Science
This is a repository which contains all my work related to Machine Learning, AI and Data Science. This includes my graduate projects, machine learning competition codes, algorithm implementations and reading material.
Stars: ✭ 137 (-94.81%)
ppo-pytorch
Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)
Stars: ✭ 83 (-96.86%)
RCNN MDP
Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.
Stars: ✭ 65 (-97.54%)
Meta-SAC
Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
Stars: ✭ 19 (-99.28%)
Learning2run
Our NIPS 2017: Learning to Run source code
Stars: ✭ 57 (-97.84%)
learn-hippo
Python (PyTorch) code for Lu, Q., Hasson, U. & Norman, K. A. (2021). When to retrieve and encode episodic memories: a neural network model of hippocampal-cortical interaction.
Stars: ✭ 12 (-99.55%)
SARNet
Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)
Stars: ✭ 14 (-99.47%)
Awesome Deep Neuroevolution
A collection of resources on Deep Neuroevolution, or evolutionary algorithms applied to Deep Learning (constantly updated)
Stars: ✭ 150 (-94.32%)