All Projects → Deeprl_algorithms → Similar Projects or Alternatives

445 Open source projects that are alternatives of or similar to Deeprl_algorithms

Super Mario Bros Ppo Pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Stars: ✭ 649 (+569.07%)
Mutual labels:  ppo
tpprl
Code and data for "Deep Reinforcement Learning of Marked Temporal Point Processes", NeurIPS 2018
Stars: ✭ 68 (-29.9%)
Planet
Deep Planning Network: Control from pixels by latent planning with learned dynamics
Stars: ✭ 257 (+164.95%)
scala-rl
Functional Reinforcement Learning in Scala.
Stars: ✭ 26 (-73.2%)
deep trading
This project aims to select a supervised algorithm that can predict stock prices basing on historical data and use the predictor generated to form trading strategies.
Stars: ✭ 18 (-81.44%)
Mutual labels:  policy-gradient
Smart-Traffic-Signals-in-India-using-Deep-Reinforcement-Learning-and-Advanced-Computer-Vision
We have used Deep Reinforcement Learning and Advanced Computer Vision techniques to for the creation of Smart Traffic Signals for Indian Roads. We have created the scripts for using SUMO as our environment for deploying all our RL models.
Stars: ✭ 131 (+35.05%)
AlphaNPI
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
Stars: ✭ 71 (-26.8%)
Drl papernotes
Notes and comments about Deep Reinforcement Learning papers
Stars: ✭ 65 (-32.99%)
Chainerrl
ChainerRL is a deep reinforcement learning library built on top of Chainer.
Stars: ✭ 931 (+859.79%)
Mutual labels:  dqn
Awesome Monte Carlo Tree Search Papers
A curated list of Monte Carlo tree search papers with implementations.
Stars: ✭ 387 (+298.97%)
Mutual labels:  policy-gradient
chainer-notebooks
Jupyter notebooks for Chainer hands-on
Stars: ✭ 23 (-76.29%)
Mutual labels:  dqn
learn-hippo
Python (pytorch) code for Lu, Q., Hasson U. & Norman K. A. (2021). When to retrieve and encode episodic memories: a neural network model of hippocampal-cortical interaction.
Stars: ✭ 12 (-87.63%)
Underflow
With underflow, create trafic light clusters that interact together to regulate circulation
Stars: ✭ 12 (-87.63%)
UAV obstacle avoidance controller
UAV Obstacle Avoidance using Deep Recurrent Reinforcement Learning with Temporal Attention
Stars: ✭ 61 (-37.11%)
Deepdrive
Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
Stars: ✭ 628 (+547.42%)
Reinforcement-Learning-on-google-colab
Reinforcement Learning algorithm's using google-colab
Stars: ✭ 33 (-65.98%)
CommNet
an implementation of CommNet
Stars: ✭ 23 (-76.29%)
Ml From Scratch
Python implementations of some of the fundamental Machine Learning models and algorithms from scratch.
Stars: ✭ 20,624 (+21161.86%)
semantic-guidance
Code for our CVPR-2021 paper on Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings.
Stars: ✭ 19 (-80.41%)
Async Deeprl
Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning
Stars: ✭ 44 (-54.64%)
Paac.pytorch
Pytorch implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning https://arxiv.org/abs/1705.04862
Stars: ✭ 22 (-77.32%)
racing dreamer
Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing
Stars: ✭ 31 (-68.04%)
rtrl
PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)
Stars: ✭ 57 (-41.24%)
AutoPentest-DRL
AutoPentest-DRL: Automated Penetration Testing Using Deep Reinforcement Learning
Stars: ✭ 196 (+102.06%)
Starcraft
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
Stars: ✭ 372 (+283.51%)
pokeai
Develop ultimate AI Pokémon trainer
Stars: ✭ 18 (-81.44%)
On Policy
This is the official implementation of Multi-Agent PPO.
Stars: ✭ 63 (-35.05%)
Mutual labels:  ppo
mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Stars: ✭ 4,685 (+4729.9%)
Mutual labels:  mujoco
Tf chatbot seq2seq antilm
Seq2seq chatbot with attention and anti-language model to suppress generic response, option for further improve by deep reinforcement learning.
Stars: ✭ 369 (+280.41%)
connect4
Solving board games like Connect4 using Deep Reinforcement Learning
Stars: ✭ 33 (-65.98%)
Mutual labels:  policy-gradient
Learningx
Deep & Classical Reinforcement Learning + Machine Learning Examples in Python
Stars: ✭ 241 (+148.45%)
pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
Stars: ✭ 47 (-51.55%)
ADL2019
Applied Deep Learning (2019 Spring) @ NTU
Stars: ✭ 20 (-79.38%)
Mutual labels:  policy-gradient
godpaper
🐵 An AI chess-board-game framework(by many programming languages) implementations.
Stars: ✭ 40 (-58.76%)
drl grasping
Deep Reinforcement Learning for Robotic Grasping from Octrees
Stars: ✭ 160 (+64.95%)
Gam
A PyTorch implementation of "Graph Classification Using Structural Attention" (KDD 2018).
Stars: ✭ 227 (+134.02%)
Recurrent-Deep-Q-Learning
Solving POMDP using Recurrent networks
Stars: ✭ 52 (-46.39%)
Mutual labels:  dqn
Trending Deep Learning
Top 100 trending deep learning repositories sorted by the number of stars gained on a specific day.
Stars: ✭ 543 (+459.79%)
mujoco-ur5-model
Mujoco Model for UR5-Ridgeback-Robotiq Robot
Stars: ✭ 22 (-77.32%)
Mutual labels:  mujoco
FinRL
FinRL: The first open-source project for financial reinforcement learning. Please star. 🔥
Stars: ✭ 3,497 (+3505.15%)
Pygame Learning Environment
PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.
Stars: ✭ 828 (+753.61%)
Text summurization abstractive methods
Multiple implementations for abstractive text summurization , using google colab
Stars: ✭ 359 (+270.1%)
Mutual labels:  policy-gradient
datascience-mashup
In this repo I will try to gather all of the projects related to data science with clean datasets and high accuracy models to solve real world problems.
Stars: ✭ 36 (-62.89%)
Chanlun
文件 笔和线段的一种划分.py,只需要把k线high,low数据输入,就能自动实现笔,线段,中枢,买卖点,走势类型的划分了。可以把sh.csv 作为输入文件。个人简历见.pdf。时间的力量。有人说择时很困难,有人说选股很容易,有人说统计套利需要的IT配套设施很重要。还有人说系统有不可测原理。众说纷纭。分布式的系统,当你的影响可以被忽略,你才能实现,Jiang主席所谓之,闷声发大财。
Stars: ✭ 206 (+112.37%)
Gym trading
Stars: ✭ 87 (-10.31%)
Mutual labels:  dqn
Papers
Summaries of machine learning papers
Stars: ✭ 2,362 (+2335.05%)
maze solver
This project solves self-made maze in a variety of ways: A-star, Q-learning and Deep Q-network.
Stars: ✭ 24 (-75.26%)
Mutual labels:  dqn
Atari Model Zoo
A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release that enables easy visualization and analysis of models, and comparison across training algorithms.
Stars: ✭ 198 (+104.12%)
Ai Economist
Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).
Stars: ✭ 537 (+453.61%)
Reinforcementlearning.jl
A reinforcement learning package for Julia
Stars: ✭ 192 (+97.94%)
rlflow
A TensorFlow-based framework for learning about and experimenting with reinforcement learning algorithms
Stars: ✭ 20 (-79.38%)
Deepqlearning.jl
Implementation of the Deep Q-learning algorithm to solve MDPs
Stars: ✭ 38 (-60.82%)
DDPG
End to End Mobile Robot Navigation using DDPG (Continuous Control with Deep Reinforcement Learning) based on Tensorflow + Gazebo
Stars: ✭ 41 (-57.73%)
Pytorch Cpp Rl
PyTorch C++ Reinforcement Learning
Stars: ✭ 353 (+263.92%)
Mutual labels:  ppo
AI booklet CE-AUT
Booklet and exam of Artificial Intelligence Master Degree at Amirkabir University of technology.
Stars: ✭ 14 (-85.57%)
Malmo Challenge
Malmo Collaborative AI Challenge - Team Pig Catcher
Stars: ✭ 64 (-34.02%)
Tensorlayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥
Stars: ✭ 6,796 (+6906.19%)
Mutual labels:  dqn
neural-mpc
No description or website provided.
Stars: ✭ 54 (-44.33%)
muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Stars: ✭ 126 (+29.9%)
RL
Reinforcement Learning Demos
Stars: ✭ 66 (-31.96%)
301-360 of 445 similar projects