All Projects → Policy Gradient → Similar Projects or Alternatives

769 Open source projects that are alternatives of or similar to Policy Gradient

Handful Of Trials Pytorch
Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Stars: ✭ 112 (-17.04%)
Mutual labels:  reinforcement-learning
Safeopt
Safe Bayesian Optimization
Stars: ✭ 90 (-33.33%)
Mutual labels:  reinforcement-learning
Meta rl
The Tensorflow code and a DeepMind Lab wrapper for my article "Meta-Reinforcement Learning" on FloydHub.
Stars: ✭ 36 (-73.33%)
Mutual labels:  reinforcement-learning
Outlace.github.io
Machine learning and data science blog.
Stars: ✭ 65 (-51.85%)
Mutual labels:  reinforcement-learning
Malmo Challenge
Malmo Collaborative AI Challenge - Team Pig Catcher
Stars: ✭ 64 (-52.59%)
Inverse Reinforcement Learning
Implementations of selected inverse reinforcement learning algorithms.
Stars: ✭ 522 (+286.67%)
Mutual labels:  reinforcement-learning
Openaigym
Solving OpenAI Gym problems.
Stars: ✭ 98 (-27.41%)
Mutual labels:  reinforcement-learning
Visual Pushing Grasping
Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
Stars: ✭ 516 (+282.22%)
Tnt
Simple tools for logging and visualizing, loading and training
Stars: ✭ 1,298 (+861.48%)
Mutual labels:  reinforcement-learning
Stock Price Trade Analyzer
This is a Python 3.0 project for analyzing stock prices and methods of stock trading. It uses native Python tools and Google TensorFlow machine learning.
Stars: ✭ 35 (-74.07%)
Mutual labels:  reinforcement-learning
Holdem
🃏 OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning
Stars: ✭ 135 (+0%)
Mutual labels:  reinforcement-learning
Mario rl
Stars: ✭ 60 (-55.56%)
Mutual labels:  reinforcement-learning
Navbot
Using RGB Image as Visual Input for Mapless Robot Navigation
Stars: ✭ 111 (-17.78%)
Mutual labels:  reinforcement-learning
Categorical Dqn
A working implementation of the Categorical DQN (Distributional RL).
Stars: ✭ 90 (-33.33%)
Mutual labels:  reinforcement-learning
Artificialintelligenceengines
Computer code collated for use with Artificial Intelligence Engines book by JV Stone
Stars: ✭ 35 (-74.07%)
Mutual labels:  reinforcement-learning
Rosettastone
Hearthstone simulator using C++ with some reinforcement learning
Stars: ✭ 510 (+277.78%)
Mutual labels:  reinforcement-learning
Rlai Exercises
Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]
Stars: ✭ 97 (-28.15%)
Mutual labels:  reinforcement-learning
Seqgan
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Stars: ✭ 502 (+271.85%)
Mutual labels:  policy-gradient
Nlp overview
Overview of Modern Deep Learning Techniques Applied to Natural Language Processing
Stars: ✭ 1,104 (+717.78%)
Mutual labels:  reinforcement-learning
Reaver
Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.
Stars: ✭ 499 (+269.63%)
Mutual labels:  reinforcement-learning
Reinforcementlearninganintroduction.jl
Julia code for the book Reinforcement Learning An Introduction
Stars: ✭ 117 (-13.33%)
Mutual labels:  reinforcement-learning
Autokernel
AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。
Stars: ✭ 485 (+259.26%)
Mutual labels:  reinforcement-learning
Nlg Rl
Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction
Stars: ✭ 59 (-56.3%)
Mutual labels:  reinforcement-learning
Torchcraft
Connecting Torch to StarCraft
Stars: ✭ 1,341 (+893.33%)
Mutual labels:  reinforcement-learning
Learning2run
Our NIPS 2017: Learning to Run source code
Stars: ✭ 57 (-57.78%)
Mutual labels:  reinforcement-learning
Vowpal wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
Stars: ✭ 7,815 (+5688.89%)
Mutual labels:  reinforcement-learning
Learning Notes
💡 Repo of learning notes in DRL and DL, theory, codes, models and notes maybe.
Stars: ✭ 90 (-33.33%)
Minecraft Reinforcement Learning
Deep Recurrent Q-Learning vs Deep Q Learning on a simple Partially Observable Markov Decision Process with Minecraft
Stars: ✭ 33 (-75.56%)
Neurojs
A JavaScript deep learning and reinforcement learning library.
Stars: ✭ 4,344 (+3117.78%)
Mutual labels:  reinforcement-learning
Tictactoe
Tic Tac Toe Machine Learning
Stars: ✭ 56 (-58.52%)
Mutual labels:  reinforcement-learning
Researchpapernotes
Initiative to read research papers
Stars: ✭ 97 (-28.15%)
Mutual labels:  reinforcement-learning
Reinforcement Learning
Implementation of Reinforcement Learning algorithms in Python, based on Sutton's & Barto's Book (Ed. 2)
Stars: ✭ 55 (-59.26%)
Mutual labels:  reinforcement-learning
Conversational Ai
Conversational AI Reading Materials
Stars: ✭ 34 (-74.81%)
Mutual labels:  reinforcement-learning
Tetris Ai
A deep reinforcement learning bot that plays tetris
Stars: ✭ 109 (-19.26%)
Torchrl
Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
Stars: ✭ 90 (-33.33%)
Mutual labels:  reinforcement-learning
Emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
Stars: ✭ 31 (-77.04%)
Mutual labels:  reinforcement-learning
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (+236.3%)
Mutual labels:  reinforcement-learning
C51 Ddqn Keras
C51-DDQN in Keras
Stars: ✭ 115 (-14.81%)
Mutual labels:  reinforcement-learning
Torch Light
Deep-learning by using Pytorch. Basic nns like Logistic, CNN, RNN, LSTM and some examples are implemented by complex model.
Stars: ✭ 451 (+234.07%)
Mutual labels:  reinforcement-learning
Reinforcepy
Collection of reinforcement learners implemented in python. Mainly including DQN and its variants
Stars: ✭ 54 (-60%)
Mutual labels:  reinforcement-learning
Pwnagotchi
(⌐■_■) - Deep Reinforcement Learning instrumenting bettercap for WiFi pwning.
Stars: ✭ 4,678 (+3365.19%)
Ngsim env
Learning human driver models from NGSIM data with imitation learning.
Stars: ✭ 96 (-28.89%)
Mutual labels:  reinforcement-learning
Deep Reinforcement Learning In Large Discrete Action Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
Stars: ✭ 132 (-2.22%)
Mapleai
AI各领域学习资料整理。(A collection of all skills and knowledges should be got command of to obtain an AI relevant job offer. There are online blogs, my personal blogs, electronic books copy.)
Stars: ✭ 89 (-34.07%)
Mutual labels:  reinforcement-learning
Pokerrl Omaha
Omaha Poker functionality+some features for PokerRL Reinforcement Learning card framwork
Stars: ✭ 31 (-77.04%)
Mutual labels:  reinforcement-learning
Policy Gradient Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Stars: ✭ 54 (-60%)
Mutual labels:  reinforcement-learning
Spot mini mini
Dynamics and Domain Randomized Gait Modulation with Bezier Curves for Sim-to-Real Legged Locomotion.
Stars: ✭ 426 (+215.56%)
Mutual labels:  reinforcement-learning
Starcraft Ai
Reinforcement Learning and Transfer Learning based StarCraft Micromanagement
Stars: ✭ 95 (-29.63%)
Mutual labels:  reinforcement-learning
Deep Reinforcement Learning Survey
My Exploration on Deep Reinforcement Learning Survey
Stars: ✭ 419 (+210.37%)
Mutual labels:  reinforcement-learning
Gym Minigrid
Minimalistic gridworld package for OpenAI Gym
Stars: ✭ 1,047 (+675.56%)
Mutual labels:  reinforcement-learning
Stable Baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Stars: ✭ 115 (-14.81%)
Mutual labels:  reinforcement-learning
Gym Panda
An OpenAI Gym Env for Panda
Stars: ✭ 29 (-78.52%)
Mutual labels:  reinforcement-learning
Pairstrade Fyp 2019
We tested 3 approaches for Pair Trading: distance, cointegration and reinforcement learning approach.
Stars: ✭ 109 (-19.26%)
Mutual labels:  reinforcement-learning
Hand dapg
Repository to accompany RSS 2018 paper on dexterous hand manipulation
Stars: ✭ 88 (-34.81%)
Mutual labels:  reinforcement-learning
Impala Distributed Tensorflow
Stars: ✭ 28 (-79.26%)
Mutual labels:  reinforcement-learning
Batch Ppo
Efficient Batched Reinforcement Learning in TensorFlow
Stars: ✭ 945 (+600%)
Mutual labels:  reinforcement-learning
Magnet
MAGNet: Multi-agents control using Graph Neural Networks
Stars: ✭ 88 (-34.81%)
Mutual labels:  reinforcement-learning
Gym
Seoul AI Gym is a toolkit for developing AI algorithms.
Stars: ✭ 27 (-80%)
Mutual labels:  reinforcement-learning
Keras Rl2
Reinforcement learning with tensorflow 2 keras
Stars: ✭ 134 (-0.74%)
Automata
A comprehensive autonomous decentralized systems framework for AI control architects.
Stars: ✭ 130 (-3.7%)
Mutual labels:  reinforcement-learning
301-360 of 769 similar projects