All Projects → Pytorch A2c Ppo Acktr Gail → Similar Projects or Alternatives

835 Open source projects that are alternatives of or similar to Pytorch A2c Ppo Acktr Gail

Policy Gradient Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Stars: ✭ 54 (-97.95%)
Mutual labels:  reinforcement-learning
Parl
A high-performance distributed training framework for Reinforcement Learning
Stars: ✭ 2,348 (-10.79%)
Mutual labels:  reinforcement-learning
Gym Continuousdoubleauction
A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.
Stars: ✭ 50 (-98.1%)
Mutual labels:  ppo
Drl paper summary
Summary of key papers in deep reinforcement learning. Heavily based on OpenAI SpinningUp.
Stars: ✭ 49 (-98.14%)
Pomdpy
POMDPs in Python.
Stars: ✭ 183 (-93.05%)
Mutual labels:  reinforcement-learning
Gbrain
GPU Javascript Library for Machine Learning
Stars: ✭ 48 (-98.18%)
Mutual labels:  reinforcement-learning
Drl Portfolio Management
CSCI 599 deep learning and its applications final project
Stars: ✭ 121 (-95.4%)
World Models Sonic Pytorch
Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more work needed
Stars: ✭ 27 (-98.97%)
Mutual labels:  reinforcement-learning
Navbot
Using RGB Image as Visual Input for Mapless Robot Navigation
Stars: ✭ 111 (-95.78%)
Mutual labels:  reinforcement-learning
Doyouevenlearn
Essential Guide to keep up with AI/ML/DL/CV
Stars: ✭ 913 (-65.31%)
Mutual labels:  reinforcement-learning
Rl trading
An environment to high-frequency trading agents under reinforcement learning
Stars: ✭ 205 (-92.21%)
Mutual labels:  reinforcement-learning
Udacity Deep Learning Nanodegree
This is just a collection of projects that made during my DEEPLEARNING NANODEGREE by UDACITY
Stars: ✭ 15 (-99.43%)
Mutual labels:  reinforcement-learning
Tensorflow rlre
Reinforcement Learning for Relation Classification from Noisy Data(TensorFlow)
Stars: ✭ 150 (-94.3%)
Mutual labels:  reinforcement-learning
Hawq
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
Stars: ✭ 108 (-95.9%)
Mutual labels:  hessian
Evolutionary Algorithm
Evolutionary Algorithm using Python, 莫烦Python 中文AI教学
Stars: ✭ 881 (-66.53%)
Mutual labels:  reinforcement-learning
Gym Gridworlds
Gridworld environments for OpenAI gym.
Stars: ✭ 43 (-98.37%)
Mutual labels:  reinforcement-learning
Muzero
A structured implementation of MuZero
Stars: ✭ 156 (-94.07%)
Mutual labels:  reinforcement-learning
Machine Learning From Scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
Stars: ✭ 42 (-98.4%)
Mutual labels:  reinforcement-learning
Promp
ProMP: Proximal Meta-Policy Search
Stars: ✭ 181 (-93.12%)
Mutual labels:  reinforcement-learning
Qualia2.0
Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-98.44%)
Mutual labels:  reinforcement-learning
Reinforcementlearninganintroduction.jl
Julia code for the book Reinforcement Learning An Introduction
Stars: ✭ 117 (-95.55%)
Mutual labels:  reinforcement-learning
Senseact
SenseAct: A computational framework for developing real-world robot learning tasks
Stars: ✭ 153 (-94.19%)
Mutual labels:  reinforcement-learning
Pairstrade Fyp 2019
We tested 3 approaches for Pair Trading: distance, cointegration and reinforcement learning approach.
Stars: ✭ 109 (-95.86%)
Mutual labels:  reinforcement-learning
C51 Ddqn Keras
C51-DDQN in Keras
Stars: ✭ 115 (-95.63%)
Mutual labels:  reinforcement-learning
Energy Py
Reinforcement learning for energy systems
Stars: ✭ 148 (-94.38%)
Mutual labels:  reinforcement-learning
A3c Pytorch
PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch
Stars: ✭ 108 (-95.9%)
My bibliography for research on autonomous driving
Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"
Stars: ✭ 197 (-92.52%)
Mutual labels:  reinforcement-learning
Numpy Ml
Machine learning, in numpy
Stars: ✭ 11,100 (+321.73%)
Mutual labels:  reinforcement-learning
Stable Baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Stars: ✭ 115 (-95.63%)
Mutual labels:  reinforcement-learning
Iccv2019 Learningtopaint
ICCV2019 - A painting AI that can reproduce paintings stroke by stroke using deep reinforcement learning.
Stars: ✭ 1,995 (-24.2%)
Mutual labels:  reinforcement-learning
Ailearnnotes
Artificial Intelligence Learning Notes.
Stars: ✭ 195 (-92.59%)
Mutual labels:  reinforcement-learning
Reinforce.jl
Abstractions, algorithms, and utilities for reinforcement learning in Julia
Stars: ✭ 178 (-93.24%)
Mutual labels:  reinforcement-learning
Doudizhu
AI斗地主
Stars: ✭ 149 (-94.34%)
Mutual labels:  reinforcement-learning
Deeptraffic
DeepTraffic is a deep reinforcement learning competition, part of the MIT Deep Learning series.
Stars: ✭ 1,528 (-41.95%)
Stock Price Trade Analyzer
This is a Python 3.0 project for analyzing stock prices and methods of stock trading. It uses native Python tools and Google TensorFlow machine learning.
Stars: ✭ 35 (-98.67%)
Mutual labels:  reinforcement-learning
Adahessian
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning
Stars: ✭ 114 (-95.67%)
Mutual labels:  hessian
Andrew Ng Notes
This is Andrew NG Coursera Handwritten Notes.
Stars: ✭ 180 (-93.16%)
Mutual labels:  reinforcement-learning
Minecraft Reinforcement Learning
Deep Recurrent Q-Learning vs Deep Q Learning on a simple Partially Observable Markov Decision Process with Minecraft
Stars: ✭ 33 (-98.75%)
Emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
Stars: ✭ 31 (-98.82%)
Mutual labels:  reinforcement-learning
Mango
A high-performance, open-source java RPC framework.
Stars: ✭ 150 (-94.3%)
Mutual labels:  hessian
Studybook
Study E-Book(ComputerVision DeepLearning MachineLearning Math NLP Python ReinforcementLearning)
Stars: ✭ 1,457 (-44.64%)
Mutual labels:  reinforcement-learning
Impala Distributed Tensorflow
Stars: ✭ 28 (-98.94%)
Mutual labels:  reinforcement-learning
Tensorflow2 Deep Reinforcement Learning
Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"
Stars: ✭ 204 (-92.25%)
Gym
Seoul AI Gym is a toolkit for developing AI algorithms.
Stars: ✭ 27 (-98.97%)
Mutual labels:  reinforcement-learning
Handful Of Trials Pytorch
Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Stars: ✭ 112 (-95.74%)
Mutual labels:  reinforcement-learning
Awesome Ai In Finance
🔬 A curated list of awesome machine learning strategies & tools in financial market.
Stars: ✭ 910 (-65.43%)
Mutual labels:  reinforcement-learning
Tradzqai
Trading environnement for RL agents, backtesting and training.
Stars: ✭ 150 (-94.3%)
Mutual labels:  reinforcement-learning
Gym Dart
OpenAI Gym environments using DART
Stars: ✭ 20 (-99.24%)
Mutual labels:  reinforcement-learning
Tetris Ai
A deep reinforcement learning bot that plays tetris
Stars: ✭ 109 (-95.86%)
Acis
Actor-Critic Instance Segmentation (CVPR 2019)
Stars: ✭ 15 (-99.43%)
Mutual labels:  reinforcement-learning
Gail Tf
Tensorflow implementation of generative adversarial imitation learning
Stars: ✭ 179 (-93.2%)
Mutual labels:  reinforcement-learning
Mojitalk
Code for "MojiTalk: Generating Emotional Responses at Scale" https://arxiv.org/abs/1711.04090
Stars: ✭ 107 (-95.93%)
Mutual labels:  reinforcement-learning
Go Bot Drl
Goal-Oriented Chatbot trained with Deep Reinforcement Learning
Stars: ✭ 149 (-94.34%)
Cartpole
OpenAI's cartpole env solver.
Stars: ✭ 107 (-95.93%)
Mutual labels:  reinforcement-learning
Deep Reinforcement Learning For Dialogue Generation In Tensorflow
Deep-Reinforcement-Learning-for-Dialogue-Generation-in-tensorflow
Stars: ✭ 178 (-93.24%)
Lang Emerge Parlai
Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI
Stars: ✭ 106 (-95.97%)
Mutual labels:  reinforcement-learning
Open Quadruped
An open-source 3D-printed quadrupedal robot. Intuitive gait generation through 12-DOF Bezier Curves. Full 6-axis body pose manipulation. Custom 3DOF Leg Inverse Kinematics Model accounting for offsets.
Stars: ✭ 148 (-94.38%)
Mutual labels:  reinforcement-learning
Sofa Hessian
An internal improved version of Hessian powered by Ant Financial.
Stars: ✭ 105 (-96.01%)
Mutual labels:  hessian
Macad Gym
Multi-Agent Connected Autonomous Driving (MACAD) Gym environments for Deep RL. Code for the paper presented in the Machine Learning for Autonomous Driving Workshop at NeurIPS 2019:
Stars: ✭ 106 (-95.97%)
Chanlun
文件 笔和线段的一种划分.py,只需要把k线high,low数据输入,就能自动实现笔,线段,中枢,买卖点,走势类型的划分了。可以把sh.csv 作为输入文件。个人简历见.pdf。时间的力量。有人说择时很困难,有人说选股很容易,有人说统计套利需要的IT配套设施很重要。还有人说系统有不可测原理。众说纷纭。分布式的系统,当你的影响可以被忽略,你才能实现,Jiang主席所谓之,闷声发大财。
Stars: ✭ 206 (-92.17%)
301-360 of 835 similar projects