835 open-source projects that are alternatives to or similar to Pytorch A2c Ppo Acktr Gail

Policy Gradient Methods

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Stars: ✭ 54 (-97.95%)

Mutual labels: reinforcement-learning

Parl

A high-performance distributed training framework for Reinforcement Learning

Stars: ✭ 2,348 (-10.79%)

Mutual labels: reinforcement-learning

Gym Continuousdoubleauction

A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.

Stars: ✭ 50 (-98.1%)

Mutual labels: ppo

Drl paper summary

Summary of key papers in deep reinforcement learning. Heavily based on OpenAI SpinningUp.

Stars: ✭ 49 (-98.14%)

Mutual labels: deep-reinforcement-learning

Pomdpy

POMDPs in Python.

Stars: ✭ 183 (-93.05%)

Mutual labels: reinforcement-learning

Gbrain

GPU JavaScript library for machine learning

Stars: ✭ 48 (-98.18%)

Mutual labels: reinforcement-learning

Drl Portfolio Management

CSCI 599 deep learning and its applications final project

Stars: ✭ 121 (-95.4%)

Mutual labels: deep-reinforcement-learning

World Models Sonic Pytorch

Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more work needed

Stars: ✭ 27 (-98.97%)

Mutual labels: reinforcement-learning

Navbot

Using RGB Image as Visual Input for Mapless Robot Navigation

Stars: ✭ 111 (-95.78%)

Mutual labels: reinforcement-learning

Doyouevenlearn

Essential Guide to keep up with AI/ML/DL/CV

Stars: ✭ 913 (-65.31%)

Mutual labels: reinforcement-learning

Rl trading

An environment for high-frequency trading agents under reinforcement learning

Stars: ✭ 205 (-92.21%)

Mutual labels: reinforcement-learning

Udacity Deep Learning Nanodegree

A collection of projects made during my Deep Learning Nanodegree by Udacity

Stars: ✭ 15 (-99.43%)

Mutual labels: reinforcement-learning

Tensorflow rlre

Reinforcement Learning for Relation Classification from Noisy Data (TensorFlow)

Stars: ✭ 150 (-94.3%)

Mutual labels: reinforcement-learning

Hawq

Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.

Stars: ✭ 108 (-95.9%)

Mutual labels: hessian

Evolutionary Algorithm

Evolutionary algorithms using Python, from the 莫烦Python (Morvan Python) Chinese-language AI tutorials

Stars: ✭ 881 (-66.53%)

Mutual labels: reinforcement-learning

Gym Gridworlds

Gridworld environments for OpenAI gym.

Stars: ✭ 43 (-98.37%)

Mutual labels: reinforcement-learning

Muzero

A structured implementation of MuZero

Stars: ✭ 156 (-94.07%)

Mutual labels: reinforcement-learning

Machine Learning From Scratch

Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.

Stars: ✭ 42 (-98.4%)

Mutual labels: reinforcement-learning

Promp

ProMP: Proximal Meta-Policy Search

Stars: ✭ 181 (-93.12%)

Mutual labels: reinforcement-learning

Qualia2.0

Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.

Stars: ✭ 41 (-98.44%)

Mutual labels: reinforcement-learning

Reinforcementlearninganintroduction.jl

Julia code for the book Reinforcement Learning An Introduction

Stars: ✭ 117 (-95.55%)

Mutual labels: reinforcement-learning

Senseact

SenseAct: A computational framework for developing real-world robot learning tasks

Stars: ✭ 153 (-94.19%)

Mutual labels: reinforcement-learning

Pairstrade Fyp 2019

We tested three approaches to pairs trading: distance, cointegration, and reinforcement learning.

Stars: ✭ 109 (-95.86%)

Mutual labels: reinforcement-learning

C51 Ddqn Keras

C51-DDQN in Keras

Stars: ✭ 115 (-95.63%)

Mutual labels: reinforcement-learning

Energy Py

Reinforcement learning for energy systems

Stars: ✭ 148 (-94.38%)

Mutual labels: reinforcement-learning

A3c Pytorch

PyTorch implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm

Stars: ✭ 108 (-95.9%)

Mutual labels: deep-reinforcement-learning

My bibliography for research on autonomous driving

Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"

Stars: ✭ 197 (-92.52%)

Mutual labels: reinforcement-learning

Numpy Ml

Machine learning, in numpy

Stars: ✭ 11,100 (+321.73%)

Mutual labels: reinforcement-learning

Stable Baselines

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Stars: ✭ 115 (-95.63%)

Mutual labels: reinforcement-learning

Iccv2019 Learningtopaint

ICCV2019 - A painting AI that can reproduce paintings stroke by stroke using deep reinforcement learning.

Stars: ✭ 1,995 (-24.2%)

Mutual labels: reinforcement-learning

Ailearnnotes

Artificial Intelligence Learning Notes.

Stars: ✭ 195 (-92.59%)

Mutual labels: reinforcement-learning

Reinforce.jl

Abstractions, algorithms, and utilities for reinforcement learning in Julia

Stars: ✭ 178 (-93.24%)

Mutual labels: reinforcement-learning

Doudizhu

AI for Dou Dizhu (斗地主), a Chinese card game

Stars: ✭ 149 (-94.34%)

Mutual labels: reinforcement-learning

Deeptraffic

DeepTraffic is a deep reinforcement learning competition, part of the MIT Deep Learning series.

Stars: ✭ 1,528 (-41.95%)

Mutual labels: deep-reinforcement-learning

Stock Price Trade Analyzer

This is a Python 3.0 project for analyzing stock prices and methods of stock trading. It uses native Python tools and Google TensorFlow machine learning.

Stars: ✭ 35 (-98.67%)

Mutual labels: reinforcement-learning

Adahessian

ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning

Stars: ✭ 114 (-95.67%)

Mutual labels: hessian

Andrew Ng Notes

Handwritten notes from Andrew Ng's Coursera courses.

Stars: ✭ 180 (-93.16%)

Mutual labels: reinforcement-learning

Minecraft Reinforcement Learning

Deep Recurrent Q-Learning vs Deep Q Learning on a simple Partially Observable Markov Decision Process with Minecraft

Stars: ✭ 33 (-98.75%)

Mutual labels: deep-reinforcement-learning

Emdp

Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations

Stars: ✭ 31 (-98.82%)

Mutual labels: reinforcement-learning

Mango

A high-performance, open-source Java RPC framework.

Stars: ✭ 150 (-94.3%)

Mutual labels: hessian

Studybook

Study E-Book (ComputerVision, DeepLearning, MachineLearning, Math, NLP, Python, ReinforcementLearning)

Stars: ✭ 1,457 (-44.64%)

Mutual labels: reinforcement-learning

Impala Distributed Tensorflow

Stars: ✭ 28 (-98.94%)

Mutual labels: reinforcement-learning

Tensorflow2 Deep Reinforcement Learning

Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"

Stars: ✭ 204 (-92.25%)

Mutual labels: deep-reinforcement-learning

Gym

Seoul AI Gym is a toolkit for developing AI algorithms.

Stars: ✭ 27 (-98.97%)

Mutual labels: reinforcement-learning

Handful Of Trials Pytorch

Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

Stars: ✭ 112 (-95.74%)

Mutual labels: reinforcement-learning

Awesome Ai In Finance

🔬 A curated list of awesome machine learning strategies & tools in financial market.

Stars: ✭ 910 (-65.43%)

Mutual labels: reinforcement-learning

Tradzqai

Trading environment for RL agents, backtesting and training.

Stars: ✭ 150 (-94.3%)

Mutual labels: reinforcement-learning

Gym Dart

OpenAI Gym environments using DART

Stars: ✭ 20 (-99.24%)

Mutual labels: reinforcement-learning

Tetris Ai

A deep reinforcement learning bot that plays Tetris

Stars: ✭ 109 (-95.86%)

Mutual labels: deep-reinforcement-learning

Acis

Actor-Critic Instance Segmentation (CVPR 2019)

Stars: ✭ 15 (-99.43%)

Mutual labels: reinforcement-learning

Gail Tf

Tensorflow implementation of generative adversarial imitation learning

Stars: ✭ 179 (-93.2%)

Mutual labels: reinforcement-learning

Mojitalk

Code for "MojiTalk: Generating Emotional Responses at Scale" https://arxiv.org/abs/1711.04090

Stars: ✭ 107 (-95.93%)

Mutual labels: reinforcement-learning

Go Bot Drl

Goal-Oriented Chatbot trained with Deep Reinforcement Learning

Stars: ✭ 149 (-94.34%)

Mutual labels: deep-reinforcement-learning

Cartpole

OpenAI's cartpole env solver.

Stars: ✭ 107 (-95.93%)

Mutual labels: reinforcement-learning

Deep Reinforcement Learning For Dialogue Generation In Tensorflow

Deep-Reinforcement-Learning-for-Dialogue-Generation-in-tensorflow

Stars: ✭ 178 (-93.24%)

Mutual labels: deep-reinforcement-learning

Lang Emerge Parlai

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Stars: ✭ 106 (-95.97%)

Mutual labels: reinforcement-learning

Open Quadruped

An open-source 3D-printed quadrupedal robot. Intuitive gait generation through 12-DOF Bezier Curves. Full 6-axis body pose manipulation. Custom 3DOF Leg Inverse Kinematics Model accounting for offsets.

Stars: ✭ 148 (-94.38%)

Mutual labels: reinforcement-learning

Sofa Hessian

An internal improved version of Hessian powered by Ant Financial.

Stars: ✭ 105 (-96.01%)

Mutual labels: hessian

Macad Gym

Multi-Agent Connected Autonomous Driving (MACAD) Gym environments for Deep RL. Code for the paper presented in the Machine Learning for Autonomous Driving Workshop at NeurIPS 2019:

Stars: ✭ 106 (-95.97%)

Mutual labels: deep-reinforcement-learning

Chanlun

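The file 笔和线段的一种划分.py: feed in the high and low data of the candlesticks (K-lines) and it automatically partitions strokes, line segments, pivots, buy/sell points, and trend types. sh.csv can be used as the input file. For my résumé, see the .pdf. The power of time. Some say market timing is hard, some say stock picking is easy, some say the IT infrastructure needed for statistical arbitrage matters a great deal, and others say the system has an uncertainty principle. Opinions differ. In a distributed system, only when your own influence is negligible can you achieve what Chairman Jiang called "making a fortune quietly."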

Stars: ✭ 206 (-92.17%)

Mutual labels: deep-reinforcement-learning
