PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Stars: ✭ 266 (-16.61%)

Mutual labels: reinforcement-learning

DeepBeerInventory-RL

The code for the SRDQN algorithm to train an agent for the beer game problem

Stars: ✭ 27 (-91.54%)

Mutual labels: deep-reinforcement-learning

ddpg biped

Repository for Planar Bipedal walking robot in Gazebo environment using Deep Deterministic Policy Gradient(DDPG) using TensorFlow.

Stars: ✭ 65 (-79.62%)

Mutual labels: ddpg

Coax

This project was moved to: https://github.com/coax-dev/coax

Stars: ✭ 166 (-47.96%)

Mutual labels: reinforcement-learning

Coach

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

Stars: ✭ 2,085 (+553.61%)

Mutual labels: reinforcement-learning

mmn

Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks

Stars: ✭ 39 (-87.77%)

Mutual labels: deep-reinforcement-learning

Rl Baselines3 Zoo

A collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.

Stars: ✭ 161 (-49.53%)

Mutual labels: reinforcement-learning

Mindpark

Testbed for deep reinforcement learning

Stars: ✭ 163 (-48.9%)

Mutual labels: reinforcement-learning

reinforcement-learning-papers

My notes on reinforcement learning papers

Stars: ✭ 13 (-95.92%)

Mutual labels: deep-reinforcement-learning

decentralized-rl

Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)

Stars: ✭ 40 (-87.46%)

Mutual labels: deep-reinforcement-learning

Mjrl

Reinforcement learning algorithms for MuJoCo tasks

Stars: ✭ 162 (-49.22%)

Mutual labels: reinforcement-learning

Awesome Ai

A curated list of artificial intelligence resources (Courses, Tools, App, Open Source Project)

Stars: ✭ 161 (-49.53%)

Mutual labels: reinforcement-learning

pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch - ICML 2022

Stars: ✭ 162 (-49.22%)

Mutual labels: deep-reinforcement-learning

Tf Adnet Tracking

Deep Object Tracking Implementation in Tensorflow for 'Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning(CVPR 2017)'

Stars: ✭ 162 (-49.22%)

Mutual labels: reinforcement-learning

dqn zoo

The implement of all kinds of dqn reinforcement learning with Pytorch

Stars: ✭ 42 (-86.83%)

Mutual labels: dqn

Deep Cfr

Scalable Implementation of Deep CFR and Single Deep CFR

Stars: ✭ 158 (-50.47%)

Mutual labels: reinforcement-learning

Parl

A high-performance distributed training framework for Reinforcement Learning

Stars: ✭ 2,348 (+636.05%)

Mutual labels: reinforcement-learning

Resources

Resources on various topics being worked on at IvLabs

Stars: ✭ 158 (-50.47%)

Mutual labels: reinforcement-learning

Java Deep Learning Cookbook

Code for Java Deep Learning Cookbook

Stars: ✭ 156 (-51.1%)

Mutual labels: reinforcement-learning

Deep-Reinforcement-Learning

Introduction to Deep Reinforcement Learning

Stars: ✭ 71 (-77.74%)

Mutual labels: deep-reinforcement-learning

Muzero

A structured implementation of MuZero

Stars: ✭ 156 (-51.1%)

Mutual labels: reinforcement-learning

Senseact

SenseAct: A computational framework for developing real-world robot learning tasks

Stars: ✭ 153 (-52.04%)

Mutual labels: reinforcement-learning

Iccv2019 Learningtopaint

ICCV2019 - A painting AI that can reproduce paintings stroke by stroke using deep reinforcement learning.

Stars: ✭ 1,995 (+525.39%)

Mutual labels: reinforcement-learning

Machinelearningroguelike

A small Roguelike game that uses Machine Learning to power its entities. Originally used in talks by Ciro & Alessia.

Stars: ✭ 270 (-15.36%)

Mutual labels: reinforcement-learning

catalyst-examples

Examples

Stars: ✭ 54 (-83.07%)

Mutual labels: deep-reinforcement-learning

Gym Fx

Forex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress

Stars: ✭ 151 (-52.66%)

Mutual labels: reinforcement-learning

Tradzqai

Trading environnement for RL agents, backtesting and training.

Stars: ✭ 150 (-52.98%)

Mutual labels: reinforcement-learning

Tensorflow rlre

Reinforcement Learning for Relation Classification from Noisy Data(TensorFlow)

Stars: ✭ 150 (-52.98%)

Mutual labels: reinforcement-learning

Ramudroid

Ramudroid, autonomous solar-powered robot to clean roads, realtime object detection and webrtc based streaming

Stars: ✭ 22 (-93.1%)

Mutual labels: deep-reinforcement-learning

Energy Py

Reinforcement learning for energy systems

Stars: ✭ 148 (-53.61%)

Mutual labels: reinforcement-learning

Pytorch-PCGrad

Pytorch reimplementation for "Gradient Surgery for Multi-Task Learning"

Stars: ✭ 179 (-43.89%)

Mutual labels: deep-reinforcement-learning

Open Quadruped

An open-source 3D-printed quadrupedal robot. Intuitive gait generation through 12-DOF Bezier Curves. Full 6-axis body pose manipulation. Custom 3DOF Leg Inverse Kinematics Model accounting for offsets.

Stars: ✭ 148 (-53.61%)

Mutual labels: reinforcement-learning

interp-e2e-driving

Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning

Stars: ✭ 159 (-50.16%)

Mutual labels: deep-reinforcement-learning

Study Reinforcement Learning

Studying Reinforcement Learning Guide

Stars: ✭ 147 (-53.92%)

Mutual labels: reinforcement-learning

Dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Stars: ✭ 269 (-15.67%)

Mutual labels: reinforcement-learning

snake-reinforcement-DNN

Developing a deep neural network to play a snake game

Stars: ✭ 12 (-96.24%)

Mutual labels: deep-reinforcement-learning

AI BIG DATAS ALGORITHM

大数据+人工智能+数据结构相关案例项目

Stars: ✭ 28 (-91.22%)

Mutual labels: dqn

Chess Alpha Zero

Chess reinforcement learning by AlphaGo Zero methods.

Stars: ✭ 1,868 (+485.58%)

Mutual labels: reinforcement-learning

Show Adapt And Tell

Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017

Stars: ✭ 146 (-54.23%)

Mutual labels: reinforcement-learning

alphastone

Using self-play, MCTS, and a deep neural network to create a hearthstone ai player

Stars: ✭ 24 (-92.48%)

Mutual labels: deep-reinforcement-learning

Rl Book Challenge

self-studying the Sutton & Barto the hard way

Stars: ✭ 146 (-54.23%)

Mutual labels: reinforcement-learning

Tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Stars: ✭ 11,865 (+3619.44%)

Mutual labels: reinforcement-learning

Reinforcement Learning

Research repo of RL

Stars: ✭ 20 (-93.73%)

Mutual labels: deep-reinforcement-learning

cups-rl

Customisable Unified Physical Simulations (CUPS) for Reinforcement Learning. Experiments run on the ai2thor environment (http://ai2thor.allenai.org/) e.g. using A3C, RainbowDQN and A3C_GA (Gated Attention multi-modal fusion) for Task-Oriented Language Grounding (tasks specified by natural language instructions) e.g. "Pick up the Cup or else"

Stars: ✭ 38 (-88.09%)

Mutual labels: a3c

Sumo Rl

A simple interface to instantiate Reinforcement Learning environments with SUMO for Traffic Signal Control. Compatible with Gym Env from OpenAI and MultiAgentEnv from RLlib.

Stars: ✭ 145 (-54.55%)

Mutual labels: reinforcement-learning

rl trading

No description or website provided.

Stars: ✭ 14 (-95.61%)

Mutual labels: ppo

Articulations Robot Demo

Stars: ✭ 145 (-54.55%)

Mutual labels: reinforcement-learning

Allenact

An open source framework for research in Embodied-AI from AI2.

Stars: ✭ 144 (-54.86%)