Gym UnrealcvUnreal environments for reinforcement learning
Stars: ✭ 202 (-5.61%)
Deep CfrScalable Implementation of Deep CFR and Single Deep CFR
Stars: ✭ 158 (-26.17%)
Gym SokobanSokoban environment for OpenAI Gym
Stars: ✭ 186 (-13.08%)
ResourcesResources on various topics being worked on at IvLabs
Stars: ✭ 158 (-26.17%)
Alphazero gomokuAn implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Stars: ✭ 2,570 (+1100.93%)
MuzeroA structured implementation of MuZero
Stars: ✭ 156 (-27.1%)
RlcycleA library for ready-made reinforcement learning agents and reusable components for neat prototyping
Stars: ✭ 184 (-14.02%)
AgentsTF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Stars: ✭ 2,135 (+897.66%)
Alpha Zero GeneralA clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Stars: ✭ 2,617 (+1122.9%)
Gym FxForex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress
Stars: ✭ 151 (-29.44%)
PomdpyPOMDPs in Python.
Stars: ✭ 183 (-14.49%)
Tensorflow rlreReinforcement Learning for Relation Classification from Noisy Data(TensorFlow)
Stars: ✭ 150 (-29.91%)
PrompProMP: Proximal Meta-Policy Search
Stars: ✭ 181 (-15.42%)
Open QuadrupedAn open-source 3D-printed quadrupedal robot. Intuitive gait generation through 12-DOF Bezier Curves. Full 6-axis body pose manipulation. Custom 3DOF Leg Inverse Kinematics Model accounting for offsets.
Stars: ✭ 148 (-30.84%)
ReleaseDeep Reinforcement Learning for de-novo Drug Design
Stars: ✭ 201 (-6.07%)
RainbowA PyTorch implementation of Rainbow DQN agent
Stars: ✭ 147 (-31.31%)
Andrew Ng NotesThis is Andrew NG Coursera Handwritten Notes.
Stars: ✭ 180 (-15.89%)
Show Adapt And TellCode for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Stars: ✭ 146 (-31.78%)
Tensor2tensorLibrary of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Stars: ✭ 11,865 (+5444.39%)
Gail TfTensorflow implementation of generative adversarial imitation learning
Stars: ✭ 179 (-16.36%)
AllenactAn open source framework for research in Embodied-AI from AI2.
Stars: ✭ 144 (-32.71%)
Tensorflow RlImplementations of deep RL papers and random experimentation
Stars: ✭ 176 (-17.76%)
GbA minimal C implementation of Nintendo Gameboy - An fast research environment for Reinforcement Learning
Stars: ✭ 143 (-33.18%)
PokerrlFramework for Multi-Agent Deep Reinforcement Learning in Poker
Stars: ✭ 214 (+0%)
Awesome Deep Learning Papers For Search Recommendation AdvertisingAwesome Deep Learning papers for industrial Search, Recommendation and Advertising. They focus on Embedding, Matching, Ranking (CTR prediction, CVR prediction), Post Ranking, Transfer, Reinforcement Learning, Self-supervised Learning and so on.
Stars: ✭ 136 (-36.45%)
Safe learningSafe reinforcement learning with stability guarantees
Stars: ✭ 140 (-34.58%)
Ai plays snakeAI trained using Genetic Algorithm and Deep Learning to play the game of snake
Stars: ✭ 137 (-35.98%)
AtariAI research environment for the Atari 2600 games 🤖.
Stars: ✭ 174 (-18.69%)
Policy GradientMinimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
Stars: ✭ 135 (-36.92%)
Rl tradingAn environment to high-frequency trading agents under reinforcement learning
Stars: ✭ 205 (-4.21%)
Ml AgentsUnity Machine Learning Agents Toolkit
Stars: ✭ 12,134 (+5570.09%)
JerichoA learning environment for man-made Interactive Fiction games.
Stars: ✭ 173 (-19.16%)
Hindsight Experience ReplayThis is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
Stars: ✭ 134 (-37.38%)
PaacOpen source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning
Stars: ✭ 196 (-8.41%)
Saltie🚗 Rocket League Distributed Deep Reinforcement Learning Bot
Stars: ✭ 134 (-37.38%)
Gym Pybullet DronesPyBullet Gym environments for single and multi-agent reinforcement learning of quadcopter control
Stars: ✭ 168 (-21.5%)
Move37Coding Demos from the School of AI's Move37 Course
Stars: ✭ 130 (-39.25%)
AutomataA comprehensive autonomous decentralized systems framework for AI control architects.
Stars: ✭ 130 (-39.25%)
A2cA Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Stars: ✭ 169 (-21.03%)
ToycarirlImplementation of Inverse Reinforcement Learning Algorithm on a toy car in a 2D world problem, (Apprenticeship Learning via Inverse Reinforcement Learning Abbeel & Ng, 2004)
Stars: ✭ 128 (-40.19%)
Dm controlDeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Stars: ✭ 2,592 (+1111.21%)
BanditmlA lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
Stars: ✭ 127 (-40.65%)
AcmeA library of reinforcement learning components and agents
Stars: ✭ 2,441 (+1040.65%)
Modular Rl[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"
Stars: ✭ 126 (-41.12%)
EpgCode for the paper "Evolved Policy Gradients"
Stars: ✭ 204 (-4.67%)
Accel Brain CodeThe purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.
Stars: ✭ 166 (-22.43%)
AutodromeFramework and OpenAI Gym Environment for Autonomous Vehicle Development
Stars: ✭ 214 (+0%)
Reco PapersClassic papers and resources on recommendation
Stars: ✭ 2,804 (+1210.28%)
GymfcA universal flight control tuning framework
Stars: ✭ 210 (-1.87%)
MultihopkgMulti-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Stars: ✭ 202 (-5.61%)
Naf Tensorflow"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow
Stars: ✭ 192 (-10.28%)
Rl Baselines3 ZooA collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.
Stars: ✭ 161 (-24.77%)