Handful Of Trials PytorchUnofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Stars: ✭ 112 (-17.04%)
SafeoptSafe Bayesian Optimization
Stars: ✭ 90 (-33.33%)
Meta rlThe Tensorflow code and a DeepMind Lab wrapper for my article "Meta-Reinforcement Learning" on FloydHub.
Stars: ✭ 36 (-73.33%)
Malmo ChallengeMalmo Collaborative AI Challenge - Team Pig Catcher
Stars: ✭ 64 (-52.59%)
OpenaigymSolving OpenAI Gym problems.
Stars: ✭ 98 (-27.41%)
Visual Pushing GraspingTrain robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
Stars: ✭ 516 (+282.22%)
TntSimple tools for logging and visualizing, loading and training
Stars: ✭ 1,298 (+861.48%)
Stock Price Trade AnalyzerThis is a Python 3.0 project for analyzing stock prices and methods of stock trading. It uses native Python tools and Google TensorFlow machine learning.
Stars: ✭ 35 (-74.07%)
Holdem🃏 OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning
Stars: ✭ 135 (+0%)
NavbotUsing RGB Image as Visual Input for Mapless Robot Navigation
Stars: ✭ 111 (-17.78%)
Categorical DqnA working implementation of the Categorical DQN (Distributional RL).
Stars: ✭ 90 (-33.33%)
RosettastoneHearthstone simulator using C++ with some reinforcement learning
Stars: ✭ 510 (+277.78%)
Rlai ExercisesExercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]
Stars: ✭ 97 (-28.15%)
SeqganA simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Stars: ✭ 502 (+271.85%)
Nlp overviewOverview of Modern Deep Learning Techniques Applied to Natural Language Processing
Stars: ✭ 1,104 (+717.78%)
ReaverReaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.
Stars: ✭ 499 (+269.63%)
AutokernelAutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。
Stars: ✭ 485 (+259.26%)
Nlg RlAccelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction
Stars: ✭ 59 (-56.3%)
TorchcraftConnecting Torch to StarCraft
Stars: ✭ 1,341 (+893.33%)
Learning2runOur NIPS 2017: Learning to Run source code
Stars: ✭ 57 (-57.78%)
Vowpal wabbitVowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
Stars: ✭ 7,815 (+5688.89%)
Learning Notes💡 Repo of learning notes in DRL and DL, theory, codes, models and notes maybe.
Stars: ✭ 90 (-33.33%)
Minecraft Reinforcement LearningDeep Recurrent Q-Learning vs Deep Q Learning on a simple Partially Observable Markov Decision Process with Minecraft
Stars: ✭ 33 (-75.56%)
NeurojsA JavaScript deep learning and reinforcement learning library.
Stars: ✭ 4,344 (+3117.78%)
TictactoeTic Tac Toe Machine Learning
Stars: ✭ 56 (-58.52%)
Reinforcement LearningImplementation of Reinforcement Learning algorithms in Python, based on Sutton's & Barto's Book (Ed. 2)
Stars: ✭ 55 (-59.26%)
Tetris AiA deep reinforcement learning bot that plays tetris
Stars: ✭ 109 (-19.26%)
TorchrlPytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
Stars: ✭ 90 (-33.33%)
EmdpEasy MDPs and grid worlds with accessible transition dynamics to do exact calculations
Stars: ✭ 31 (-77.04%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (+236.3%)
Torch LightDeep-learning by using Pytorch. Basic nns like Logistic, CNN, RNN, LSTM and some examples are implemented by complex model.
Stars: ✭ 451 (+234.07%)
ReinforcepyCollection of reinforcement learners implemented in python. Mainly including DQN and its variants
Stars: ✭ 54 (-60%)
Pwnagotchi(⌐■_■) - Deep Reinforcement Learning instrumenting bettercap for WiFi pwning.
Stars: ✭ 4,678 (+3365.19%)
Ngsim envLearning human driver models from NGSIM data with imitation learning.
Stars: ✭ 96 (-28.89%)
MapleaiAI各领域学习资料整理。(A collection of all skills and knowledges should be got command of to obtain an AI relevant job offer. There are online blogs, my personal blogs, electronic books copy.)
Stars: ✭ 89 (-34.07%)
Pokerrl OmahaOmaha Poker functionality+some features for PokerRL Reinforcement Learning card framwork
Stars: ✭ 31 (-77.04%)
Policy Gradient MethodsImplementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Stars: ✭ 54 (-60%)
Spot mini miniDynamics and Domain Randomized Gait Modulation with Bezier Curves for Sim-to-Real Legged Locomotion.
Stars: ✭ 426 (+215.56%)
Starcraft AiReinforcement Learning and Transfer Learning based StarCraft Micromanagement
Stars: ✭ 95 (-29.63%)
Gym MinigridMinimalistic gridworld package for OpenAI Gym
Stars: ✭ 1,047 (+675.56%)
Stable BaselinesMirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Stars: ✭ 115 (-14.81%)
Gym PandaAn OpenAI Gym Env for Panda
Stars: ✭ 29 (-78.52%)
Pairstrade Fyp 2019We tested 3 approaches for Pair Trading: distance, cointegration and reinforcement learning approach.
Stars: ✭ 109 (-19.26%)
Hand dapgRepository to accompany RSS 2018 paper on dexterous hand manipulation
Stars: ✭ 88 (-34.81%)
Batch PpoEfficient Batched Reinforcement Learning in TensorFlow
Stars: ✭ 945 (+600%)
MagnetMAGNet: Multi-agents control using Graph Neural Networks
Stars: ✭ 88 (-34.81%)
GymSeoul AI Gym is a toolkit for developing AI algorithms.
Stars: ✭ 27 (-80%)
Keras Rl2Reinforcement learning with tensorflow 2 keras
Stars: ✭ 134 (-0.74%)
AutomataA comprehensive autonomous decentralized systems framework for AI control architects.
Stars: ✭ 130 (-3.7%)