PrompProMP: Proximal Meta-Policy Search
Stars: ✭ 181 (+34.07%)
Awesome Real World RlGreat resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.
Stars: ✭ 234 (+73.33%)
EpgCode for the paper "Evolved Policy Gradients"
Stars: ✭ 204 (+51.11%)
Reinforcement learning tutorial with demoReinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+227.41%)
Modular Rl[ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"
Stars: ✭ 126 (-6.67%)
FiredupClone of OpenAI's Spinning Up in PyTorch
Stars: ✭ 119 (-11.85%)
Rl QuadcopterTeach a Quadcopter How to Fly!
Stars: ✭ 124 (-8.15%)
MetarecPyTorch Implementations For A Series Of Deep Learning-Based Recommendation Models (IN PROGRESS)
Stars: ✭ 120 (-11.11%)
MfrLearning Meta Face Recognition in Unseen Domains, CVPR, Oral, 2020
Stars: ✭ 127 (-5.93%)
Machine learning lecturesCollection of lectures and lab lectures on machine learning and deep learning. Lab practices in Python and TensorFlow.
Stars: ✭ 118 (-12.59%)
Reinforcement learningImplementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.
Stars: ✭ 132 (-2.22%)
HbayesdmHierarchical Bayesian modeling of RLDM tasks, using R & Python
Stars: ✭ 124 (-8.15%)
FewshotnlpThe source codes of the paper "Improving Few-shot Text Classification via Pretrained Language Representations" and "When Low Resource NLP Meets Unsupervised Language Model: Meta-pretraining Then Meta-learning for Few-shot Text Classification".
Stars: ✭ 115 (-14.81%)
Hindsight Experience ReplayThis is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
Stars: ✭ 134 (-0.74%)
Rl MedicalDeep Reinforcement Learning (DRL) agents applied to medical images
Stars: ✭ 123 (-8.89%)
Startcraft pysc2 minigamesStartcraft II Machine Learning research with DeepMind pysc2 python library .mini-games and agents.
Stars: ✭ 113 (-16.3%)
Ctc ExecutionerMaster Thesis: Limit order placement with Reinforcement Learning
Stars: ✭ 112 (-17.04%)
NavbotUsing RGB Image as Visual Input for Mapless Robot Navigation
Stars: ✭ 111 (-17.78%)
AixijsAIXIjs - General Reinforcement Learning in the Browser
Stars: ✭ 128 (-5.19%)
MultiagenttorcsThe multi-agent version of TORCS for developing control algorithms for fully autonomous driving in the cluttered, multi-agent settings of everyday life.
Stars: ✭ 122 (-9.63%)
Pairstrade Fyp 2019We tested 3 approaches for Pair Trading: distance, cointegration and reinforcement learning approach.
Stars: ✭ 109 (-19.26%)
Metar CnnMeta R-CNN : Towards General Solver for Instance-level Low-shot Learning
Stars: ✭ 120 (-11.11%)
BanditmlA lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
Stars: ✭ 127 (-5.93%)
Boml Bilevel Optimization Library in Python for Multi-Task and Meta Learning
Stars: ✭ 120 (-11.11%)
Saltie🚗 Rocket League Distributed Deep Reinforcement Learning Bot
Stars: ✭ 134 (-0.74%)
Rl Collision AvoidanceImplementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"
Stars: ✭ 125 (-7.41%)
Reinforcementlearning AtarigamePytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games
Stars: ✭ 118 (-12.59%)
Holdem🃏 OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning
Stars: ✭ 135 (+0%)
Srl ZooState Representation Learning (SRL) zoo with PyTorch - Part of S-RL Toolbox
Stars: ✭ 125 (-7.41%)
Move37Coding Demos from the School of AI's Move37 Course
Stars: ✭ 130 (-3.7%)
Stable BaselinesMirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Stars: ✭ 115 (-14.81%)
KeitaMy personal toolkit for PyTorch development.
Stars: ✭ 124 (-8.15%)
Coursera reinforcement learningCoursera Reinforcement Learning Specialization by University of Alberta & Alberta Machine Intelligence Institute
Stars: ✭ 114 (-15.56%)
Reinforcement learning in pythonImplementing Reinforcement Learning, namely Q-learning and Sarsa algorithms, for global path planning of mobile robot in unknown environment with obstacles. Comparison analysis of Q-learning and Sarsa
Stars: ✭ 134 (-0.74%)
Doom Net PytorchReinforcement learning models in ViZDoom environment
Stars: ✭ 113 (-16.3%)
Mlds2018springMachine Learning and having it Deep and Structured (MLDS) in 2018 spring
Stars: ✭ 124 (-8.15%)
StudybookStudy E-Book(ComputerVision DeepLearning MachineLearning Math NLP Python ReinforcementLearning)
Stars: ✭ 1,457 (+979.26%)
AutomataA comprehensive autonomous decentralized systems framework for AI control architects.
Stars: ✭ 130 (-3.7%)
Handful Of Trials PytorchUnofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Stars: ✭ 112 (-17.04%)
Snake Ai ReinforcementAI for Snake game trained from pixels using Deep Reinforcement Learning (DQN).
Stars: ✭ 123 (-8.89%)
Meta BlocksA modular toolbox for meta-learning research with a focus on speed and reproducibility.
Stars: ✭ 110 (-18.52%)
CanetThe code for paper "CANet: Class-Agnostic Segmentation Networks with Iterative Refinement and Attentive Few-Shot Learning"
Stars: ✭ 135 (+0%)
What I Have ReadPaper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
Stars: ✭ 110 (-18.52%)
Numpy MlMachine learning, in numpy
Stars: ✭ 11,100 (+8122.22%)
MojitalkCode for "MojiTalk: Generating Emotional Responses at Scale" https://arxiv.org/abs/1711.04090
Stars: ✭ 107 (-20.74%)
ToycarirlImplementation of Inverse Reinforcement Learning Algorithm on a toy car in a 2D world problem, (Apprenticeship Learning via Inverse Reinforcement Learning Abbeel & Ng, 2004)
Stars: ✭ 128 (-5.19%)
CartpoleOpenAI's cartpole env solver.
Stars: ✭ 107 (-20.74%)
Lang Emerge ParlaiImplementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI
Stars: ✭ 106 (-21.48%)
Pytorch RlTutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Stars: ✭ 121 (-10.37%)
Policy GradientMinimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
Stars: ✭ 135 (+0%)