ElfAn End-To-End, Lightweight and Flexible Platform for Game Research
Stars: ✭ 2,057 (+544.83%)
Data Science ToolkitCollection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-47.02%)
deep-rtsA Real-Time-Strategy game for Deep Learning research
Stars: ✭ 152 (-52.35%)
A2cA Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Stars: ✭ 169 (-47.02%)
Gym Pybullet DronesPyBullet Gym environments for single and multi-agent reinforcement learning of quadcopter control
Stars: ✭ 168 (-47.34%)
chiA high-level framework for advanced deep learning with TensorFlow
Stars: ✭ 55 (-82.76%)
Dist-A3CDistributed A3C
Stars: ✭ 37 (-88.4%)
TF RLEagerly Experimentable!!!
Stars: ✭ 22 (-93.1%)
AcmeA library of reinforcement learning components and agents
Stars: ✭ 2,441 (+665.2%)
Awesome Ml CoursesAwesome free machine learning and AI courses with video lectures.
Stars: ✭ 2,145 (+572.41%)
Dreamerv2Mastering Atari with Discrete World Models
Stars: ✭ 287 (-10.03%)
Popular Rl AlgorithmsPyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Stars: ✭ 266 (-16.61%)
DeepBeerInventory-RLThe code for the SRDQN algorithm to train an agent for the beer game problem
Stars: ✭ 27 (-91.54%)
ddpg bipedRepository for Planar Bipedal walking robot in Gazebo environment using Deep Deterministic Policy Gradient(DDPG) using TensorFlow.
Stars: ✭ 65 (-79.62%)
CoaxThis project was moved to: https://github.com/coax-dev/coax
Stars: ✭ 166 (-47.96%)
CoachReinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
Stars: ✭ 2,085 (+553.61%)
mmnMoore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
Stars: ✭ 39 (-87.77%)
Rl Baselines3 ZooA collection of pre-trained RL agents using Stable Baselines3, training and hyperparameter optimization included.
Stars: ✭ 161 (-49.53%)
MindparkTestbed for deep reinforcement learning
Stars: ✭ 163 (-48.9%)
decentralized-rlDecentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)
Stars: ✭ 40 (-87.46%)
MjrlReinforcement learning algorithms for MuJoCo tasks
Stars: ✭ 162 (-49.22%)
Awesome AiA curated list of artificial intelligence resources (Courses, Tools, App, Open Source Project)
Stars: ✭ 161 (-49.53%)
pomdp-baselinesSimple (but often Strong) Baselines for POMDPs in PyTorch - ICML 2022
Stars: ✭ 162 (-49.22%)
Tf Adnet TrackingDeep Object Tracking Implementation in Tensorflow for 'Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning(CVPR 2017)'
Stars: ✭ 162 (-49.22%)
dqn zooThe implement of all kinds of dqn reinforcement learning with Pytorch
Stars: ✭ 42 (-86.83%)
Deep CfrScalable Implementation of Deep CFR and Single Deep CFR
Stars: ✭ 158 (-50.47%)
ParlA high-performance distributed training framework for Reinforcement Learning
Stars: ✭ 2,348 (+636.05%)
ResourcesResources on various topics being worked on at IvLabs
Stars: ✭ 158 (-50.47%)
MuzeroA structured implementation of MuZero
Stars: ✭ 156 (-51.1%)
SenseactSenseAct: A computational framework for developing real-world robot learning tasks
Stars: ✭ 153 (-52.04%)
Iccv2019 LearningtopaintICCV2019 - A painting AI that can reproduce paintings stroke by stroke using deep reinforcement learning.
Stars: ✭ 1,995 (+525.39%)
MachinelearningroguelikeA small Roguelike game that uses Machine Learning to power its entities. Originally used in talks by Ciro & Alessia.
Stars: ✭ 270 (-15.36%)
Gym FxForex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress
Stars: ✭ 151 (-52.66%)
TradzqaiTrading environnement for RL agents, backtesting and training.
Stars: ✭ 150 (-52.98%)
Tensorflow rlreReinforcement Learning for Relation Classification from Noisy Data(TensorFlow)
Stars: ✭ 150 (-52.98%)
RamudroidRamudroid, autonomous solar-powered robot to clean roads, realtime object detection and webrtc based streaming
Stars: ✭ 22 (-93.1%)
Energy PyReinforcement learning for energy systems
Stars: ✭ 148 (-53.61%)
Pytorch-PCGradPytorch reimplementation for "Gradient Surgery for Multi-Task Learning"
Stars: ✭ 179 (-43.89%)
Open QuadrupedAn open-source 3D-printed quadrupedal robot. Intuitive gait generation through 12-DOF Bezier Curves. Full 6-axis body pose manipulation. Custom 3DOF Leg Inverse Kinematics Model accounting for offsets.
Stars: ✭ 148 (-53.61%)
interp-e2e-drivingInterpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Stars: ✭ 159 (-50.16%)
DreamerDream to Control: Learning Behaviors by Latent Imagination
Stars: ✭ 269 (-15.67%)
Chess Alpha ZeroChess reinforcement learning by AlphaGo Zero methods.
Stars: ✭ 1,868 (+485.58%)
Show Adapt And TellCode for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Stars: ✭ 146 (-54.23%)
alphastoneUsing self-play, MCTS, and a deep neural network to create a hearthstone ai player
Stars: ✭ 24 (-92.48%)
Rl Book Challengeself-studying the Sutton & Barto the hard way
Stars: ✭ 146 (-54.23%)
Tensor2tensorLibrary of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Stars: ✭ 11,865 (+3619.44%)
cups-rlCustomisable Unified Physical Simulations (CUPS) for Reinforcement Learning. Experiments run on the ai2thor environment (http://ai2thor.allenai.org/) e.g. using A3C, RainbowDQN and A3C_GA (Gated Attention multi-modal fusion) for Task-Oriented Language Grounding (tasks specified by natural language instructions) e.g. "Pick up the Cup or else"
Stars: ✭ 38 (-88.09%)
Sumo RlA simple interface to instantiate Reinforcement Learning environments with SUMO for Traffic Signal Control. Compatible with Gym Env from OpenAI and MultiAgentEnv from RLlib.
Stars: ✭ 145 (-54.55%)
rl tradingNo description or website provided.
Stars: ✭ 14 (-95.61%)
AllenactAn open source framework for research in Embodied-AI from AI2.
Stars: ✭ 144 (-54.86%)