GbrainGPU Javascript Library for Machine Learning
Stars: ✭ 48 (-51.02%)
TraxTrax — Deep Learning with Clear Code and Speed
Stars: ✭ 6,666 (+6702.04%)
DherDHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
Stars: ✭ 48 (-51.02%)
SoftlearningSoftlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Stars: ✭ 713 (+627.55%)
Grid2opGrid2Op a testbed platform to model sequential decision making in power systems.
Stars: ✭ 91 (-7.14%)
Rlseq2seqDeep Reinforcement Learning For Sequence to Sequence Models
Stars: ✭ 683 (+596.94%)
MujocounityReproducing MuJoCo benchmarks in a modern, commercial game /physics engine (Unity + PhysX).
Stars: ✭ 47 (-52.04%)
Highway EnvA minimalist environment for decision-making in autonomous driving
Stars: ✭ 674 (+587.76%)
SmacSMAC: The StarCraft Multi-Agent Challenge
Stars: ✭ 435 (+343.88%)
GibsonenvGibson Environments: Real-World Perception for Embodied Agents
Stars: ✭ 666 (+579.59%)
Pytorch RlPyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Stars: ✭ 658 (+571.43%)
Reinforcement LearningReinforcement learning material, code and exercises for Udacity Nanodegree programs.
Stars: ✭ 77 (-21.43%)
Dl Nlp ReadingsMy Reading Lists of Deep Learning and Natural Language Processing
Stars: ✭ 656 (+569.39%)
Stable Baselines3PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Stars: ✭ 1,263 (+1188.78%)
HuskarlDeep Reinforcement Learning Framework + Algorithms
Stars: ✭ 414 (+322.45%)
EmdpEasy MDPs and grid worlds with accessible transition dynamics to do exact calculations
Stars: ✭ 31 (-68.37%)
NeuraldialogpapersSummary of deep learning models for dialog systems (Tiancheng Zhao LTI, CMU)
Stars: ✭ 641 (+554.08%)
UnrealReinforcement learning with unsupervised auxiliary tasks
Stars: ✭ 390 (+297.96%)
DrlkitA High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Stars: ✭ 29 (-70.41%)
Mabalgs👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
Stars: ✭ 67 (-31.63%)
RlenvsReinforcement learning environments for Torch7
Stars: ✭ 94 (-4.08%)
SafeoptSafe Bayesian Optimization
Stars: ✭ 90 (-8.16%)
CausalworldCausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning
Stars: ✭ 76 (-22.45%)
DeepdriveDeepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
Stars: ✭ 628 (+540.82%)
GymSeoul AI Gym is a toolkit for developing AI algorithms.
Stars: ✭ 27 (-72.45%)
Personae📈 Personae is a repo of implements and environment of Deep Reinforcement Learning & Supervised Learning for Quantitative Trading.
Stars: ✭ 1,140 (+1063.27%)
RecnnReinforced Recommendation toolkit built around pytorch 1.7
Stars: ✭ 362 (+269.39%)
Awesome Ai In Finance🔬 A curated list of awesome machine learning strategies & tools in financial market.
Stars: ✭ 910 (+828.57%)
Ngsim envLearning human driver models from NGSIM data with imitation learning.
Stars: ✭ 96 (-2.04%)
Rl Chatbot🤖 Deep Reinforcement Learning Chatbot
Stars: ✭ 357 (+264.29%)
FlowComputational framework for reinforcement learning in traffic control
Stars: ✭ 622 (+534.69%)
Drl papernotesNotes and comments about Deep Reinforcement Learning papers
Stars: ✭ 65 (-33.67%)
CleanrlHigh-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features
Stars: ✭ 349 (+256.12%)
AcisActor-Critic Instance Segmentation (CVPR 2019)
Stars: ✭ 15 (-84.69%)
Rl ardroneAutonomous Navigation of UAV using Reinforcement Learning algorithms.
Stars: ✭ 76 (-22.45%)
Amazon Sagemaker ExamplesExample 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Stars: ✭ 6,346 (+6375.51%)
Tf2rlTensorFlow2 Reinforcement Learning
Stars: ✭ 353 (+260.2%)
Pytorch A3cPyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Stars: ✭ 879 (+796.94%)
Arxivtimesrepository to research & share the machine learning articles
Stars: ✭ 3,651 (+3625.51%)
MaxCode for reproducing experiments in Model-Based Active Exploration, ICML 2019
Stars: ✭ 61 (-37.76%)
ExposureLearning infinite-resolution image processing with GAN and RL from unpaired image datasets, using a differentiable photo editing model.
Stars: ✭ 605 (+517.35%)
TntSimple tools for logging and visualizing, loading and training
Stars: ✭ 1,298 (+1224.49%)
OrbitOpen source collection of Reinforcement Learning Environments.
Stars: ✭ 44 (-55.1%)
Reversi Alpha ZeroReversi reinforcement learning by AlphaGo Zero methods.
Stars: ✭ 598 (+510.2%)
Habitat LabA modular high-level library to train embodied AI agents across a variety of tasks, environments, and simulators.
Stars: ✭ 587 (+498.98%)
Fast abs rlCode for ACL 2018 paper: "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting. Chen and Bansal"
Stars: ✭ 569 (+480.61%)