Policy Gradient MethodsImplementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Stars: ✭ 54 (-97.95%)
ParlA high-performance distributed training framework for Reinforcement Learning
Stars: ✭ 2,348 (-10.79%)
Gym ContinuousdoubleauctionA custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.
Stars: ✭ 50 (-98.1%)
Drl paper summarySummary of key papers in deep reinforcement learning. Heavily based on OpenAI SpinningUp.
Stars: ✭ 49 (-98.14%)
PomdpyPOMDPs in Python.
Stars: ✭ 183 (-93.05%)
GbrainGPU Javascript Library for Machine Learning
Stars: ✭ 48 (-98.18%)
World Models Sonic PytorchAttempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more work needed
Stars: ✭ 27 (-98.97%)
NavbotUsing RGB Image as Visual Input for Mapless Robot Navigation
Stars: ✭ 111 (-95.78%)
DoyouevenlearnEssential Guide to keep up with AI/ML/DL/CV
Stars: ✭ 913 (-65.31%)
Rl tradingAn environment to high-frequency trading agents under reinforcement learning
Stars: ✭ 205 (-92.21%)
Tensorflow rlreReinforcement Learning for Relation Classification from Noisy Data(TensorFlow)
Stars: ✭ 150 (-94.3%)
HawqQuantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
Stars: ✭ 108 (-95.9%)
Gym GridworldsGridworld environments for OpenAI gym.
Stars: ✭ 43 (-98.37%)
MuzeroA structured implementation of MuZero
Stars: ✭ 156 (-94.07%)
Machine Learning From ScratchSuccinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
Stars: ✭ 42 (-98.4%)
PrompProMP: Proximal Meta-Policy Search
Stars: ✭ 181 (-93.12%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-98.44%)
SenseactSenseAct: A computational framework for developing real-world robot learning tasks
Stars: ✭ 153 (-94.19%)
Pairstrade Fyp 2019We tested 3 approaches for Pair Trading: distance, cointegration and reinforcement learning approach.
Stars: ✭ 109 (-95.86%)
Energy PyReinforcement learning for energy systems
Stars: ✭ 148 (-94.38%)
A3c PytorchPyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch
Stars: ✭ 108 (-95.9%)
Numpy MlMachine learning, in numpy
Stars: ✭ 11,100 (+321.73%)
Stable BaselinesMirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Stars: ✭ 115 (-95.63%)
Iccv2019 LearningtopaintICCV2019 - A painting AI that can reproduce paintings stroke by stroke using deep reinforcement learning.
Stars: ✭ 1,995 (-24.2%)
AilearnnotesArtificial Intelligence Learning Notes.
Stars: ✭ 195 (-92.59%)
Reinforce.jlAbstractions, algorithms, and utilities for reinforcement learning in Julia
Stars: ✭ 178 (-93.24%)
DeeptrafficDeepTraffic is a deep reinforcement learning competition, part of the MIT Deep Learning series.
Stars: ✭ 1,528 (-41.95%)
Stock Price Trade AnalyzerThis is a Python 3.0 project for analyzing stock prices and methods of stock trading. It uses native Python tools and Google TensorFlow machine learning.
Stars: ✭ 35 (-98.67%)
AdahessianADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning
Stars: ✭ 114 (-95.67%)
Andrew Ng NotesThis is Andrew NG Coursera Handwritten Notes.
Stars: ✭ 180 (-93.16%)
Minecraft Reinforcement LearningDeep Recurrent Q-Learning vs Deep Q Learning on a simple Partially Observable Markov Decision Process with Minecraft
Stars: ✭ 33 (-98.75%)
EmdpEasy MDPs and grid worlds with accessible transition dynamics to do exact calculations
Stars: ✭ 31 (-98.82%)
MangoA high-performance, open-source java RPC framework.
Stars: ✭ 150 (-94.3%)
StudybookStudy E-Book(ComputerVision DeepLearning MachineLearning Math NLP Python ReinforcementLearning)
Stars: ✭ 1,457 (-44.64%)
GymSeoul AI Gym is a toolkit for developing AI algorithms.
Stars: ✭ 27 (-98.97%)
Handful Of Trials PytorchUnofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Stars: ✭ 112 (-95.74%)
Awesome Ai In Finance🔬 A curated list of awesome machine learning strategies & tools in financial market.
Stars: ✭ 910 (-65.43%)
TradzqaiTrading environnement for RL agents, backtesting and training.
Stars: ✭ 150 (-94.3%)
Gym DartOpenAI Gym environments using DART
Stars: ✭ 20 (-99.24%)
Tetris AiA deep reinforcement learning bot that plays tetris
Stars: ✭ 109 (-95.86%)
AcisActor-Critic Instance Segmentation (CVPR 2019)
Stars: ✭ 15 (-99.43%)
Gail TfTensorflow implementation of generative adversarial imitation learning
Stars: ✭ 179 (-93.2%)
MojitalkCode for "MojiTalk: Generating Emotional Responses at Scale" https://arxiv.org/abs/1711.04090
Stars: ✭ 107 (-95.93%)
Go Bot DrlGoal-Oriented Chatbot trained with Deep Reinforcement Learning
Stars: ✭ 149 (-94.34%)
CartpoleOpenAI's cartpole env solver.
Stars: ✭ 107 (-95.93%)
Lang Emerge ParlaiImplementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI
Stars: ✭ 106 (-95.97%)
Open QuadrupedAn open-source 3D-printed quadrupedal robot. Intuitive gait generation through 12-DOF Bezier Curves. Full 6-axis body pose manipulation. Custom 3DOF Leg Inverse Kinematics Model accounting for offsets.
Stars: ✭ 148 (-94.38%)
Sofa HessianAn internal improved version of Hessian powered by Ant Financial.
Stars: ✭ 105 (-96.01%)
Macad GymMulti-Agent Connected Autonomous Driving (MACAD) Gym environments for Deep RL. Code for the paper presented in the Machine Learning for Autonomous Driving Workshop at NeurIPS 2019:
Stars: ✭ 106 (-95.97%)
Chanlun文件 笔和线段的一种划分.py,只需要把k线high,low数据输入,就能自动实现笔,线段,中枢,买卖点,走势类型的划分了。可以把sh.csv 作为输入文件。个人简历见.pdf。时间的力量。有人说择时很困难,有人说选股很容易,有人说统计套利需要的IT配套设施很重要。还有人说系统有不可测原理。众说纷纭。分布式的系统,当你的影响可以被忽略,你才能实现,Jiang主席所谓之,闷声发大财。
Stars: ✭ 206 (-92.17%)