Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → wassname → Rl Portfolio Management

wassname / Rl Portfolio Management

Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (and an openai gym environment)

Labels

jupyter-notebook cryptocurrency deep-reinforcement-learning openai-gym

Projects that are alternatives of or similar to Rl Portfolio Management

Source codes for the book "Reinforcement Learning: Theory and Python Implementation"

Stars: ✭ 464 (+3.8%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning, openai-gym

Reinforcementlearning Atarigame

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games

Stars: ✭ 118 (-73.6%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning, openai-gym

Deep Reinforcement Learning For Automated Stock Trading Ensemble Strategy Icaif 2020

Deep Reinforcement Learning for Automated Stock Trading: An Ensemble Strategy. ICAIF 2020. Please star.

Stars: ✭ 518 (+15.88%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning, openai-gym

FinRL: Financial Reinforcement Learning Framework. Please star. 🔥

Stars: ✭ 3,037 (+579.42%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning, openai-gym

Hands On Reinforcement Learning With Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Stars: ✭ 640 (+43.18%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning, openai-gym

Deep Reinforcement Learning

Repo for the Deep Reinforcement Learning Nanodegree program

Stars: ✭ 4,012 (+797.54%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning, openai-gym

FinRL: The first open-source project for financial reinforcement learning. Please star. 🔥

Stars: ✭ 3,497 (+682.33%)

Mutual labels: deep-reinforcement-learning, openai-gym

RAD: Reinforcement Learning with Augmented Data

Stars: ✭ 268 (-40.04%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning

DrQ: Data regularized Q

Stars: ✭ 268 (-40.04%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning

Deep reinforcement learning course

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

Stars: ✭ 3,232 (+623.04%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning

Deep Developmental Reinforcement Learning

Stars: ✭ 27 (-93.96%)

Mutual labels: deep-reinforcement-learning, openai-gym

Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch

Stars: ✭ 272 (-39.15%)

Mutual labels: deep-reinforcement-learning, openai-gym

Grokking Deep Reinforcement Learning

Stars: ✭ 304 (-31.99%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning

A TensorFlow-based framework for learning about and experimenting with reinforcement learning algorithms

Stars: ✭ 20 (-95.53%)

Mutual labels: deep-reinforcement-learning, openai-gym

Autonomous-Drifting

Autonomous Drifting using Reinforcement Learning

Stars: ✭ 83 (-81.43%)

Mutual labels: deep-reinforcement-learning, openai-gym

Cryptocurrency Price Prediction

Cryptocurrency Price Prediction Using LSTM neural network

Stars: ✭ 271 (-39.37%)

Mutual labels: cryptocurrency, jupyter-notebook

Deep Reinforcement Learning Algorithms Implementation in PyTorch

Stars: ✭ 23 (-94.85%)

Mutual labels: deep-reinforcement-learning, openai-gym

Cryptocurrency Analysis Python

Open-Source Tutorial For Analyzing and Visualizing Cryptocurrency Data

Stars: ✭ 278 (-37.81%)

Mutual labels: cryptocurrency, jupyter-notebook

lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.

Stars: ✭ 364 (-18.57%)

Mutual labels: jupyter-notebook, deep-reinforcement-learning

Code for reco-gym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising

Stars: ✭ 314 (-29.75%)

Mutual labels: jupyter-notebook, openai-gym

View All Similar Projects ➔

Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" by Jiang et. al. 2017 [1].

Note2 (20190525): vermouth1992 improved this environment during their final project, I reccomend you start with their repo. Also check out the sagemaker tutorial which is based on vermouth1992's work.

Note1 (2018): the paper's authors have put the official code for the paper up and it works well

tl;dr I managed to get 8% growth on training data, but it disapeared on test data. So I couldn't replicate it. However, RL papers can be very difficult to replicate due to bugs, framework differences, and hyperparameter sensistivity

About

This paper trains an agent to choose a good portfolio of cryptocurrencies. It's reported that it can give 4-fold returns in 50 days and the paper seems to do all the right things so I wanted to see if I could achieve the same results.

This repo includes an environment for portfolio management (with unit tests). Hopefully others will find this usefull as I am not aware of any other implementations (as of 2017-07-17).

Author: wassname

License: AGPLv3

[1] Jiang, Zhengyao, Dixing Xu, and Jinjun Liang. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem." arXiv preprint arXiv:1706.10059 (2017).

Results

I have managed to overfit to the training data with no trading costs but it could not generalise to the test data. So far there have been poor results. I have not yet tried hyperparameter optimisation so it could be that parameter tweaking will allow the model to fit, or I may have subtle bugs.

VPG model,
- training: 190% portfolio growth in 50 days
- testing: 100% portfolio growth in 50 days

This test period is directly after the training period and it looks like the usefullness of the models learned knowledge may decay as it moves away from its training interval.

There are other experiments stored as notebooks in past commits.

Installing

git clone https://github.com/wassname/rl-portfolio-management.git
cd rl-portfolio-management
pip install -r requirements/requirements.txt
jupyter-notebook
- Then open tensorforce-VPG.ipynb in jupyter
- Or try an alternative agent with tensorforce-PPO.ipynb and train

Using the environment

These environments are dervied from the OpenAI environment class which you can learn about in their documentation.

These environments come with 47k steps of training data and 8k test steps. Each step represents 30 minutes. Thanks to reddit user ARRRBEEE for sharing the data.

There are three output options which you can use as follows:

import gym
import rl_portfolio_management.environments  # this registers them

env = gym.envs.spec('CryptoPortfolioEIIE-v0').make()
print("CryptoPortfolioEIIE has an history shape suitable for an EIIE model (see https://arxiv.org/abs/1706.10059)")
observation = env.reset()
print("shape =", observation["history"].shape)
# shape = (5, 50, 3)

env = gym.envs.spec('CryptoPortfolioMLP-v0').make()
print("CryptoPortfolioMLP history has an flat shape for a dense/multi-layer perceptron model")
observation = env.reset()
print("shape =", observation["history"].shape)
# shape = (750,)

env = gym.envs.spec('CryptoPortfolioAtari-v0').make()
print("CryptoPortfolioAtari history has been padded to represent an image so you can reuse models tuned on Atari games")
observation = env.reset()
print("shape =", observation["history"].shape)
# shape = (50, 50, 3)

Or define your own:

import rl_portfolio_management.environments import PortfolioEnv
df_test = pd.read_hdf('./data/poloniex_30m.hf', key='test')
env_test = PortfolioEnv(
  df=df_test,
  steps=256,
  scale=True,
  augment=0.00,
  trading_cost=0.0025,
  time_cost=0.00,
  window_length=50,
  output_mode='mlp'
)

Lets try it with a random agent and plot the results:

import numpy as np
import gym
import rl_portfolio_management.environments  # this registers them

env = gym.envs.spec('CryptoPortfolioMLP-v0').make()
steps = 150
state = env.reset()
for _ in range(steps):
    # The observation contains price history and portfolio weights
    old_portfolio_weights = state["weights"]

    # the action is an array with the new portfolio weights
    # for out action, let's change the weights by around a 20th each step
    action = old_portfolio_weights + np.random.normal(loc=0, scale=1/20., size=(4,))

    # clip and normalize since the portfolio weights should sum to one
    action = np.clip(action, 0, 1)
    action /= action.sum()

    observation, reward, done, info = env.step(action)

    if done:
        break

# plot
env.render('notebook')

Unsuprisingly, a random agent doesn't perform well in portfolio management. If it had chosen to bet on blue then black if could have outperformed any single asset, but hindsight is 20/20.

Plotting

You can run env.render('notebook') or extract a pandas dataframe and plot how you like. To use pandas: pd.DataFrame(gym.unwrapped.infos).

Tests

We have partial test coverage of the environment, just run:

python -m pytest

Files

enviroments/portfolio.py - contains an openai environment for porfolio trading
tensorforce-PPO-IEET.ipynb - notebook to try a policy gradient agent

Differences in implementation

The main differences from Jiang et. al. 2017 are:

The first step in a deep learning project should be to make sure the model can overfit, this provides a sanity check. So I am first trying to acheive good results with no trading costs.
I have not used portfolio vector memory. For ease of implementation I made the information available by using the last weights.
Instead of DPG (deterministic policy gradient) I tried and DDPG (deep deterministic policy gradient) and VPG (vanilla policy gradient) with generalized advantage estimation and PPO.
I tried to replicate the best performing CNN model from the paper and haven't attempted the LSTM or RNN models.
instead of selecting 12 assets for each window I chose 3 assets that have existed for the longest time
~~My topology had an extra layer see issue 3~~ fixed

TODO

See issue #4 and #2 for ideas on where to go from here

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 447

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (11) 🔗