doerlbh / mentalRL

Licence: other

Code for our AAMAS 2020 paper: "A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry".

Programming Languages

Jupyter Notebook

11667 projects

python

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to mentalRL

Reinforcement learning in python

Implementing Reinforcement Learning, namely Q-learning and Sarsa algorithms, for global path planning of mobile robot in unknown environment with obstacles. Comparison analysis of Q-learning and Sarsa

Stars: ✭ 134 (+509.09%)

Mutual labels: q-learning

Data Science Free

Free Resources For Data Science created by Shubham Kumar

Stars: ✭ 232 (+954.55%)

Mutual labels: q-learning

Flow-Shop-Scheduling-Based-On-Reinforcement-Learning-Algorithm

Operations Research Application Project - Flow Shop Scheduling Based On Reinforcement Learning Algorithm

Stars: ✭ 73 (+231.82%)

Mutual labels: q-learning

Gym Fx

Forex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress

Stars: ✭ 151 (+586.36%)

Mutual labels: q-learning

Rl trading

An environment to high-frequency trading agents under reinforcement learning

Stars: ✭ 205 (+831.82%)

Mutual labels: q-learning

Grid royale

A life simulation for exploring social dynamics

Stars: ✭ 252 (+1045.45%)

Mutual labels: q-learning

Tetris Ai

A deep reinforcement learning bot that plays tetris

Stars: ✭ 109 (+395.45%)

Mutual labels: q-learning

DRL in CV

A course on Deep Reinforcement Learning in Computer Vision. Visit Website:

Stars: ✭ 59 (+168.18%)

Mutual labels: q-learning

Deep Rl Trading

playing idealized trading games with deep reinforcement learning

Stars: ✭ 228 (+936.36%)

Mutual labels: q-learning

pacman-ai

A.I. plays the original 1980 Pacman using Neuroevolution of Augmenting Topologies and Deep Q Learning

Stars: ✭ 26 (+18.18%)

Mutual labels: q-learning

Accel Brain Code

The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.

Stars: ✭ 166 (+654.55%)

Mutual labels: q-learning

Deep Math Machine Learning.ai

A blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.

Stars: ✭ 173 (+686.36%)

Mutual labels: q-learning

king-pong

Deep Reinforcement Learning Pong Agent, King Pong, he's the best

Stars: ✭ 23 (+4.55%)

Mutual labels: q-learning

Deep Qlearning Agent For Traffic Signal Control

A framework where a deep Q-Learning Reinforcement Learning agent tries to choose the correct traffic light phase at an intersection to maximize traffic efficiency.

Stars: ✭ 136 (+518.18%)

Mutual labels: q-learning

LearnSnake

🐍 AI that learns to play Snake using Q-Learning (Reinforcement Learning)

Stars: ✭ 69 (+213.64%)

Mutual labels: q-learning

Ctc Executioner

Master Thesis: Limit order placement with Reinforcement Learning

Stars: ✭ 112 (+409.09%)

Mutual labels: q-learning

Learningx

Deep & Classical Reinforcement Learning + Machine Learning Examples in Python

Stars: ✭ 241 (+995.45%)

Mutual labels: q-learning

Implicit-Q-Learning

PyTorch implementation of the implicit Q-learning algorithm (IQL)

Stars: ✭ 27 (+22.73%)

Mutual labels: q-learning

Multiagent-RL

Multiagent reinforcement learning simulation framework - Undergraduate thesis in Mechatronics Engineering at the University of Brasília

Stars: ✭ 59 (+168.18%)

Mutual labels: q-learning

Deep-Reinforcement-Learning-With-Python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Stars: ✭ 222 (+909.09%)

Mutual labels: q-learning

View All Similar Projects ➔

mentalRL

(image credit to HBR)

Code for our AAMAS 2020 paper:

"A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry"

by Baihan Lin (Columbia)*, Guillermo Cecchi (IBM Research), Djallel Bouneffouf (IBM Research), Jenna Reinen (IBM Research) and Irina Rish (Mila, UdeM).

*Corresponding

For the latest full paper: https://arxiv.org/abs/1906.11286

For my oral talk at AAMAS 2020: https://youtu.be/CQBdQz1bmls

All the experimental results can be reproduced using the code in this repository. Feel free to contact me by [email protected] if you have any question about our work.

Abstract

Drawing an inspiration from behavioral studies of human decision making, we propose here a more general and flexible parametric framework for reinforcement learning that extends standard Q-learning to a two-stream model for processing positive and negative rewards, and allows to incorporate a wide range of reward-processing biases -- an important component of human decision making which can help us better understand a wide spectrum of multi-agent interactions in complex real-world socioeconomic systems, as well as various neuropsychiatric conditions associated with disruptions in normal reward processing. From the computational perspective, we observe that the proposed Split-QL model and its clinically inspired variants consistently outperform standard Q-Learning and SARSA methods, as well as recently proposed Double Q-Learning approaches, on simulated tasks with particular reward distributions, a real-world dataset capturing human decision-making in gambling tasks, and the Pac-Man game in a lifelong learning setting across different reward stationarities.

Info

Language: Python3, Python2, bash

Platform: MacOS, Linux, Windows

by Baihan Lin, Sep 2018

Citation

If you find this work helpful, please try the models out and cite our works. Thanks!

Reinforcement Learning case (main paper):

@inproceedings{lin2020astory,
  title={A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry},
  author={Lin, Baihan and Cecchi, Guillermo and Bouneffouf, Djallel and Reinen, Jenna and Rish, Irina},
  booktitle = {Proceedings of the Nineteenth International Conference on Autonomous Agents and Multi-Agent Systems, {AAMAS-20}},
  publisher = {International Foundation for Autonomous Agents and Multiagent Systems},             
  pages     = {744-752},
  year      = {2020},
  month     = {5},
  doi       = {},
  url       = {},
}


@inproceedings{lin2019split,
  title     = {Split Q Learning: Reinforcement Learning with Two-Stream Rewards},
  author    = {Lin, Baihan and Bouneffouf, Djallel and Cecchi, Guillermo},
  booktitle = {Proceedings of the Twenty-Eighth International Joint Conference on
               Artificial Intelligence, {IJCAI-19}},
  publisher = {International Joint Conferences on Artificial Intelligence Organization},             
  pages     = {6448--6449},
  year      = {2019},
  month     = {7},
}

Contextual Bandit case:

@article{lin2020unified,
  title={Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits, and RL},
  author={Lin, Baihan and Cecchi, Guillermo and Bouneffouf, Djallel and Reinen, Jenna and Rish, Irina},
  journal={arXiv preprint arXiv:2005.04544},
  year={2020}
}

Tasks

Markov Decision Process (MDP) example with multi-modal reward distributions
Multi-Armed Bandits (MAB) example with multi-modal reward distributions
Iowa Gambling Task (IGT) example scheme 1 and 2
PacMan RL game with different stationarities

Requirements

Python 3 for MDP and IGT tasks, and Python 2.7 for PacMan task.
PyTorch
numpy and scikit-learn

Videos of mental agents playing PacMan

AD ("Alzheimer's Disease")

ADD ("addition")

ADHD ("ADHD")

bvFTD (the behavioral variant of Frontotemporal dementia)

CP ("Chronic Pain")

PD ("Parkinson's Disease")

M ("moderate")

SQL ("Split Q-Learning")

PQL ("Positive Q-Learning")

NQL ("Negative Q-Learning")

QL ("Q-Learning")

DQL ("Double Q-Learning")

Acknowledgements

The PacMan game was built upon Berkeley AI Pac-Man http://ai.berkeley.edu/project_overview.html. We modify many of the original files and included our comparison.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

doerlbh / mentalRL

Programming Languages

Labels

Projects that are alternatives of or similar to mentalRL

mentalRL

Info

Citation

Tasks

Requirements

Videos of mental agents playing PacMan

Acknowledgements