All Projects → aijunbai → taxi

aijunbai / taxi

Licence: GPL-2.0 License
Hierarchical Online Planning and Reinforcement Learning on Taxi

Programming Languages

C++
36643 projects - #6 most used programming language
Gnuplot
187 projects
shell
77523 projects
CMake
9771 projects

Projects that are alternatives of or similar to taxi

urban-and-regional-planning-resources
Community list of data & technology resources concerning the built environment and communities. 🏙️🌳🚌🚦🗺️
Stars: ✭ 109 (+354.17%)
Mutual labels:  planning
mapus
A map tool with real-time collaboration 🗺️
Stars: ✭ 2,687 (+11095.83%)
Mutual labels:  planning
GoPlan-app
An intuitive portfolio mangaer !
Stars: ✭ 27 (+12.5%)
Mutual labels:  planning
scrum-planning-poker
Please feel FREE to try it and give feedback by searching Scrum敏捷估算 in WeChat mini program.
Stars: ✭ 30 (+25%)
Mutual labels:  planning
language-planner
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
Stars: ✭ 84 (+250%)
Mutual labels:  planning
l5kit
L5Kit - https://level-5.global/
Stars: ✭ 683 (+2745.83%)
Mutual labels:  planning
planner
Lightweight, interactive planning tool that visualizes a series of tasks using an HTML canvas
Stars: ✭ 502 (+1991.67%)
Mutual labels:  planning
nuplan-devkit
The devkit of the nuPlan dataset.
Stars: ✭ 107 (+345.83%)
Mutual labels:  planning
pytorch-hdqn
Hierarchical-DQN in pytorch (not actively maintained)
Stars: ✭ 36 (+50%)
Mutual labels:  hierarchical-reinforcement-learning
planning-wiki
By the community, for everyone. Planning.wiki is the online guide to AI Planning
Stars: ✭ 54 (+125%)
Mutual labels:  planning
DnaWeaver
A route planner for DNA assembly
Stars: ✭ 20 (-16.67%)
Mutual labels:  planning
HRAC
PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020).
Stars: ✭ 21 (-12.5%)
Mutual labels:  hierarchical-reinforcement-learning
reinforcement-learning-papers
My notes on reinforcement learning papers
Stars: ✭ 13 (-45.83%)
Mutual labels:  hierarchical-reinforcement-learning
awesome-agi-cocosci
An awesome & curated list for Artificial General Intelligence, an emerging inter-discipline field that combines artificial intelligence and computational cognitive sciences.
Stars: ✭ 81 (+237.5%)
Mutual labels:  planning
Recurrent-Deep-Q-Learning
Solving POMDP using Recurrent networks
Stars: ✭ 52 (+116.67%)
Mutual labels:  mdp
mdp
Make it easy to specify simple MDPs that are compatible with the OpenAI Gym.
Stars: ✭ 30 (+25%)
Mutual labels:  mdp
jpp
Joint Perception and Planning For Efficient Obstacle Avoidance Using Stereo Vision
Stars: ✭ 42 (+75%)
Mutual labels:  planning
plasp
🗺️ ASP planning tools for PDDL
Stars: ✭ 24 (+0%)
Mutual labels:  planning
rl implementations
No description or website provided.
Stars: ✭ 40 (+66.67%)
Mutual labels:  hierarchical-reinforcement-learning
cs7641-assignment4
CS7641 - Machine Learning - Assignment 4 - Markov Decision Processes
Stars: ✭ 14 (-41.67%)
Mutual labels:  mdp

Hierarchical online planning and reinforcement learning on Taxi

Build Status

This release consists of codes for two projects:

  • The MAXQ-based hierarchical online planning algorithm: MAXQ-OP
  • The HAMQ-based hierarchical reinforcement learning algorithm: HAMQ-INT

Taxi domain:

taxi.png

Overall results:

data/reward.png

Averaged over 200 runs.

HAMQ-INT

The idea is to identify and take advantage of internal transitions within a HAM, which is represented as a partial program, for efficient hierarchical reinforcement learning. Details can be found in:

  • Efficient Reinforcement Learning with Hierarchies of Machines by Leveraging Internal Transitions, Aijun Bai, and Stuart Russell, Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI), Melbourne, Australia, August 19 - 25, 2017. [pdf][bib]

MAXQ-OP

This is the code release of MAXQ-OP algorithm on the Taxi domain as described in papers:

Files

  • maxqop.{h, cpp}: the MAXQ-OP algorithm
  • HierarchicalFSMAgent.{h, cpp}: the HAMQ-INT algorithm
  • MaxQ0Agent.{h, cpp}: the MAXQ-0 algorithm
  • MaxQQAgent.{h, cpp}: the MAXQ-Q algorithm
  • agent.h: abstract Agent class
  • state.{h, cpp}: abstract State class
  • policy.{h, cpp}: Policy classes
  • taxi.{h, cpp}: the Taxi domain
  • system.{h, cpp}: agent-environment driver code
  • table.h: tabular V/Q functions
  • dot_graph.{h, cpp}: tools to generate graphviz dot files

Dependencies

  • libboost-dev
  • libboost-program-options-dev
  • gnuplot

Related Projects

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].