An awesome & curated list for Artificial General Intelligence, an emerging inter-discipline field that combines artificial intelligence and computational cognitive sciences.

Stars: ✭ 81 (+237.5%)

Mutual labels: planning

Recurrent-Deep-Q-Learning

Solving POMDP using Recurrent networks

Stars: ✭ 52 (+116.67%)

Mutual labels: mdp

mdp

Make it easy to specify simple MDPs that are compatible with the OpenAI Gym.

Stars: ✭ 30 (+25%)

Mutual labels: mdp

jpp

Joint Perception and Planning For Efficient Obstacle Avoidance Using Stereo Vision

Stars: ✭ 42 (+75%)

Mutual labels: planning

plasp

🗺️ ASP planning tools for PDDL

Stars: ✭ 24 (+0%)

Mutual labels: planning

rl implementations

No description or website provided.

Stars: ✭ 40 (+66.67%)

Mutual labels: hierarchical-reinforcement-learning

cs7641-assignment4

CS7641 - Machine Learning - Assignment 4 - Markov Decision Processes

Stars: ✭ 14 (-41.67%)

Mutual labels: mdp

View All Similar Projects ➔

Hierarchical online planning and reinforcement learning on Taxi

This release consists of codes for two projects:

The MAXQ-based hierarchical online planning algorithm: MAXQ-OP
The HAMQ-based hierarchical reinforcement learning algorithm: HAMQ-INT

Taxi domain:

Overall results:

Averaged over 200 runs.

HAMQ-INT

The idea is to identify and take advantage of internal transitions within a HAM, which is represented as a partial program, for efficient hierarchical reinforcement learning. Details can be found in:

Efficient Reinforcement Learning with Hierarchies of Machines by Leveraging Internal Transitions, Aijun Bai, and Stuart Russell, Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI), Melbourne, Australia, August 19 - 25, 2017. [pdf][bib]

MAXQ-OP

This is the code release of MAXQ-OP algorithm on the Taxi domain as described in papers:

Online planning for large Markov decision processes with hierarchical decomposition, Aijun Bai, Feng Wu, and Xiaoping Chen, ACM Transactions on Intelligent Systems and Technology (ACM TIST),6(4):45:1-45:28, July 2015.
Online Planning for Large MDPs with MAXQ Decomposition (Extended Abstract), Aijun Bai, Feng Wu, and Xiaoping Chen, Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Valencia, Spain, June 2012.

Files

maxqop.{h, cpp}: the MAXQ-OP algorithm
HierarchicalFSMAgent.{h, cpp}: the HAMQ-INT algorithm
MaxQ0Agent.{h, cpp}: the MAXQ-0 algorithm
MaxQQAgent.{h, cpp}: the MAXQ-Q algorithm
agent.h: abstract Agent class
state.{h, cpp}: abstract State class
policy.{h, cpp}: Policy classes
taxi.{h, cpp}: the Taxi domain
system.{h, cpp}: agent-environment driver code
table.h: tabular V/Q functions
dot_graph.{h, cpp}: tools to generate graphviz dot files

Dependencies

libboost-dev
libboost-program-options-dev
gnuplot

Related Projects

MAXQ-OP on RoboCup Soccer Simulation 2D Challenge: https://github.com/wrighteagle2d/wrighteaglebase
Concurrent HAMQ on RoboCup Soccer Simulation 2D Keepaway Challenge: https://github.com/aijunbai/keepaway

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

aijunbai / taxi

Programming Languages

Labels

Projects that are alternatives of or similar to taxi

Hierarchical online planning and reinforcement learning on Taxi

HAMQ-INT

MAXQ-OP

Files

Dependencies

Related Projects