All Projects → openai → Imitation

openai / Imitation

Licence: mit
Code for the paper "Generative Adversarial Imitation Learning"

Programming Languages

python
139335 projects - #7 most used programming language

Labels

Projects that are alternatives of or similar to Imitation

Paper For Mac
🖥 Unofficial Dropbox Paper client for macOS
Stars: ✭ 427 (-23.06%)
Mutual labels:  paper
Nlp Paper
NLP Paper
Stars: ✭ 484 (-12.79%)
Mutual labels:  paper
Cvpr 2019 Paper Statistics
Statistics and Visualization of acceptance rate, main keyword of CVPR 2019 accepted papers for the main Computer Vision conference (CVPR)
Stars: ✭ 527 (-5.05%)
Mutual labels:  paper
Cvpr2021 Papers With Code
CVPR 2021 论文和开源项目合集
Stars: ✭ 7,138 (+1186.13%)
Mutual labels:  paper
Conditional Pixelcnn Decoder
Tensorflow implementation of Gated Conditional Pixel Convolutional Neural Network
Stars: ✭ 479 (-13.69%)
Mutual labels:  paper
Daily Paper Computer Vision
记录每天整理的计算机视觉/深度学习/机器学习相关方向的论文
Stars: ✭ 4,977 (+796.76%)
Mutual labels:  paper
Yatopia
The Most Powerful and Feature Rich Minecraft Server Software!
Stars: ✭ 408 (-26.49%)
Mutual labels:  paper
Hugo Paper
🥛 A simple, clean, flexible Hugo theme
Stars: ✭ 538 (-3.06%)
Mutual labels:  paper
Rgan
Recurrent (conditional) generative adversarial networks for generating real-valued time series data.
Stars: ✭ 480 (-13.51%)
Mutual labels:  paper
Qlib
Qlib is an AI-oriented quantitative investment platform, which aims to realize the potential, empower the research, and create the value of AI technologies in quantitative investment. With Qlib, you can easily try your ideas to create better Quant investment strategies. An increasing number of SOTA Quant research works/papers are released in Qlib.
Stars: ✭ 7,582 (+1266.13%)
Mutual labels:  paper
Jukebox
Code for the paper "Jukebox: A Generative Model for Music"
Stars: ✭ 4,863 (+776.22%)
Mutual labels:  paper
Iaf
Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"
Stars: ✭ 468 (-15.68%)
Mutual labels:  paper
Arxiv Style
A Latex style and template for paper preprints (based on NIPS style)
Stars: ✭ 497 (-10.45%)
Mutual labels:  paper
Research Method
论文写作与资料分享
Stars: ✭ 436 (-21.44%)
Mutual labels:  paper
Paper
On self sovereign human identity.
Stars: ✭ 537 (-3.24%)
Mutual labels:  paper
Knowledge Distillation Papers
knowledge distillation papers
Stars: ✭ 422 (-23.96%)
Mutual labels:  paper
Mohist
Minecraft Forge Hybrid server implementing the Paper/Spigot/Bukkit API, formerly known as Thermos/Cauldron/MCPC+
Stars: ✭ 489 (-11.89%)
Mutual labels:  paper
Mlsh
Code for the paper "Meta-Learning Shared Hierarchies"
Stars: ✭ 548 (-1.26%)
Mutual labels:  paper
Srflow
Official SRFlow training code: Super-Resolution using Normalizing Flow in PyTorch
Stars: ✭ 537 (-3.24%)
Mutual labels:  paper
Pycnn
Image Processing with Cellular Neural Networks in Python
Stars: ✭ 509 (-8.29%)
Mutual labels:  paper

Status: Archive (code is provided as-is, no updates expected)

========================================= Generative Adversarial Imitation Learning


Jonathan Ho and Stefano Ermon

Contains an implementation of Trust Region Policy Optimization (Schulman et al., 2015).

Dependencies:

  • OpenAI Gym >= 0.1.0, mujoco_py >= 0.4.0
  • numpy >= 1.10.4, scipy >= 0.17.0, theano >= 0.8.2
  • h5py, pytables, pandas, matplotlib

Provided files:

  • expert_policies/* are the expert policies, trained by TRPO (scripts/run_rl_mj.py) on the true costs
  • scripts/im_pipeline.py is the main training and evaluation pipeline. This script is responsible for sampling data from experts to generate training data, running the training code (scripts/imitate_mj.py), and evaluating the resulting policies.
  • pipelines/* are the experiment specifications provided to scripts/im_pipeline.py
  • results/* contain evaluation data for the learned policies
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].