openai / Imitation
Licence: mit
Code for the paper "Generative Adversarial Imitation Learning"
Stars: ✭ 555
Programming Languages
python
139335 projects - #7 most used programming language
Labels
Projects that are alternatives of or similar to Imitation
Cvpr 2019 Paper Statistics
Statistics and Visualization of acceptance rate, main keyword of CVPR 2019 accepted papers for the main Computer Vision conference (CVPR)
Stars: ✭ 527 (-5.05%)
Mutual labels: paper
Conditional Pixelcnn Decoder
Tensorflow implementation of Gated Conditional Pixel Convolutional Neural Network
Stars: ✭ 479 (-13.69%)
Mutual labels: paper
Daily Paper Computer Vision
记录每天整理的计算机视觉/深度学习/机器学习相关方向的论文
Stars: ✭ 4,977 (+796.76%)
Mutual labels: paper
Yatopia
The Most Powerful and Feature Rich Minecraft Server Software!
Stars: ✭ 408 (-26.49%)
Mutual labels: paper
Rgan
Recurrent (conditional) generative adversarial networks for generating real-valued time series data.
Stars: ✭ 480 (-13.51%)
Mutual labels: paper
Qlib
Qlib is an AI-oriented quantitative investment platform, which aims to realize the potential, empower the research, and create the value of AI technologies in quantitative investment. With Qlib, you can easily try your ideas to create better Quant investment strategies. An increasing number of SOTA Quant research works/papers are released in Qlib.
Stars: ✭ 7,582 (+1266.13%)
Mutual labels: paper
Jukebox
Code for the paper "Jukebox: A Generative Model for Music"
Stars: ✭ 4,863 (+776.22%)
Mutual labels: paper
Iaf
Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"
Stars: ✭ 468 (-15.68%)
Mutual labels: paper
Arxiv Style
A Latex style and template for paper preprints (based on NIPS style)
Stars: ✭ 497 (-10.45%)
Mutual labels: paper
Knowledge Distillation Papers
knowledge distillation papers
Stars: ✭ 422 (-23.96%)
Mutual labels: paper
Mohist
Minecraft Forge Hybrid server implementing the Paper/Spigot/Bukkit API, formerly known as Thermos/Cauldron/MCPC+
Stars: ✭ 489 (-11.89%)
Mutual labels: paper
Mlsh
Code for the paper "Meta-Learning Shared Hierarchies"
Stars: ✭ 548 (-1.26%)
Mutual labels: paper
Srflow
Official SRFlow training code: Super-Resolution using Normalizing Flow in PyTorch
Stars: ✭ 537 (-3.24%)
Mutual labels: paper
Pycnn
Image Processing with Cellular Neural Networks in Python
Stars: ✭ 509 (-8.29%)
Mutual labels: paper
Status: Archive (code is provided as-is, no updates expected)
========================================= Generative Adversarial Imitation Learning
Jonathan Ho and Stefano Ermon
Contains an implementation of Trust Region Policy Optimization (Schulman et al., 2015).
Dependencies:
- OpenAI Gym >= 0.1.0, mujoco_py >= 0.4.0
- numpy >= 1.10.4, scipy >= 0.17.0, theano >= 0.8.2
- h5py, pytables, pandas, matplotlib
Provided files:
-
expert_policies/*
are the expert policies, trained by TRPO (scripts/run_rl_mj.py
) on the true costs -
scripts/im_pipeline.py
is the main training and evaluation pipeline. This script is responsible for sampling data from experts to generate training data, running the training code (scripts/imitate_mj.py
), and evaluating the resulting policies. -
pipelines/*
are the experiment specifications provided toscripts/im_pipeline.py
-
results/*
contain evaluation data for the learned policies
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].