All Projects → zsdonghao → Imitation Learning Dagger Torcs

zsdonghao / Imitation Learning Dagger Torcs

A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Imitation Learning Dagger Torcs

Awesome Tensorlayer
A curated list of dedicated resources and applications
Stars: ✭ 248 (+313.33%)
Mutual labels:  tensorflow-tutorials, tensorlayer
Tensorlayer Tricks
How to use TensorLayer
Stars: ✭ 357 (+495%)
Mutual labels:  tensorflow-tutorials, tensorlayer
data aggregation
This repository contains the code for the CVPR 2020 paper "Exploring Data Aggregation in Policy Learning for Vision-based Urban Autonomous Driving"
Stars: ✭ 26 (-56.67%)
Mutual labels:  dagger, imitation-learning
Tensorflow Tutorial
TensorFlow and Deep Learning Tutorials
Stars: ✭ 748 (+1146.67%)
Mutual labels:  tensorflow-tutorials, tensorlayer
Spatial-Transformer-Nets
Spatial Transformer Nets in TensorFlow/ TensorLayer
Stars: ✭ 38 (-36.67%)
Mutual labels:  tensorflow-tutorials, tensorlayer
Tensorlayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥
Stars: ✭ 6,796 (+11226.67%)
Mutual labels:  tensorflow-tutorials, tensorlayer
Dagger2demo
Stars: ✭ 30 (-50%)
Mutual labels:  dagger
Deterministic Gail Pytorch
PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning
Stars: ✭ 44 (-26.67%)
Mutual labels:  imitation-learning
Tensorflow examples
reference code for tensorflow
Stars: ✭ 14 (-76.67%)
Mutual labels:  tensorflow-tutorials
Androidall
Android 程序员需要掌握的技术栈:数据结构算法、程序架构、设计模式、性能优化、插件化、热更新、Kotlin、NDK、Jetpack,以及常用的开源框架源码分析如 Flutter、Router、RxJava、Glide、LeakCanary、Dagger2、Retrofit、OkHttp、ButterKnife 等
Stars: ✭ 849 (+1315%)
Mutual labels:  dagger
Tensorflow Tutorials For Time Series
TensorFlow Tutorial for Time Series Prediction
Stars: ✭ 1,067 (+1678.33%)
Mutual labels:  tensorflow-tutorials
Deep Nlp
Tensorflow Tutorial files and Implementations of various Deep NLP and CV Models.
Stars: ✭ 51 (-15%)
Mutual labels:  tensorflow-tutorials
Tensorflow Serving sidecar
Serve machine learning models using tensorflow serving
Stars: ✭ 41 (-31.67%)
Mutual labels:  tensorflow-tutorials
Hyperpose
HyperPose: A Collection of Real-time Human Pose Estimation
Stars: ✭ 961 (+1501.67%)
Mutual labels:  tensorlayer
Tr2main
Tomb Raider II Injector Dynamic Library
Stars: ✭ 46 (-23.33%)
Mutual labels:  dagger
Tensorflow In Practice Specialization
DeepLearning.AI TensorFlow Developer Professional Certificate Specialization
Stars: ✭ 29 (-51.67%)
Mutual labels:  tensorflow-tutorials
Awesome Android Kotlin Apps
👓 A curated list of awesome android kotlin apps by open-source contributors.
Stars: ✭ 1,058 (+1663.33%)
Mutual labels:  dagger
Tensorflow Find Object
📸 A simple application to demonstrate TensorflowJS using mobile net model to predict objects via camera API.
Stars: ✭ 12 (-80%)
Mutual labels:  tensorflow-tutorials
Star Wars Shop
Simple project with clean architecture and android lifecycle
Stars: ✭ 37 (-38.33%)
Mutual labels:  dagger
Tensorflow From Zero To One
TensorFlow 最佳学习资源大全(含课程、书籍、博客、公开课等内容)
Stars: ✭ 1,052 (+1653.33%)
Mutual labels:  tensorflow-tutorials

Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env

This repository implements a simple algorithm for imitation learning: DAGGER. In this example, the agent only learns to control the steer [-1, 1], the speed is computed automatically in gym_torcs.TorcsEnv.

Requirements

  1. Ubuntu (I only test on this)
  2. Python 3
  3. TensorLayer and TensorFlow
  4. Gym-Torcs

Setting Up

It is a little bit boring to set up the environment, but any incorrect configurations will lead to FAILURE. After installing Gym-Torcs, please follow the instructions to confirm everything work well:

  • Open a terminal:

    • Run sudo torcs -vision to start a game
    • Race --> Practice --> Configure Race: set the driver to scr_server 1 instead of player
    • Open Torcs server by selecting Race --> Practice --> New Race: This should result that Torcs keeps a blue screen with several text information.
  • Open another terminal:

    • Run python snakeoil3_gym.py on another terminal, it will shows how the fake AI control the car.
    • Press F2 to see the driver view.
  • Set image size to 64x64x3:

    • The model is trained on 64x64 RGB observation.
    • Run sudo torcs -vision to start a game
    • Options --> Display --> select 64x64 --> Apply

Usage

Make sure everything above work well and then run:

  • python dagger.py

It will start a Torcs server at the beginning of every episode, and terminate the server when the car crashs or the speed is too low. Note that, the self-contained gym_torcs.py is modified from Gym-Torcs, you can try different settings (like default speed, terminated speed) by modifying it.

Results

After Episode 1, the car crashes after 315 steps.

After Episode 3, the car does not crash anymore !!!

The number of steps and episodes might vary depending on the parameters initialization.

ENJOY !

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].