popovicidaniela / Master-Thesis

Licence: other
Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm is used to train a car to drive in TORCS. Built with Python 3.5, TensorFlow, TensorBoard, NumPy, gym-torcs, Ubuntu, and LaTeX.

Programming Languages

TeX
Python
shell

Deep Reinforcement Learning in Autonomous Driving

Works best with a discrete action space, using 4 workers and a learning rate of 1e-4.
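As a rough illustration of what a discrete action setup could look like, the sketch below maps an action index to a steering command. Only the worker count and learning rate come from the note above; the constant names, the choice of 5 evenly spaced steering bins, and the helper functions are assumptions made for this example and are not taken from the repository.

```python
import numpy as np

# Values stated in the note above; the constant names are hypothetical.
NUM_WORKERS = 4
LEARNING_RATE = 1e-4

# Assumption: steering is discretised into 5 evenly spaced values in [-1, 1].
STEERING_BINS = np.linspace(-1.0, 1.0, num=5)


def action_to_steering(action_index):
    """Map a discrete action index to a steering value in [-1, 1]."""
    return float(STEERING_BINS[action_index])


def sample_action(policy_probs, rng):
    """Sample an action index from the policy's probability vector."""
    return int(rng.choice(len(policy_probs), p=policy_probs))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # A uniform policy over the 5 assumed steering bins.
    probs = np.full(len(STEERING_BINS), 1.0 / len(STEERING_BINS))
    index = sample_action(probs, rng)
    print(index, action_to_steering(index))
```

With a discrete policy the network only has to output one probability per bin, which may be part of why this setting was easier to get working than the continuous one.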

Couldn't get it to work for the continuous action space; something goes wrong with the actions it generates.

The A3C idea

Image of A3C idea
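
The image is not reproduced here. As a minimal sketch of the computation each A3C worker typically performs on a short rollout, the code below builds n-step discounted returns and subtracts the critic's value estimates to get advantages. The function names, the `gamma` default, and the use of plain NumPy are assumptions for illustration; the repository's own TensorFlow code may be organised differently.

```python
import numpy as np


def n_step_returns(rewards, bootstrap_value, gamma=0.99):
    """Discounted returns for one rollout, bootstrapped from the critic.

    bootstrap_value is the critic's estimate for the state after the last
    step (0.0 if the episode ended there).
    """
    returns = np.zeros(len(rewards), dtype=np.float64)
    running = bootstrap_value
    # Walk backwards so each return reuses the one that follows it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def advantages(rewards, values, bootstrap_value, gamma=0.99):
    """Advantage estimate: n-step return minus the critic's value per step."""
    returns = n_step_returns(rewards, bootstrap_value, gamma)
    return returns - np.asarray(values, dtype=np.float64)
```

The advantages weight the policy-gradient term for the actor, while the returns serve as regression targets for the critic; each worker computes these on its own rollout and applies the resulting gradients to the shared network asynchronously.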

Overview

Image of Overview

Architecture of the Artificial Neural Network

Image of Architecture of the ANN
