All Projects → hardmaru → Astool

hardmaru / Astool

Licence: other
Augmented environments with RL

Projects that are alternatives of or similar to Astool

Attention Transfer
Improving Convolutional Networks via Attention Transfer (ICLR 2017)
Stars: ✭ 1,231 (+1419.75%)
Mutual labels:  jupyter-notebook
Fnn
Embed strange attractors using a regularizer for autoencoders
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Neural Structural Optimization
Neural reparameterization improves structural optimization
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Mimic Code
MIMIC Code Repository: Code shared by the research community for the MIMIC-III database
Stars: ✭ 1,225 (+1412.35%)
Mutual labels:  jupyter-notebook
Style Semantics
Code for the paper "Controlling Style and Semantics in Weakly-Supervised Image Generation", ECCV 2020
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Object Detection On Thermal Images
Robust Object Classification of Occluded Objects in Forward Looking Infrared (FLIR) Cameras
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Machine Learning
Code & Data for Introduction to Machine Learning with Scikit-Learn
Stars: ✭ 80 (-1.23%)
Mutual labels:  jupyter-notebook
Unet Tgs
Applying UNET Model on TGS Salt Identification Challenge hosted on Kaggle
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Wellnessconversation Languagemodel
Korean Language Model을 이용한 심리상담 대화 언어 모델.
Stars: ✭ 80 (-1.23%)
Mutual labels:  jupyter-notebook
Deep transfer learning nlp dhs2019
Contains the code and deck for the presentation on Applying Deep Transfer Learning for NLP in Analytics Vidhya's DataHack Summit 2019
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Talks odt
Slides and materials for most of my talks by year
Stars: ✭ 80 (-1.23%)
Mutual labels:  jupyter-notebook
Tutorials2021
Mediterranean Machine Learning school tutorials
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Kgtk
Knowledge Graph Toolkit
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Fcn.tensorflow
Tensorflow implementation of Fully Convolutional Networks for Semantic Segmentation (http://fcn.berkeleyvision.org)
Stars: ✭ 1,230 (+1418.52%)
Mutual labels:  jupyter-notebook
Learn machine learning
Road to Machine Learning
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Hands On Algorithmic Problem Solving
A middle-to-high level algorithm book designed with coding interview at heart!
Stars: ✭ 1,227 (+1414.81%)
Mutual labels:  jupyter-notebook
Dl in nlp deeppavlov cs224n spring2020
"Deep Learning in Natural Language Processing" - a course by DeepPavlov built on top of Stanford's cs224n
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Mlnetexamples
A collection of examples for the ML.NET machine learning package from Microsoft
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Scala Cheatsheet
The Biggest Scala Cheat-Sheet.
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook
Lifetime value
Stars: ✭ 81 (+0%)
Mutual labels:  jupyter-notebook

ASTool (fork of ESTool)

Evolved Biped Walker.

Code to reproduce “Reinforcement Learning for Improving Agent Design” (designrl.github.io and arxiv.org/abs/1810.03779). Uses OpenAI Gym version 9.3, rather than most recent version.

Instructions

To run pre-trained models:

python model.py ENVNAME zoo/ENVNAME.json

Where ENVNAME is one of:

augment_ant

augmentbipedhard
augmentbipedhardsmalllegs

augmentbiped
augmentbipedsmalllegs

To train new models:

python train.py ENVNAME -n 96 -e 16 -t 2

Where 96 is the number of CPU cores you have on a cloud virtual machine (the actual number of workers will be multiplied by 2). The cumulative reward used to calculate the gradients in REINFORCE will be the average of 16 trials. The trained models will be saved in log/ENVNAME...best.json

License

MIT

Citation

If you find this work useful, we would appreciate a reference to our paper:

Reinforcement Learning for Improving Agent Design. David Ha. arXiv:1810.03779

@article{ha2018designrl,
  title={Reinforcement Learning for Improving Agent Design},
  author={Ha, David},
  journal={arXiv preprint arXiv:1810.03779},
  year={2018}
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].