BabyaiBabyAI platform. A testbed for training agents to understand and execute language commands.
Stars: ✭ 490 (+1784.62%)
Reinforcement learning tutorial with demoReinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
Stars: ✭ 442 (+1600%)
Gym DuckietownSelf-driving car simulator for the Duckietown universe
Stars: ✭ 379 (+1357.69%)
Tf2rlTensorFlow2 Reinforcement Learning
Stars: ✭ 353 (+1257.69%)
Irl ImitationImplementation of Inverse Reinforcement Learning (IRL) algorithms in python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
Stars: ✭ 333 (+1180.77%)
FurnitureIKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
Stars: ✭ 282 (+984.62%)
pytorchrlDeep Reinforcement Learning algorithms implemented in PyTorch
Stars: ✭ 47 (+80.77%)
DI-driveOpenDILab Auto-driving platform
Stars: ✭ 210 (+707.69%)
hgailgail, infogail, hierarchical gail implementations
Stars: ✭ 25 (-3.85%)
SelfImitationDiverseTensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
Stars: ✭ 18 (-30.77%)
Reinforce-Paraphrase-GenerationThis repository contains the data and code for the paper "An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase Generation" (EMNLP2019).
Stars: ✭ 76 (+192.31%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+653.85%)
LycorisA lightweight and easy-to-use deep learning framework with neural architecture search.
Stars: ✭ 180 (+592.31%)
NmflibraryMATLAB library for non-negative matrix factorization (NMF): Version 1.8.1
Stars: ✭ 153 (+488.46%)
ContinuumA clean and simple data loading library for Continual Learning
Stars: ✭ 136 (+423.08%)
River🌊 Online machine learning in Python
Stars: ✭ 2,980 (+11361.54%)
Online learningOnline Learning for Human Detection in 3D Point Clouds
Stars: ✭ 97 (+273.08%)
Fwumious wabbitFwumious Wabbit, fast on-line machine learning toolkit written in Rust
Stars: ✭ 96 (+269.23%)
RoadmapGitBook: OSCP RoadMap
Stars: ✭ 89 (+242.31%)
SiddhiStream Processing and Complex Event Processing Engine
Stars: ✭ 1,185 (+4457.69%)
HyperganComposable GAN framework with api and user interface
Stars: ✭ 1,104 (+4146.15%)
Online SvrImplementation of Accurate Online Support Vector Regression in Python.
Stars: ✭ 52 (+100%)
Vowpal wabbitVowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
Stars: ✭ 7,815 (+29957.69%)
AngelA Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+24738.46%)
OnlinemoocVue前台 + Django3.1 + DjangoRestful Framework + Ant Design Pro V4后台 开发的在线教育网站及后台管理
Stars: ✭ 587 (+2157.69%)
Boost CookbookOnline examples from "Boost C++ Application Development Cookbook":
Stars: ✭ 306 (+1076.92%)
Train plus plusRepo and code of the IEEE UIC paper: Train++: An Incremental ML Model Training Algorithm to Create Self-Learning IoT Devices
Stars: ✭ 17 (-34.62%)
Competitive-Feature-LearningOnline feature-extraction and classification algorithm that learns representations of input patterns.
Stars: ✭ 32 (+23.08%)
Ftrl-FFMField-aware factorization machine (FFM) with FTRL
Stars: ✭ 25 (-3.85%)
oll-pythonOnline machine learning algorithms (based on OLL C++ library)
Stars: ✭ 23 (-11.54%)
Stargan V2StarGAN v2 - Official PyTorch Implementation (CVPR 2020)
Stars: ✭ 2,700 (+10284.62%)
VibeOfficial implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"
Stars: ✭ 2,080 (+7900%)
Alae[CVPR2020] Adversarial Latent Autoencoders
Stars: ✭ 3,178 (+12123.08%)
HiCMD[CVPR2020] Hi-CMD: Hierarchical Cross-Modality Disentanglement for Visible-Infrared Person Re-Identification
Stars: ✭ 64 (+146.15%)
C3NetC3Net: Demoireing Network Attentive in Channel, Color and Concatenation (CVPRW 2020)
Stars: ✭ 17 (-34.62%)
handobjectconsist[cvpr 20] Demo, training and evaluation code for joint hand-object pose estimation in sparsely annotated videos
Stars: ✭ 100 (+284.62%)
Meta-Fine-Tuning[CVPR 2020 VL3] The repository for meta fine-tuning in cross-domain few-shot learning.
Stars: ✭ 29 (+11.54%)