All Projects → XueyingBai → Model-Based-Reinforcement-Learning-for-Online-Recommendation

XueyingBai / Model-Based-Reinforcement-Learning-for-Online-Recommendation

Licence: other
A pytorch implementation of A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Model-Based-Reinforcement-Learning-for-Online-Recommendation

omd
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
Stars: ✭ 43 (+30.3%)
Mutual labels:  model-based-rl
dads
Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined with model-based control.
Stars: ✭ 138 (+318.18%)
Mutual labels:  model-based-rl
Deep-Reinforcement-Learning-CS285-Pytorch
Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
Stars: ✭ 104 (+215.15%)
Mutual labels:  model-based-rl
AdMRL
Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)
Stars: ✭ 30 (-9.09%)
Mutual labels:  model-based-rl
neural-mpc
No description or website provided.
Stars: ✭ 54 (+63.64%)
Mutual labels:  model-based-rl

Model-Based-Reinforcement-Learning-for-Online-Recommendation

A pytorch implementation of Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation (https://arxiv.org/pdf/1911.03845.pdf).

Usage:

In the directory of IRecGAN, type command:

python main.py --click ../simulation_task1/gen_click.txt --reward ../simulation_task1/gen_reward.txt --action ../simulation_task1/gen_action.txt --model LSTM --nhid 128 --n_layers_usr 2 --optim_nll adam --optim_adv adam --batch_size 128

The variable 'interact' in main.py enables online training and evaluation with the environment in ./simulation_task1. However, many routes in ./simulation_task1 have to be changed.

./simulation_task1 contains a simulated environment(different from paper) and offline data can be generated by:

python Generate_data.py

Pytorch version: 1.1.0

Notice: write for fun and double-checking. Not the implementation in the paper. Please refer to the link in the paper for its implementation.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].