XueyingBai / Model-Based-Reinforcement-Learning-for-Online-Recommendation

Licence: other

A pytorch implementation of A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation.

Programming Languages

python

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Model-Based-Reinforcement-Learning-for-Online-Recommendation

omd

JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"

Stars: ✭ 43 (+30.3%)

Mutual labels: model-based-rl

dads

Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined with model-based control.

Stars: ✭ 138 (+318.18%)

Mutual labels: model-based-rl

Deep-Reinforcement-Learning-CS285-Pytorch

Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework

Stars: ✭ 104 (+215.15%)

Mutual labels: model-based-rl

AdMRL

Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)

Stars: ✭ 30 (-9.09%)

Mutual labels: model-based-rl

neural-mpc

No description or website provided.

Stars: ✭ 54 (+63.64%)

Mutual labels: model-based-rl

Model-Based-Reinforcement-Learning-for-Online-Recommendation

A pytorch implementation of Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation (https://arxiv.org/pdf/1911.03845.pdf).

Usage:

In the directory of IRecGAN, type command:

python main.py --click ../simulation_task1/gen_click.txt --reward ../simulation_task1/gen_reward.txt --action ../simulation_task1/gen_action.txt --model LSTM --nhid 128 --n_layers_usr 2 --optim_nll adam --optim_adv adam --batch_size 128

The variable 'interact' in main.py enables online training and evaluation with the environment in ./simulation_task1. However, many routes in ./simulation_task1 have to be changed.

./simulation_task1 contains a simulated environment(different from paper) and offline data can be generated by:

python Generate_data.py

Pytorch version: 1.1.0

Notice: write for fun and double-checking. Not the implementation in the paper. Please refer to the link in the paper for its implementation.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

XueyingBai / Model-Based-Reinforcement-Learning-for-Online-Recommendation

Programming Languages

Labels

Projects that are alternatives of or similar to Model-Based-Reinforcement-Learning-for-Online-Recommendation

Model-Based-Reinforcement-Learning-for-Online-Recommendation

Usage: