Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization.
To install all the required dependencies:
- Install MuJoCo engine, which can be downloaded from here.
- Install Python packages listed in
requirements.txtusingpip install -r requirements.txt. You should specify the version ofmujoco-pyinrequirements.txtdepending on the version of MuJoCo engine you have installed. - Manually download and install
d4rlpackage from here. - Manually download and install
neorlpackage from here.
Just run train.py with specifying the task name. Other hyperparameters are automatically loaded from config.
python train.py --task [TASKNAME]If you find this repository useful for your research, please cite:
@inproceedings{
mobile,
title={Model-Bellman Inconsistency Penalized Offline Policy Optimization},
author={Yihao Sun and Jiaji Zhang and Chengxing Jia and Haoxin Lin and Junyin Ye and Yang Yu},
booktitle={International Conference on Machine Learning},
year={2023}
}