GitHub - yuchen-x/ROLA (original) (raw)

ROLA: Robust Local Advantage Actor-Critic

This is the code for implementing the ROLA algorithm presented in the paper:Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning, IEEE The 3rd International Symposium on Multi-Robot and Multi-Agent Systems (MRS), 2021. Best Paper Award Finalist.

Minimum Requirements

Installation

cd ROLA/Anaconda_Env/
conda env create -f mrs21.yml

Core Hyper-Parameter

How to Run

rola.py with env_name='CT' n_agent=2 grid_dim=[6,6] n_envs=2 max_epi_steps=60 a_lr=0.0005 c_lr=0.0005 local_c_train_iteration=1 train_freq=2 c_target_update_freq=16 n_step_bootstrap=3 eps_decay_epis=15000 eps_end=0.05 total_epies=80000 eval_num_epi=10 eval_freq=100 eval_policy save_ckpt save_dir='ROLA_CT_6' run_idx=0 
rola.py with env_name='SBP' n_agent=2 grid_dim=[6,6] n_envs=2 max_epi_steps=100 small_box_reward=100 gamma=0.98 small_box_only terminal_reward_only a_lr=0.001 c_lr=0.003 local_c_train_iteration=4 train_freq=2 c_target_update_freq=32 n_step_bootstrap=3 eps_decay_epis=2000 eps_end=0.01 grad_clip_norm='None' total_epies=4000 eval_num_epi=10 eval_freq=100 eval_policy save_ckpt save_dir='ROLA_BP_6' run_idx=0
rola.py with env_name='pomdp_simple_spread' n_agent=3 n_envs=2 max_epi_steps=25 discrete_mul=2 obs_r=1.4 a_lr=0.001 c_lr=0.001 local_c_train_iteration=4 train_freq=2 c_target_update_freq=16 n_step_bootstrap=5 eps_decay_epis=50000 eps_end=0.05 total_epies=100000 eval_num_epi=10 eval_freq=100 eval_policy save_ckpt save_dir='ROLA_CN_6' run_idx=0
rola.py with env_name='pomdp_advanced_spread' config_name='antipodal' n_agent=4 n_envs=2 max_epi_steps=50 discrete_mul=1 obs_r=1.4 a_lr=0.001 c_lr=0.001 local_c_train_iteration=4 train_freq=2 c_target_update_freq=16 n_step_bootstrap=5 eps_decay_epis=20000 eps_end=0.05 total_epies=30000 eval_num_epi=10 eval_freq=100 eval_policy save_ckpt save_dir='ROLA_AN_6' run_idx=0

NOTE: both training and testing results are saved in the specified directory under the ./performance/ folder.

Code Structure

Citation

If you used this code for your research or found it helpful, please consider citing this paper:

@InProceedings{xiao_mrs_2021, author = "Xiao, Yuchen and Lyu, Xueguang and Amato, Christopher", title = "Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning", booktitle = "IEEE The 3rd International Symposium on Multi-Robot and Multi-Agent Systems", year = "2021" }