reinforcement learning

UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers

Sep 30, 2021 2 min read

UPDeT

Official Implementation of UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers (ICLR 2021 spotlight)

The framework is inherited from PyMARL. UPDeT is written in pytorch and uses SMAC as its environment.

Installation instructions

Installing dependencies:

pip install -r requirements.txt

Download SC2 into the `3rdparty/` folder and copy the maps necessary to run over.

bash install_sc2.sh

Run an experiment

Before training your own transformer-based multi-agent model, there are a list of things to note.

Currently, this repository supports marine-based battle scenarios. e.g. 3m, 8m, 5m_vs_6m.
If you are interested in training a different unit type, carefully modify the Transformer Parameters block at src/config/default.yaml and revise the _build_input_transformer function in basic_controller.python.
Before running the experiment, check the agent type in Agent Parameters block at src/config/default.yaml.
This repository contains two new transformer-based agents from the UPDeT paper including
- Standard UPDeT
- Aggregation Transformer

Training script

python3 src/main.py --config=vdn --env-config=sc2 with env_args.map_name=5m_vs_6m

All results will be stored in the Results/ folder.

Performance

Single battle scenario

Surpass the GRU baseline on hard 5m_vs_6m with:

Multiple battle scenarios

Zero-shot generalize to different tasks:

Result on 7m-5m-3m transfer learning.

Note: Only UPDeT can be deployed to other scenarios without changing the model’s architecture.

More details please refer to UPDeT paper.

Bibtex

@article{hu2021updet,
  title={UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers},
  author={Hu, Siyi and Zhu, Fengda and Chang, Xiaojun and Liang, Xiaodan},
  journal={arXiv preprint arXiv:2101.08001},
  year={2021}
}

License

The MIT License

GitHub

reinforcement learning Transformer

John was the first writer to have joined pythonawesome.com. He has since then inculcated very effective writing and reviewing culture at pythonawesome which rivals have found impossible to imitate.

Minimal implementaion of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym

Minimal implementaion of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym

Minimal implementaion of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym

14 February 2022

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling

03 December 2021

Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)

Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)

Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)

17 January 2023

TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation

TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation

TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation This repository is an official implementation of TOIST: TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation Pengfei Li, Beiwen Tian, Yongliang Shi, Xiaoxue

17 January 2023

reinforcement learning

[TMC] Delay-Sensitive Energy-Efficient UAV Crowdsensing by Deep Reinforcement Learning

[TMC] Delay-Sensitive Energy-Efficient UAV Crowdsensing by Deep Reinforcement Learning

03 September 2022

reinforcement learning

AoI-minimal UAV Crowdsensing by Model-based Graph Convolutional Reinforcement Learning

AoI-minimal UAV Crowdsensing by Model-based Graph Convolutional Reinforcement Learning

03 September 2022

Distributed and Energy-Efficient Mobile Crowdsensing with Charging Stations by Deep Reinforcement Learning

Distributed and Energy-Efficient Mobile Crowdsensing with Charging Stations by Deep Reinforcement Learning

03 September 2022

Standard interface for entity based reinforcement learning environments

Standard interface for entity based reinforcement learning environments

03 September 2022