Multi-Critic Policy Gradient Optimization for Quadcopter Coordination
To cite this repository in publications:
@article{DBLP:journals/corr/abs-2012-15472,
author = {Yoav Alon and
Huiyu Zhou},
title = {Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Coordination
by Multi-Critic Policy Gradient Optimization},
journal = {CoRR},
volume = {abs/2012.15472},
year = {2020},
url = {https://arxiv.org/abs/2012.15472},
archivePrefix = {arXiv},
eprint = {2012.15472},
timestamp = {Fri, 08 Jan 2021 17:23:09 +0100},
biburl = {https://dblp.org/rec/journals/corr/abs-2012-15472.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}