Cooperation of partially observed agents in ad-hoc open teams

Вiнокур, Євгенiй

Files

Vinokur_Bakalavrska_robota.pdf (1.24 MB)

Vinokur_Bakalavrska_robota_1.pdf (12.38 MB)

Date

2025

Authors

Вiнокур, Євгенiй

Abstract

The aim of the research is a systematic comparison of eight decentralized training baselines. We are inspired by earlier research in which the authors tested choosing the best clearing action with Deep Learning on a fire-spread simulation. However, those authors offer a limited choice of algorithms and metrics, and they encounter non-stationarity issues caused by a common reward. Our research focuses on extensive benchmarking of the independent, value-decomposition, central-critic, and agent-modeling methods proposed by Papoudakis et al., evaluated under common hardware and runtime constraints. Our work considers the constraints of partial observability, generalization, and mixed teams. The results offer insights into the beneficial features of the baselines to assist further research in selecting or developing effective algorithms for decentralized planning and control. Our contribution ports the Wildfire benchmark, created by the Tran Research Group, to the PettingZoo library to facilitate verification of our results.

Keywords

decentralized training baselines, Deep Learning, algorithms with limited metrics, benchmarking, PettingZoo library, bachelor's thesis

URI

https://ekmair.ukma.edu.ua/handle/123456789/36425

Collections

F1 Applied Mathematics