Coordinated online learning for multiagent systems with coupled constraints and perturbed utility observations

Ezra Tampubolon, Holger Boche

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Competitive noncooperative online decision-making agents whose actions increase congestion of scarce resources constitute a model for widespread modern large-scale applications. To ensure sustainable resource behavior, we introduce a novel method to steer the agents toward a stable population state, fulfilling the given coupled resource constraints. The proposed method is a decentralized resource pricing method based on the resource loads resulting from the augmentation of the game's Lagrangian. Assuming that the online learning agents have only noisy first-order utility feedback, we show that for a polynomially decaying agents step size/learning rate, the population's dynamic will almost surely converge to generalized Nash equilibrium. A particular consequence of the latter is the fulfillment of resource constraints in the asymptotic limit. Moreover, we investigate the finite-time quality of the proposed algorithm by giving a nonasymptotic time decaying bound for the expected amount of resource constraint violation.

Original languageEnglish
Pages (from-to)5080-5095
Number of pages16
JournalIEEE Transactions on Automatic Control
Volume66
Issue number11
DOIs
StatePublished - 1 Nov 2021

Keywords

  • Agents and autonomous systems
  • constrained control
  • game theory
  • machine learning

Fingerprint

Dive into the research topics of 'Coordinated online learning for multiagent systems with coupled constraints and perturbed utility observations'. Together they form a unique fingerprint.

Cite this