Reinforcement Learning for Optimizing Routing in the Production Supply of Matrix Production Systems

Florian Ried, Simon Niederdränk, Johannes Fottner

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Matrix production systems offer the flexibility to meet an increasingly individualized and volatile customer demand. However, production supply processes within these systems have rarely been investigated in detail despite playing an integral role in their performance. Contributing to closing this research gap, this work utilizes reinforcement learning for routing in the production supply of matrix production systems. In particular, it focuses on dispatching orders to the vehicles and scheduling the orders within a route. Various constraints are considered to simulate a realistic setting, including order time windows, vehicle battery limitations, and a vehicle capacity allowing to transport multiple items at once. A reinforcement learning framework is conceptualized and implemented, assigning orders to vehicles based on various route construction heuristics. Its observation space contains abstract information about current orders of the matrix production supply environment and specific data on the vehicles for the reinforcement learning agent to select both a vehicle and a heuristic. The action and observation spaces are complemented by a multi-criteria reward function, prompting the agent to learn not to violate any constraints of the environment while simultaneously choosing actions that lead to the most cost-effective routes after route optimization. The reinforcement learning route constructor approach is trained and deployed on a discrete-event simulation of a matrix production system, which is connected to the reinforcement learning framework via a socket interface. The approach has proven to be successful by outperforming two non-reinforcement learning heuristics for route construction.

Original languageEnglish
Title of host publicationInnovative Intelligent Industrial Production and Logistics - 5th International Conference, IN4PL 2024, Proceedings
EditorsMichele Dassisti, Kurosh Madani, Hervé Panetto
PublisherSpringer Science and Business Media Deutschland GmbH
Pages270-281
Number of pages12
ISBN (Print)9783031807596
DOIs
StatePublished - 2025
Event5th International Conference on Innovative Intelligent Industrial Production and Logistics, IN4PL 2024 - Porto, Portugal
Duration: 21 Nov 202422 Nov 2024

Publication series

NameCommunications in Computer and Information Science
Volume2372 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference5th International Conference on Innovative Intelligent Industrial Production and Logistics, IN4PL 2024
Country/TerritoryPortugal
CityPorto
Period21/11/2422/11/24

Keywords

  • Dynamic pickup-and-delivery problem
  • Matrix production systems
  • Reinforcement learning
  • Routing

Fingerprint

Dive into the research topics of 'Reinforcement Learning for Optimizing Routing in the Production Supply of Matrix Production Systems'. Together they form a unique fingerprint.

Cite this