Driving Strategy for Vehicles in Lane-Free Traffic Environment Based on Deep Deterministic Policy Gradient and Artificial Forces

Mehran Berahman, Majid Rostmai-Shahrbabaki, Klaus Bogenberger

Research output: Contribution to journalConference articlepeer-review

10 Scopus citations

Abstract

This paper proposes a novel driving strategy tor Connected and Automated Vehicles (CAVs) in a lane-free traffic environment. To this end, a combination of artificial forces and a reinforcement learning approach are used. To ensure the safe driving behavior of vehicles, an artificial ellipsoid border is assumed around each vehicle by which the lateral and longitudinal forces are obtained and applied. Furthermore, a longitudinal repulsive force based on a Deep Deterministic Policy Gradient (DDPG) network is exerted on the vehicles to avoid longitudinal collisions. Using this approach, the reaction of vehicles is improved, and vehicles may experience closer longitudinal space gaps allowing higher network throughput. The proposed lane-free driving methodology is implemented in the SUMO traffic simulator to showcase its benefits. Additionally, by implementing typical lane-based scenarios in SUMO with the same road condition and traffic demand as lane-free scenarios, a comparison in terms of average speed and time delay has been drawn between the proposed innovative approach and its conventional counterpart, proving the developed approach's functionality.

Original languageEnglish
Pages (from-to)14-21
Number of pages8
JournalIFAC Proceedings Volumes (IFAC-PapersOnline)
Volume55
Issue number14
DOIs
StatePublished - 1 Jul 2022
Event11th IFAC Symposium on Intelligent Autonomous Vehicles, IAV 2022 - Prague, Czech Republic
Duration: 6 Jun 20228 Jun 2022

Keywords

  • Artificial force
  • Connected and automated vehicles
  • Deep deterministic policy gradient
  • Deep reinforcement learning
  • Lane-free traffic

Fingerprint

Dive into the research topics of 'Driving Strategy for Vehicles in Lane-Free Traffic Environment Based on Deep Deterministic Policy Gradient and Artificial Forces'. Together they form a unique fingerprint.

Cite this