An Online-training-free Adaptor for Open Heterogeneous Collaborative Perception via Diffusion Model

  • Tianhang Wang
  • , Fan Lu
  • , Sanqing Qu
  • , Bin Li
  • , Ya Wu
  • , Hu Cao
  • , Alois Knoll
  • , Guang Chen

Research output: Contribution to journalArticlepeer-review

Abstract

Collaborative perception seeks to mitigate the limitations of single-vehicle perception, such as occlusions, by facilitating communication and information sharing among connected vehicles. However, most existing works assume a homogeneous scenario where all vehicles share identity sensor types and perception model architectures. In contrast, real-world systems often involve heterogeneous agents with diverse sensor configurations and independently developed models. In such settings, directly exchanging features without proper alignment can significantly degrade performance and hinder effective collaboration. While some methods have been proposed to address heterogeneity, they typically require retraining or access to internal model parameters, making them impractical for scalable deployment. To address these challenges, we propose DiffAlign, a plug-and-play adapter that enables feature alignment across heterogeneous agents in a training-free and model-agnostic manner. DiffAlign treats received BEV features as noisy latent representations and progressively refines them through a pretrained diffusion process. This alignment strategy does not require access to model internals or any retraining, which makes it both scalable and privacy-preserving while supporting diverse sensor modalities and perception backbones. Extensive experiments on simulated OPV2V and real-world V2V4Real datasets demonstrate that DiffAlign consistently improves detection performance in heterogeneous settings, improving CoBEVT by 132.01% and 91.95%, respectively. Our method provides a practical path toward scalable, generalizable, and deployment-ready collaborative perception.

Keywords

  • Collaborative perception
  • diffusion model
  • open heterogeneous

Fingerprint

Dive into the research topics of 'An Online-training-free Adaptor for Open Heterogeneous Collaborative Perception via Diffusion Model'. Together they form a unique fingerprint.

Cite this