Masked Event Modeling: Self-Supervised Pretraining for Event Cameras

Simon Klenk, David Bonello, Lukas Koestler, Nikita Araslanov, Daniel Cremers

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

Abstract

Event cameras asynchronously capture brightness changes with low latency, high temporal resolution, and high dynamic range. However, annotation of event data is a costly and laborious process, which limits the use of deep learning methods for classification and other semantic tasks with the event modality. To reduce the dependency on labeled event data, we introduce Masked Event Modeling (MEM), a self-supervised framework for events. Our method pretrains a neural network on unlabeled events, which can originate from any event camera recording. Subsequently, the pretrained model is finetuned on a downstream task, leading to a consistent improvement of the task accuracy. For example, our method reaches state-of-the-art classification accuracy across three datasets, N-ImageNet, N-Cars, and N-Caltech101, increasing the top-1 accuracy of previous work by significant margins. When tested on real-world event data, MEM is even superior to supervised RGB-based pretraining. The models pretrained with MEM are also label-efficient and generalize well to the dense task of semantic image segmentation.

OriginalspracheEnglisch
TitelProceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers Inc.
Seiten2367-2377
Seitenumfang11
ISBN (elektronisch)9798350318920
DOIs
PublikationsstatusVeröffentlicht - 2024
Veranstaltung2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 - Waikoloa, USA/Vereinigte Staaten
Dauer: 4 Jan. 20248 Jan. 2024

Publikationsreihe

NameProceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024

Konferenz

Konferenz2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024
Land/GebietUSA/Vereinigte Staaten
OrtWaikoloa
Zeitraum4/01/248/01/24

Fingerprint

Untersuchen Sie die Forschungsthemen von „Masked Event Modeling: Self-Supervised Pretraining for Event Cameras“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren