TY - JOUR
T1 - MOTChallenge
T2 - A Benchmark for Single-Camera Multiple Target Tracking
AU - Dendorfer, Patrick
AU - Os̆ep, Aljos̆a
AU - Milan, Anton
AU - Schindler, Konrad
AU - Cremers, Daniel
AU - Reid, Ian
AU - Roth, Stefan
AU - Leal-Taixé, Laura
N1 - Publisher Copyright:
© 2020, The Author(s).
PY - 2021/4
Y1 - 2021/4
N2 - Standardized benchmarks have been crucial in pushing the performance of computer vision algorithms, especially since the advent of deep learning. Although leaderboards should not be over-claimed, they often provide the most objective measure of performance and are therefore important guides for research. We present MOTChallenge, a benchmark for single-camera Multiple Object Tracking (MOT) launched in late 2014, to collect existing and new data and create a framework for the standardized evaluation of multiple object tracking methods. The benchmark is focused on multiple people tracking, since pedestrians are by far the most studied object in the tracking community, with applications ranging from robot navigation to self-driving cars. This paper collects the first three releases of the benchmark: (i) MOT15, along with numerous state-of-the-art results that were submitted in the last years, (ii) MOT16, which contains new challenging videos, and (iii) MOT17, that extends MOT16 sequences with more precise labels and evaluates tracking performance on three different object detectors. The second and third release not only offers a significant increase in the number of labeled boxes, but also provide labels for multiple object classes beside pedestrians, as well as the level of visibility for every single object of interest. We finally provide a categorization of state-of-the-art trackers and a broad error analysis. This will help newcomers understand the related work and research trends in the MOT community, and hopefully shed some light into potential future research directions.
AB - Standardized benchmarks have been crucial in pushing the performance of computer vision algorithms, especially since the advent of deep learning. Although leaderboards should not be over-claimed, they often provide the most objective measure of performance and are therefore important guides for research. We present MOTChallenge, a benchmark for single-camera Multiple Object Tracking (MOT) launched in late 2014, to collect existing and new data and create a framework for the standardized evaluation of multiple object tracking methods. The benchmark is focused on multiple people tracking, since pedestrians are by far the most studied object in the tracking community, with applications ranging from robot navigation to self-driving cars. This paper collects the first three releases of the benchmark: (i) MOT15, along with numerous state-of-the-art results that were submitted in the last years, (ii) MOT16, which contains new challenging videos, and (iii) MOT17, that extends MOT16 sequences with more precise labels and evaluates tracking performance on three different object detectors. The second and third release not only offers a significant increase in the number of labeled boxes, but also provide labels for multiple object classes beside pedestrians, as well as the level of visibility for every single object of interest. We finally provide a categorization of state-of-the-art trackers and a broad error analysis. This will help newcomers understand the related work and research trends in the MOT community, and hopefully shed some light into potential future research directions.
KW - Computer vision
KW - Evaluation
KW - MOTA
KW - MOTChallenge
KW - Multi-object-tracking
UR - http://www.scopus.com/inward/record.url?scp=85098212408&partnerID=8YFLogxK
U2 - 10.1007/s11263-020-01393-0
DO - 10.1007/s11263-020-01393-0
M3 - Article
AN - SCOPUS:85098212408
SN - 0920-5691
VL - 129
SP - 845
EP - 881
JO - International Journal of Computer Vision
JF - International Journal of Computer Vision
IS - 4
ER -