Trajectory Similarity using Compression

Gabriel Dax, Martin Werner

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we present a novel approach for trajectory similarity based on Kolmogorov complexity approximated by a lossy compression of the original trajectory data using selected features compressed into a concise memory representation by means of a Bloom filter. Given the importance of trajectory data, a linear-Time distance measure with all theoretical guarantees implied by a proper metric is very powerful if it is capturing enough detail for important trajectory mining tasks. This stack of feature extraction combined with feature embedding in a Bloom filter constitutes a lossy compression for trajectory data which can easily be extended with other discrete data like travel mode. In addition, this compression has the needed properties for efficient calculation of a normalized compression distance (NCD) which approximates Kolmogorov complexity. We evaluate this novel trajectory distance measurement using very simple features and k-nearest-neighbor classification on selected realworld datasets with remarkable classification accuracies. Furthermore, we argue that the distance measure is very suited to geospatial big data applications as each trajectory is first transformed into few bits using the lossy compression stack. At time of comparison, the original trajectory geometry is not needed, instead, the sketches suffice. Despite very compressible parameters (equal or less than 1024 bit per trajectory) and very simple features, we already reach classification accuracies for real-world trajectory classification tasks of more than 80% across various datasets.

Original languageEnglish
Title of host publicationProceedings - 2021 22nd IEEE International Conference on Mobile Data Management, MDM 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages169-174
Number of pages6
ISBN (Electronic)9781665428453
DOIs
StatePublished - Jun 2021
Externally publishedYes
Event22nd IEEE International Conference on Mobile Data Management, MDM 2021 - Virtual, Online
Duration: 15 Jun 202118 Jun 2021

Publication series

NameProceedings - IEEE International Conference on Mobile Data Management
Volume2021-June
ISSN (Print)1551-6245

Conference

Conference22nd IEEE International Conference on Mobile Data Management, MDM 2021
CityVirtual, Online
Period15/06/2118/06/21

Keywords

  • Bloom Filter
  • Compression Distance
  • Kolmogorov Complexity
  • Similarity Measure
  • Trajectories

Fingerprint

Dive into the research topics of 'Trajectory Similarity using Compression'. Together they form a unique fingerprint.

Cite this