GloBiMaps-a probabilistic data structure for in-memory processing of global raster datasets

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

In the last decade, more and more spatial data has been acquired on a global scale due to satellite missions, social media, and coordinated governmental activities. This observational data suffers from huge storage footprints and makes global analysis challenging. Therefore, many information products have been designed in which observations are turned into global maps showing features such as land cover or land use, often with only a few discrete values and sparse spatial coverage like only within cities. Traditional coding of such data as a raster image becomes challenging due to the sizes of the datasets and spatially non-local access patterns, for example, when labeling social media streams. This paper proposes GloBiMap, a randomized data structure, based on Bloom filters, for modeling low-cardinality sparse raster images of excessive sizes in a configurable amount of memory with pure random access operations avoiding costly intermediate decompression. In addition, the data structure is designed to correct the inevitable errors of the randomized layer in order to have a fully exact representation. We show the feasibility of the approach on several real-world data sets including the Global Urban Footprint in which each pixel denotes whether a particular location contains a building at a resolution of roughly 10cm globally as well as on a global Twitter sample of more than 220 million precisely geolocated tweets.

Original languageEnglish
Title of host publication27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2019
EditorsFarnoush Banaei-Kashani, Goce Trajcevski, Ralf Hartmut Guting, Lars Kulik, Shawn Newsam
PublisherAssociation for Computing Machinery
Pages3-12
Number of pages10
ISBN (Electronic)9781450369091
DOIs
StatePublished - 5 Nov 2019
Externally publishedYes
Event27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2019 - Chicago, United States
Duration: 5 Nov 20198 Nov 2019

Publication series

NameGIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems

Conference

Conference27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2019
Country/TerritoryUnited States
CityChicago
Period5/11/198/11/19

Keywords

  • Data Sparsity and Compression
  • Geographic Information Systems
  • Image Representation
  • Randomized Data Structures

Fingerprint

Dive into the research topics of 'GloBiMaps-a probabilistic data structure for in-memory processing of global raster datasets'. Together they form a unique fingerprint.

Cite this