AtlasHDF: An Efficient Big Data Framework for GeoAI

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

The last decade witnesses a fast development in geospatial application of artificial intelligence (GeoAI). However, due to the misalignment with wider computer science progresses, the geospatial community, for a long time, keeps working with powerful and over-sophisticated tools and software, whose functionality goes far beyond the actual basic need of GeoAI tasks. This fact, to a certain extent, hinders our steps towards establishing future sustainable and replicable GeoAI models. In this paper, we aim to address this challenge by introducing an efficient big data framework based on the modern HDF5 technology, called AtlasHDF, in which we designed lossless data mappings (immediate mapping and analysis-ready mapping) from OpenStreetMap (OSM) vector data into a single HDF5 data container to facilitate fast and flexible GeoAI applications learnt from OSM data. Since the HDF5 is included as a default dependency in most GeoAI and high performance computing (HPC) environments, the proposed AtlasHDF provides a cross-platformm and single-techonology solution of handling heterogeneous big geodata for GeoAI. As a case study, we conducted a comparative analysis of the AtlasHDF framework with three commonly-used data formats (i.e., PBF, Shapefile and GeoPackage) using the latest OSM data from the city of Berlin (Germany), then elaborated on the advantages of each data format w.r.t file size, querying, rending, dependency, data extendability. Given a wide range of GeoAI tasks that can potentially benefit from our framework, our future work will focus on extending the framework to heterogeneous big geodata (vector and raster) to support seamless and fast data integration without any geospatial software dependency until the training stage of GeoAI. A reference implementation of the framework developed in this paper is provided to the public at: https://github.com/tumbgd/hdf4water.

Original languageEnglish
Title of host publicationProceedings of the 10th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2022
EditorsAshwin Shashidharan, Krishna Karthik Gadiraju, Varun Chandola, Ranga Raju Vatsavai
PublisherAssociation for Computing Machinery, Inc
Pages1-7
Number of pages7
ISBN (Electronic)9781450395311
DOIs
StatePublished - 1 Nov 2022
Event10th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2022 - Seattle, United States
Duration: 1 Nov 2022 → …

Publication series

NameProceedings of the 10th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2022

Conference

Conference10th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2022
Country/TerritoryUnited States
CitySeattle
Period1/11/22 → …

Keywords

  • GeoAI
  • OpenStreetMap
  • big data
  • hierarchical data format
  • immediate mapping

Fingerprint

Dive into the research topics of 'AtlasHDF: An Efficient Big Data Framework for GeoAI'. Together they form a unique fingerprint.

Cite this