Knowledge Transfer for Label-Efficient Monocular Height Estimation

Zhitong Xiong, Xiao Xiang Zhu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Estimating height from monocular remote sensing images is one of the most efficient ways for building large-scale 3D city models. However, existing deep learning based methods usu-ally require a large amount of training data, which could be cost-consuming or even not possible to obtain. Towards a label-efficient deep learning model, we propose a new task and dataset for weak-shot monocular height estimation. In this task, only the relative height labels between pairs of a small portion of points are given, which is cheaper and more friendly for humans to annotate. In addition, to enhance the model performance under the sparse and weak-shot super-vision, we propose a Transformer-based network for trans-ferring the learned knowledge from a large-scale synthetic dataset to real-world data. Experimental results have shown the effectiveness of the proposed method on a public dataset under the sparse and weak supervision.

Original languageEnglish
Title of host publicationIGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages5377-5380
Number of pages4
ISBN (Electronic)9781665427920
DOIs
StatePublished - 2022
Event2022 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2022 - Kuala Lumpur, Malaysia
Duration: 17 Jul 202222 Jul 2022

Publication series

NameInternational Geoscience and Remote Sensing Symposium (IGARSS)
Volume2022-July

Conference

Conference2022 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2022
Country/TerritoryMalaysia
CityKuala Lumpur
Period17/07/2222/07/22

Keywords

  • Monocular height estimation
  • relation modeling
  • transfer learning
  • weakly-supervised Learning

Fingerprint

Dive into the research topics of 'Knowledge Transfer for Label-Efficient Monocular Height Estimation'. Together they form a unique fingerprint.

Cite this