Skip to main navigation Skip to search Skip to main content

Scaling Graph Neural Networks with Approximate PageRank

  • Aleksandar Bojchevski
  • , Johannes Klicpera
  • , Bryan Perozzi
  • , Amol Kapoor
  • , Martin Blais
  • , Benedek Rózemberczki
  • , Michal Lukasik
  • , Stephan Günnemann
  • Technical University of Munich
  • Google Inc

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

258 Scopus citations

Abstract

Graph neural networks (GNNs) have emerged as a powerful approach for solving many network mining tasks. However, learning on large graphs remains a challenge - many recently proposed scalable GNN approaches rely on an expensive message-passing procedure to propagate information through the graph. We present the PPRGo model which utilizes an efficient approximation of information diffusion in GNNs resulting in significant speed gains while maintaining state-of-the-art prediction performance. In addition to being faster, PPRGo is inherently scalable, and can be trivially parallelized for large datasets like those found in industry settings. We demonstrate that PPRGo outperforms baselines in both distributed and single-machine training environments on a number of commonly used academic graphs. To better analyze the scalability of large-scale graph learning methods, we introduce a novel benchmark graph with 12.4 million nodes, 173 million edges, and 2.8 million node features. We show that training PPRGo from scratch and predicting labels for all nodes in this graph takes under 2 minutes on a single machine, far outpacing other baselines on the same graph. We discuss the practical application of PPRGo to solve large-scale node classification problems at Google.

Original languageEnglish
Title of host publicationKDD 2020 - Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
PublisherAssociation for Computing Machinery
Pages2464-2473
Number of pages10
ISBN (Electronic)9781450379984
DOIs
StatePublished - 23 Aug 2020
Event26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2020 - Virtual, Online, United States
Duration: 23 Aug 202027 Aug 2020

Publication series

NameProceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Conference

Conference26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2020
Country/TerritoryUnited States
CityVirtual, Online
Period23/08/2027/08/20

Keywords

  • graph neural networks
  • personalized pagerank
  • scalability

Fingerprint

Dive into the research topics of 'Scaling Graph Neural Networks with Approximate PageRank'. Together they form a unique fingerprint.

Cite this