Approximation- and Quantization-Aware Training for Graph Neural Networks

Rodion Novkin, Florian Klemme, Hussam Amrouch

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Graph Neural Networks (GNNs) are one of the best-performing models for processing graph data. They are known to have considerable computational complexity, despite the smaller number of parameters compared to traditional Deep Neural Networks (DNNs). Operations-to-parameters ratio for GNNs can be tens and hundreds of times higher than for DNNs, depending on the input graph size. This complexity indicates the importance of arithmetic operation optimization within GNNs through model quantization and approximation. In this work, for the first time, we combine both approaches and implement quantization- and approximation-aware training for GNNs to sustain their accuracy under the errors induced by inexact multiplications. We employ matrix multiplication CUDA kernel to speed up the simulation of approximate multiplication within GNNs. Further, we demonstrate the execution speed, accuracy, and energy efficiency of GNNs with approximate multipliers in comparison with quantized low-bit GNNs. We evaluate the performance of state-of-the-art GNN architectures (i.e., GIN, SAGE, GCN, and GAT) on various datasets and tasks (i.e., Reddit-Binary, Collab for graph classification, Cora and PubMed for node classification) with a wide range of approximate multipliers. Our framework is available online: https://github.com/TUM-AIPro/AxC-GNN.

Original languageEnglish
Pages (from-to)599-612
Number of pages14
JournalIEEE Transactions on Computers
Volume73
Issue number2
DOIs
StatePublished - 1 Feb 2024

Keywords

  • Graph neural network
  • approximate computing
  • deep learning
  • quantization

Fingerprint

Dive into the research topics of 'Approximation- and Quantization-Aware Training for Graph Neural Networks'. Together they form a unique fingerprint.

Cite this