In-database machine learning: Gradient descent and tensor algebra for main memory database systems

Maximilian Schüle, Frédéric Simonis, Thomas Heyenbrock, Alfons Kemper, Stephan Günnemann, Thomas Neumann

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

Machine learning tasks such as regression, clustering, and classification are typically performed outside of database systems using dedicated tools, necessitating the extraction, transformation, and loading of data. We argue that database systems when extended to enable automatic differentiation, gradient descent, and tensor algebra are capable of solving machine learning tasks more efficiently by eliminating the need for costly data communication. We demonstrate our claim by implementing tensor algebra and stochastic gradient descent using lambda expressions for loss functions as a pipelined operator in a main memory database system. Our approach enables common machine learning tasks to be performed faster than by extended disk-based database systems or as well as dedicated tools by eliminating the time needed for data extraction. This work aims to incorporate gradient descent and tensor data types into database systems, allowing them to handle a wider range of computational tasks.

Original languageEnglish
Title of host publicationDatenbanksysteme fur Business, Technologie und Web, BTW 2019 and 18. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2019
EditorsTorsten Grust, Felix Naumann, Alexander Bohm, Wolfgang Lehner, Theo Harder, Erhard Rahm, Andreas Heuer, Meike Klettke, Holger Meyer
PublisherGesellschaft fur Informatik (GI)
Pages247-266
Number of pages20
ISBN (Electronic)9783885796831
DOIs
StatePublished - 2019
EventDatenbanksysteme fur Business, Technologie und Web, BTW 2019 and 18. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2019 - Database Systems for Business, Technology and Web, BTW 2019 and 18th Symposium of the GI Department "Databases and Information Systems", DBIS 2019 - Rostock, Germany
Duration: 4 Mar 20198 Mar 2019

Publication series

NameLecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft fur Informatik (GI)
VolumeP-289
ISSN (Print)1617-5468
ISSN (Electronic)2944-7682

Conference

ConferenceDatenbanksysteme fur Business, Technologie und Web, BTW 2019 and 18. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2019 - Database Systems for Business, Technology and Web, BTW 2019 and 18th Symposium of the GI Department "Databases and Information Systems", DBIS 2019
Country/TerritoryGermany
CityRostock
Period4/03/198/03/19

Fingerprint

Dive into the research topics of 'In-database machine learning: Gradient descent and tensor algebra for main memory database systems'. Together they form a unique fingerprint.

Cite this