A Compact and Efficient Neural Data Structure for Mutual Information Estimation in Large Timeseries

Fatemeh Farokhmanesh, Christoph Neuhauser, Rüdiger Westermann

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Database systems face challenges when using mutual information (MI) for analyzing non-linear relationships between large timeseries, due to computational and memory requirements. Interactive workflows are especially hindered by long response times. To address these challenges, we present timeseries neural MI fields (TNMIFs), a compact data structure that has been trained to reconstruct MI efficiently across various time-windows and window positions in large timeseries. We demonstrate learning and reconstruction with a large timeseries dataset comprising 1420 timeseries, each storing data at 1639 timesteps. While the learned data structure consumes only 45 megabytes, it answers queries for the MI estimates between the windows in a selected timeseries and the corresponding windows in all other timeseries within 44 milliseconds. Given a measure of similarity between timeseries based on windowed MI estimates, even the matrix showing all mutual timeseries similarities can be computed in less than 32 seconds. To support measuring dependence between lagged timeseries, an extended data structure learns to reconstruct MI to positively (future) and negatively (past) lagged windows. Using a maximum lag of 64 in both directions decreases query times by about a factor of 10.

Original languageEnglish
Title of host publicationScientific and Statistical Database Management
Subtitle of host publication36th International Conference, SSDBM 2024 - Proceedings
EditorsShadi Ibrahim, Suren Byna, Tristan Allard, Jay Lofstead, Amelie Chi Zhou, Tassadit Bouadi, Jalil Boukhobza, Diana Moise, Cedric Tedeschi, Jean Luca Bez
PublisherAssociation for Computing Machinery
ISBN (Electronic)9798400710209
DOIs
StatePublished - 10 Jul 2024
Event36th International Conference on Scientific and Statistical Database Management, SSDBM 2024 - Rennes, France
Duration: 10 Jul 202412 Jul 2024

Publication series

NameACM International Conference Proceeding Series

Conference

Conference36th International Conference on Scientific and Statistical Database Management, SSDBM 2024
Country/TerritoryFrance
CityRennes
Period10/07/2412/07/24

Keywords

  • Mutual Information
  • Neural Data Structures
  • Timeseries Analysis

Fingerprint

Dive into the research topics of 'A Compact and Efficient Neural Data Structure for Mutual Information Estimation in Large Timeseries'. Together they form a unique fingerprint.

Cite this