TY - GEN
T1 - A tailored regression for learned indexes
T2 - 4th International Workshop on Exploiting Artificial Intelligence Techniques for Data Management, aiDM 2021
AU - Eppert, Martin
AU - Fent, Philipp
AU - Neumann, Thomas
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/6/20
Y1 - 2021/6/20
N2 - Although linear regressions are essential for learned index structures, most implementations use Simple Linear Regression, which optimizes the squared error. Since learned indexes use exponential search, regressions that optimize the logarithmic error are much better tailored for the use-case. By using this fitting optimization target, we can significantly improve learned index's lookup performance with no architectural changes. While the log-error is harder to optimize, our novel algorithms and optimization heuristics can bring a practical performance improvement of the lookup latency. Even in cases where fast build times are paramount, log-error regressions still provide a robust fallback for degenerated leaf models. The resulting regressions are much better suited for learned indexes, and speed up lookups on data sets with outliers by over a factor of 2.
AB - Although linear regressions are essential for learned index structures, most implementations use Simple Linear Regression, which optimizes the squared error. Since learned indexes use exponential search, regressions that optimize the logarithmic error are much better tailored for the use-case. By using this fitting optimization target, we can significantly improve learned index's lookup performance with no architectural changes. While the log-error is harder to optimize, our novel algorithms and optimization heuristics can bring a practical performance improvement of the lookup latency. Even in cases where fast build times are paramount, log-error regressions still provide a robust fallback for degenerated leaf models. The resulting regressions are much better suited for learned indexes, and speed up lookups on data sets with outliers by over a factor of 2.
UR - http://www.scopus.com/inward/record.url?scp=85109854071&partnerID=8YFLogxK
U2 - 10.1145/3464509.3464891
DO - 10.1145/3464509.3464891
M3 - Conference contribution
AN - SCOPUS:85109854071
T3 - Proceedings of the 4th International Workshop on Exploiting Artificial Intelligence Techniques for Data Management, aiDM 2021
SP - 9
EP - 15
BT - Proceedings of the 4th International Workshop on Exploiting Artificial Intelligence Techniques for Data Management, aiDM 2021
PB - Association for Computing Machinery, Inc
Y2 - 20 June 2021 through 25 June 2021
ER -