TY - JOUR
T1 - Diba
T2 - A Re-Configurable Stream Processor
AU - Najafi, Mohammadreza
AU - Qadah, Thamir M.
AU - Sadoghi, Mohammad
AU - Jacobsen, Hans Arno
N1 - Publisher Copyright:
IEEE
PY - 2024
Y1 - 2024
N2 - Stream processing acceleration is driven by the continuously increasing volume and velocity of data generated on the Web and the limitations of storage, computation, and power consumption. Hardware solutions provide better performance and power consumption, but they are hindered by the high research and development costs and the long time to market. In this work, we propose our re-configurable stream processor (Diba), a complete rethinking of a previously proposed customized and flexible query processor that targets real-time stream processing. Diba uses a unidirectional dataflow not dedicated to any specific type of query (operator) on streams, allowing a straightforward placement of processing components on a general data path that facilitates query mapping. In Diba, the concepts of the distribution network and processing components are implemented as two separate entities connected using generic interfaces. This approach allows the adoption of a versatile architecture for a family of queries rather than forcing a rigid chain of processing components to implement such queries. Our experimental evaluations of representative queries from TPC-H yielded processing times of 300, 1220, and 3520 milliseconds for data streams with scale factor sizes of one, four, and ten gigabytes, respectively.
AB - Stream processing acceleration is driven by the continuously increasing volume and velocity of data generated on the Web and the limitations of storage, computation, and power consumption. Hardware solutions provide better performance and power consumption, but they are hindered by the high research and development costs and the long time to market. In this work, we propose our re-configurable stream processor (Diba), a complete rethinking of a previously proposed customized and flexible query processor that targets real-time stream processing. Diba uses a unidirectional dataflow not dedicated to any specific type of query (operator) on streams, allowing a straightforward placement of processing components on a general data path that facilitates query mapping. In Diba, the concepts of the distribution network and processing components are implemented as two separate entities connected using generic interfaces. This approach allows the adoption of a versatile architecture for a family of queries rather than forcing a rigid chain of processing components to implement such queries. Our experimental evaluations of representative queries from TPC-H yielded processing times of 300, 1220, and 3520 milliseconds for data streams with scale factor sizes of one, four, and ten gigabytes, respectively.
KW - Computer architecture
KW - Dataflow architecture
KW - Engines
KW - Field programmable gate arrays
KW - Hardware acceleration
KW - Libraries
KW - Pipelines
KW - Query processing
KW - hardware architecture
KW - multiple data stream architecture
UR - http://www.scopus.com/inward/record.url?scp=85189520386&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2024.3381192
DO - 10.1109/TKDE.2024.3381192
M3 - Article
AN - SCOPUS:85189520386
SN - 1041-4347
SP - 1
EP - 15
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
ER -