TY - GEN
T1 - Cost optimization at early stages of design using deep reinforcement learning
AU - Servadei, Lorenzo
AU - Zheng, Jiapeng
AU - Arjona-Medina, José
AU - Werner, Michael
AU - Esen, Volkan
AU - Hochreiter, Sepp
AU - Ecker, Wolfgang
AU - Wille, Robert
N1 - Publisher Copyright:
© 2020 Association for Computing Machinery.
PY - 2020/11/16
Y1 - 2020/11/16
N2 - With the increase in the complexity of the modern System on Chips (SoCs) and the demand for a lower time-to-market, automation becomes essential in hardware design. This is particularly relevant in complex/time-consuming tasks, as the optimization of design cost for a hardware component. Design cost, in fact, may depend on several objectives, as for the hardware-software trade-off. Given the complexity of this task, the designer often has no means to perform a fast and effective optimization-in particular for larger and complex designs. In this paper, we introduce Deep Reinforcement Learning (DRL) for design cost optimization at the early stages of the design process. We first show that DRL is a perfectly suitable solution for the problem at hand. Afterwards, by means of a Pointer Network, a neural network specifically applied for combinatorial problems, we benchmark three DRL algorithms towards the selected problem. Results obtained in different settings show the improvements achieved by DRL algorithms compared to conventional optimization methods. Additionally, by using reward redistribution proposed in the recently introduced RUDDER method, we obtain significant improvements in complex designs.
AB - With the increase in the complexity of the modern System on Chips (SoCs) and the demand for a lower time-to-market, automation becomes essential in hardware design. This is particularly relevant in complex/time-consuming tasks, as the optimization of design cost for a hardware component. Design cost, in fact, may depend on several objectives, as for the hardware-software trade-off. Given the complexity of this task, the designer often has no means to perform a fast and effective optimization-in particular for larger and complex designs. In this paper, we introduce Deep Reinforcement Learning (DRL) for design cost optimization at the early stages of the design process. We first show that DRL is a perfectly suitable solution for the problem at hand. Afterwards, by means of a Pointer Network, a neural network specifically applied for combinatorial problems, we benchmark three DRL algorithms towards the selected problem. Results obtained in different settings show the improvements achieved by DRL algorithms compared to conventional optimization methods. Additionally, by using reward redistribution proposed in the recently introduced RUDDER method, we obtain significant improvements in complex designs.
KW - Design Automation
KW - Hardware-Software co-design
KW - Machine Learning
KW - Reinforcement Learning
UR - http://www.scopus.com/inward/record.url?scp=85098282230&partnerID=8YFLogxK
U2 - 10.1145/3380446.3430619
DO - 10.1145/3380446.3430619
M3 - Conference contribution
AN - SCOPUS:85098282230
T3 - MLCAD 2020 - Proceedings of the 2020 ACM/IEEE Workshop on Machine Learning for CAD
SP - 37
EP - 42
BT - MLCAD 2020 - Proceedings of the 2020 ACM/IEEE Workshop on Machine Learning for CAD
PB - Association for Computing Machinery, Inc
T2 - 2nd ACM/IEEE Workshop on Machine Learning for CAD, MLCAD 2020
Y2 - 16 November 2020 through 20 November 2020
ER -