TY - GEN
T1 - Porting Deep Spiking Q-Networks to neuromorphic chip Loihi
AU - Akl, Mahmoud
AU - Sandamirskaya, Yulia
AU - Walter, Florian
AU - Knoll, Alois
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/7/27
Y1 - 2021/7/27
N2 - Deep neural networks (DNNs) set the benchmark in many tasks in perception and control. Spiking versions of DNNs, implemented on neuromorphic hardware can enable orders of magnitude lower power consumption and low latency during network use. In this paper, we explore behavior and generalization capability of spiking, quantized spiking, and hardware implementation of deep Q-networks in two classical reinforcement learning tasks. We found that spiking neural networks have slightly decreased performance compared to non-spiking network, but we can avoid performance degradation from quantization and in-chip implementation. We conclude that since hardware implementation leads to lower power consumption and low latency, neuromorphic approach is a promising avenue for deep Q-learning. Furthermore, online learning, enabled in neuromorphic chips, can be used to compensate for the performance decrease in environments with parameter variations.
AB - Deep neural networks (DNNs) set the benchmark in many tasks in perception and control. Spiking versions of DNNs, implemented on neuromorphic hardware can enable orders of magnitude lower power consumption and low latency during network use. In this paper, we explore behavior and generalization capability of spiking, quantized spiking, and hardware implementation of deep Q-networks in two classical reinforcement learning tasks. We found that spiking neural networks have slightly decreased performance compared to non-spiking network, but we can avoid performance degradation from quantization and in-chip implementation. We conclude that since hardware implementation leads to lower power consumption and low latency, neuromorphic approach is a promising avenue for deep Q-learning. Furthermore, online learning, enabled in neuromorphic chips, can be used to compensate for the performance decrease in environments with parameter variations.
KW - Spiking neural networks
KW - neuromorphic hardware
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85117892054&partnerID=8YFLogxK
U2 - 10.1145/3477145.3477159
DO - 10.1145/3477145.3477159
M3 - Conference contribution
AN - SCOPUS:85117892054
T3 - ACM International Conference Proceeding Series
BT - ICONS 2021 - Proceedings of International Conference on Neuromorphic Systems 2021
PB - Association for Computing Machinery
T2 - 2021 International Conference on Neuromorphic Systems, ICONS 2021
Y2 - 27 July 2021 through 29 July 2021
ER -