TY - GEN
T1 - Analysis on Temporal Dimension of Inputs for 3D Convolutional Neural Networks
AU - Köpüklü, Okan
AU - Rigoll, Gerhard
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/2
Y1 - 2018/7/2
N2 - 3D ConvNets provide a dedicated spatiotemporal representation in order to incorporate motion patterns within video frames. However, compared to 2D convolutions, the 3D convolution kernels increase the number of parameters in the architecture and the floating point operations during inference time, which are of critical importance for real-time applications requiring faster runtime. In this paper, we show a sparse sampling and stacking strategy to span large time intervals for 3D ConvNet architectures that can attain multiple times less inference time by relinquishing little amount of classification accuracy. The proposed approach is validated on action and gesture recognition tasks using two recent video datasets: Jester and Something-Something datasets.
AB - 3D ConvNets provide a dedicated spatiotemporal representation in order to incorporate motion patterns within video frames. However, compared to 2D convolutions, the 3D convolution kernels increase the number of parameters in the architecture and the floating point operations during inference time, which are of critical importance for real-time applications requiring faster runtime. In this paper, we show a sparse sampling and stacking strategy to span large time intervals for 3D ConvNet architectures that can attain multiple times less inference time by relinquishing little amount of classification accuracy. The proposed approach is validated on action and gesture recognition tasks using two recent video datasets: Jester and Something-Something datasets.
UR - http://www.scopus.com/inward/record.url?scp=85066326690&partnerID=8YFLogxK
U2 - 10.1109/IPAS.2018.8708895
DO - 10.1109/IPAS.2018.8708895
M3 - Conference contribution
AN - SCOPUS:85066326690
T3 - IEEE 3rd International Conference on Image Processing, Applications and Systems, IPAS 2018
SP - 79
EP - 84
BT - IEEE 3rd International Conference on Image Processing, Applications and Systems, IPAS 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd IEEE International Conference on Image Processing, Applications and Systems, IPAS 2018
Y2 - 12 December 2018 through 14 December 2018
ER -