TY - JOUR
T1 - Selective Element and Two Orders Vectorization Networks for Automatic Depression Severity Diagnosis via Facial Changes
AU - Niu, Mingyue
AU - Zhao, Ziping
AU - Tao, Jianhua
AU - Li, Ya
AU - Schuller, Bjorn W.
N1 - Publisher Copyright:
© 1991-2012 IEEE.
PY - 2022/11/1
Y1 - 2022/11/1
N2 - Physiological studies have shown that healthy and depressed individuals present different facial changes. Thus, many researchers have attempted to use Convolutional Neural Networks (CNNs) to extract high-level facial dynamic representations for predicting depression severity. However, the max-pooling (or average-pooling) layers in the CNN lead to the loss of subtle depression cues. Without pooling layers, the CNN cannot extract multi-scale information and has difficulties for tensor vectorization. To this end, we propose a Selective Element and Two Orders Vectorization (SE-TOV) network. For the SE-TOV network, an SE block is constructed to adaptively select the effective elements from the tensors obtained by receptive fields of different sizes. Moreover, we propose a TOV block for vectorizing a high-dimensional tensor. On the one hand, TOV block inputs a tensor into the Global Average Pooling layer to obtain the first-order vectorization result. On the other hand, it takes principal components of the correlation matrix of channels in a tensor as the second-order vectorization result. Experimental results on AVEC 2013 (RMSE =7.42, MAE =6.09) and AVEC 2014 (RMSE =7.39, MAE =5.87) depression databases illustrate the superiority of our approach over previous works.
AB - Physiological studies have shown that healthy and depressed individuals present different facial changes. Thus, many researchers have attempted to use Convolutional Neural Networks (CNNs) to extract high-level facial dynamic representations for predicting depression severity. However, the max-pooling (or average-pooling) layers in the CNN lead to the loss of subtle depression cues. Without pooling layers, the CNN cannot extract multi-scale information and has difficulties for tensor vectorization. To this end, we propose a Selective Element and Two Orders Vectorization (SE-TOV) network. For the SE-TOV network, an SE block is constructed to adaptively select the effective elements from the tensors obtained by receptive fields of different sizes. Moreover, we propose a TOV block for vectorizing a high-dimensional tensor. On the one hand, TOV block inputs a tensor into the Global Average Pooling layer to obtain the first-order vectorization result. On the other hand, it takes principal components of the correlation matrix of channels in a tensor as the second-order vectorization result. Experimental results on AVEC 2013 (RMSE =7.42, MAE =6.09) and AVEC 2014 (RMSE =7.39, MAE =5.87) depression databases illustrate the superiority of our approach over previous works.
KW - Depression severity diagnosis
KW - facial changes
KW - representation sequence
KW - selective element block
KW - two orders vectorization block
UR - http://www.scopus.com/inward/record.url?scp=85132792089&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2022.3182658
DO - 10.1109/TCSVT.2022.3182658
M3 - Article
AN - SCOPUS:85132792089
SN - 1051-8215
VL - 32
SP - 8065
EP - 8077
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 11
ER -