TY - GEN
T1 - Interactive RGB Image Segmentation via Depth-modified Click Encoding and Estimated Depth
AU - Kaynar, Furkan
AU - Michl, Adrian
AU - Steinbach, Eckehard
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Interactive image segmentation separates the object of interest in the scene from the background, using the help of human annotations. Click-based interactive segmentation methods typically receive positive and negative clicks from the user indicating the object of interest and the background. These clicks are encoded into click-maps to be processed further by a neural network. Although the depth information is known to improve the image segmentation accuracy, the previous work encodes only the locations of the clicks into the click-maps. We propose two novel click-map generation methods that modify the conventional click-maps using relative depth information. This depth information is estimated from the monocular RGB image. After retraining the baseline interactive segmentation method with our novel click-maps, the segmentation accuracy improved without requiring any additional input or increasing the network size. Experimental evaluations showed that our method yields a better mean segmentation accuracy on the Berkeley and DAVIS datasets than the baseline using conventional click-maps, and a comparable performance on the GrabCut dataset.
AB - Interactive image segmentation separates the object of interest in the scene from the background, using the help of human annotations. Click-based interactive segmentation methods typically receive positive and negative clicks from the user indicating the object of interest and the background. These clicks are encoded into click-maps to be processed further by a neural network. Although the depth information is known to improve the image segmentation accuracy, the previous work encodes only the locations of the clicks into the click-maps. We propose two novel click-map generation methods that modify the conventional click-maps using relative depth information. This depth information is estimated from the monocular RGB image. After retraining the baseline interactive segmentation method with our novel click-maps, the segmentation accuracy improved without requiring any additional input or increasing the network size. Experimental evaluations showed that our method yields a better mean segmentation accuracy on the Berkeley and DAVIS datasets than the baseline using conventional click-maps, and a comparable performance on the GrabCut dataset.
KW - Interactive Segmentation
KW - RGB-D Segmentation
KW - Segmentation with Estimated Depth Image
KW - User Click Encoding
UR - http://www.scopus.com/inward/record.url?scp=85147551466&partnerID=8YFLogxK
U2 - 10.1109/ISM55400.2022.00008
DO - 10.1109/ISM55400.2022.00008
M3 - Conference contribution
AN - SCOPUS:85147551466
T3 - Proceedings - 2022 IEEE International Symposium on Multimedia, ISM 2022
SP - 10
EP - 17
BT - Proceedings - 2022 IEEE International Symposium on Multimedia, ISM 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 24th IEEE International Symposium on Multimedia, ISM 2022
Y2 - 5 December 2022 through 7 December 2022
ER -