A Transfer Learning Approach to Cross-Modal Object Recognition: From Visual Observation to Robotic Haptic Exploration

Pietro Falco, Shuang Lu, Ciro Natale, Salvatore Pirozzi, Dongheui Lee

Research output: Contribution to journalArticlepeer-review

34 Scopus citations

Abstract

In this paper, we introduce the problem of cross-modal visuo-tactile object recognition with robotic active exploration. With this term, we mean that the robot observes a set of objects with visual perception, and later on, it is able to recognize such objects only with tactile exploration, without having touched any object before. Using a machine learning terminology, in our application, we have a visual training set and a tactile test set, or vice versa. To tackle this problem, we propose an approach constituted by four steps: finding a visuo-tactile common representation, defining a suitable set of features, transferring the features across the domains, and classifying the objects. We show the results of our approach using a set of 15 objects, collecting 40 visual examples and five tactile examples for each object. The proposed approach achieves an accuracy of 94.7%, which is comparable with the accuracy of the monomodal case, i.e., when using visual data both as training set and test set. Moreover, it performs well compared to the human ability, which we have roughly estimated carrying out an experiment with ten participants.

Original languageEnglish
Article number8744477
Pages (from-to)987-998
Number of pages12
JournalIEEE Transactions on Robotics
Volume35
Issue number4
DOIs
StatePublished - Aug 2019
Externally publishedYes

Keywords

  • Cross-modal object recognition
  • robotic manipulation
  • tactile perception
  • visual perception

Fingerprint

Dive into the research topics of 'A Transfer Learning Approach to Cross-Modal Object Recognition: From Visual Observation to Robotic Haptic Exploration'. Together they form a unique fingerprint.

Cite this