Emotional evaluation of bandit problems

Johannes Feldmaier, Klaus Diepold

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we discuss an approach to evaluate decisions made during a multi-armed bandit learning experiment. Usually, the results of machine learning algorithms applied on multi-armed bandit scenarios are rated in terms of earned reward and optimal decisions taken. These criteria are valuable for objective comparison in finite experiments. But learning algorithms used in real scenarios, for example in robotics, need to have instantaneous criteria to evaluate their actual decisions taken. To overcome this problem, in our approach each decision updates the Zürich model which emulates the human sense of feeling secure and aroused. Combining these two feelings results in an emotional evaluation of decision policies and could be used to model the emotional state of an intelligent agent.

Original languageEnglish
Title of host publication22nd IEEE International Symposium on Robot and Human Interactive Communication
Subtitle of host publication"Living Together, Enjoying Together, and Working Together with Robots!", IEEE RO-MAN 2013
Pages149-154
Number of pages6
DOIs
StatePublished - 2013
Event22nd IEEE International Symposium on Robot and Human Interactive Communication: "Living Together, Enjoying Together, and Working Together with Robots!", IEEE RO-MAN 2013 - Gyeongju, Korea, Republic of
Duration: 26 Aug 201329 Aug 2013

Publication series

NameProceedings - IEEE International Workshop on Robot and Human Interactive Communication

Conference

Conference22nd IEEE International Symposium on Robot and Human Interactive Communication: "Living Together, Enjoying Together, and Working Together with Robots!", IEEE RO-MAN 2013
Country/TerritoryKorea, Republic of
CityGyeongju
Period26/08/1329/08/13

Fingerprint

Dive into the research topics of 'Emotional evaluation of bandit problems'. Together they form a unique fingerprint.

Cite this