Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition

Chi Jin, Tiancheng Jin, Haipeng Luo, Suvrit Sra, Tiancheng Yu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

45 Scopus citations

Fingerprint

Dive into the research topics of 'Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition'. Together they form a unique fingerprint.

Keyphrases

Computer Science