Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining

Ugur Sahin, Hang Li, Qadeer Khan, Daniel Cremers, Volker Tresp

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Fingerprint

Dive into the research topics of 'Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining'. Together they form a unique fingerprint.

Keyphrases

Arts and Humanities

Computer Science