TY - GEN
T1 - Images Speak Volumes
T2 - 3rd Workshop on Text Simplification, Accessibility and Readability, TSAR 2024
AU - Anschütz, Miriam
AU - Sylaj, Tringa
AU - Groh, Georg
N1 - Publisher Copyright:
© 2024 Association for Computational Linguistics.
PY - 2024
Y1 - 2024
N2 - Explanatory images play a pivotal role in accessible and easy-to-read (E2R) texts. However, the images available in online databases are not tailored toward the respective texts, and the creation of customized images is expensive. In this large-scale study, we investigated whether text-to-image generation models can close this gap by providing customizable images quickly and easily. We benchmarked seven, four open- and three closed-source, image generation models and provide an extensive evaluation of the resulting images. In addition, we performed a user study with people from the E2R target group to examine whether the images met their requirements. We find that some of the models show remarkable performance, but none of the models are ready to be used at a larger scale without human supervision. Our research is an important step toward facilitating the creation of accessible information for E2R creators and tailoring accessible images to the target group’s needs.
AB - Explanatory images play a pivotal role in accessible and easy-to-read (E2R) texts. However, the images available in online databases are not tailored toward the respective texts, and the creation of customized images is expensive. In this large-scale study, we investigated whether text-to-image generation models can close this gap by providing customizable images quickly and easily. We benchmarked seven, four open- and three closed-source, image generation models and provide an extensive evaluation of the resulting images. In addition, we performed a user study with people from the E2R target group to examine whether the images met their requirements. We find that some of the models show remarkable performance, but none of the models are ready to be used at a larger scale without human supervision. Our research is an important step toward facilitating the creation of accessible information for E2R creators and tailoring accessible images to the target group’s needs.
UR - http://www.scopus.com/inward/record.url?scp=85217083834&partnerID=8YFLogxK
U2 - 10.18653/v1/2024.tsar-1.4
DO - 10.18653/v1/2024.tsar-1.4
M3 - Conference contribution
AN - SCOPUS:85217083834
T3 - TSAR 2024 - 3rd Workshop on Text Simplification, Accessibility and Readability, Proceedings of the Workshop
SP - 27
EP - 40
BT - TSAR 2024 - 3rd Workshop on Text Simplification, Accessibility and Readability, Proceedings of the Workshop
A2 - Shardlow, Matthew
A2 - Saggion, Horacio
A2 - Alva-Manchego, Fernando
A2 - Zampieri, Marcos
A2 - North, Kai
A2 - Stajner, Sanja
A2 - Stodden, Regina
PB - Association for Computational Linguistics (ACL)
Y2 - 15 November 2024
ER -