One Sentence Can Kill the Bug: Auto-Replay Mobile App Crashes From One-Sentence Overviews

Yuchao Huang, Junjie Wang, Zhe Liu, Mingyang Li, Song Wang, Chunyang Chen, Yuanzhe Hu, Qing Wang

Research output: Contribution to journal › Article › peer-review

Abstract

Crash reports play a crucial role in software maintenance because they inform developers about the issues encountered in mobile applications. Developers must reproduce a reported crash before fixing it, which is extremely time-consuming and tedious. Existing studies have focused on automatic crash reproduction from step-by-step instructions. However, a non-negligible portion of crash reports provide only a one-sentence overview, which merely describes the final crash-triggering action. These reports require developers to invest more effort in understanding and fixing the issues, and existing techniques cannot handle them because they lack step-by-step guidance, so the need for automatic support is all the greater. Leveraging the capability of Large Language Models (LLMs) to combine acting and reasoning, we propose ReActDroid, an automated approach that reproduces mobile application crashes directly from the crash overview. ReActDroid uses ReAct prompting to augment the app-specific knowledge and exploration history, enabling the LLM to derive the steps necessary to trigger the crash from a comprehensive and historical perspective. We evaluate ReActDroid on 102 crash reports from 69 popular Android apps and successfully reproduce 57.8% of the crashes, surpassing state-of-the-art baselines by 69% to 321%. In addition, the average reproduction time is 51.8 seconds, outperforming the baselines by 73% to 949%. We also evaluate the usefulness of ReActDroid, with promising results.

Original language: English
Pages (from-to): 975-989
Number of pages: 15
Journal: IEEE Transactions on Software Engineering
Volume: 51
Issue number: 4
State: Published - 2025

Keywords

  • Mobile application testing
  • issue report
  • large language model
