LLPut: Investigating Large Language Models for Bug Report-Based Input Generation

dc.contributor.authorHasan, Alif Al
dc.contributor.authorSaha, Subarna
dc.contributor.authorImran, Mia Mohammad
dc.contributor.authorZaman, Tarannum Shaila
dc.date.accessioned2025-06-05T14:02:41Z
dc.date.available2025-06-05T14:02:41Z
dc.date.issued2025-03-28
dc.description.abstractFailure-inducing inputs play a crucial role in diagnosing and analyzing software bugs. Bug reports typically contain these inputs, which developers extract to facilitate debugging. Since bug reports are written in natural language, prior research has leveraged various Natural Language Processing (NLP) techniques for automated input extraction. With the advent of Large Language Models (LLMs), an important research question arises: how effectively can generative LLMs extract failure-inducing inputs from bug reports? In this paper, we propose LLPut, a technique to empirically evaluate the performance of three open-source generative LLMs -- LLaMA, Qwen, and Qwen-Coder -- in extracting relevant inputs from bug reports. We conduct an experimental evaluation on a dataset of 206 bug reports to assess the accuracy and effectiveness of these models. Our findings provide insights into the capabilities and limitations of generative LLMs in automated bug diagnosis.
dc.description.sponsorshipThis work was supported in part by NSF grants CCF2348277 and CCF2518445
dc.description.urihttp://arxiv.org/abs/2503.20578
dc.format.extent8 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m2ymbn-ka55
dc.identifier.urihttps://doi.org/10.48550/arXiv.2503.20578
dc.identifier.urihttp://hdl.handle.net/11603/38571
dc.language.isoen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department
dc.relation.ispartofUMBC Faculty Collection
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/deed.en
dc.subjectComputer Science - Software Engineering
dc.titleLLPut: Investigating Large Language Models for Bug Report-Based Input Generation
dc.typeText

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
250320578v3.pdf
Size:
466.22 KB
Format:
Adobe Portable Document Format