Enhancing Trustworthiness in LLM-Generated Code: A Reinforcement Learning and Domain-Knowledge Constrained Approach

dc.contributor.authorPiplai, Aritran
dc.contributor.authorKotal, Anantaa
dc.contributor.authorMittal, Sudip
dc.contributor.authorJoshi, Karuna
dc.contributor.authorFinin, Tim
dc.contributor.authorJoshi, Anupam
dc.date.accessioned2026-03-26T14:26:55Z
dc.date.issued2025-02
dc.descriptionUMBC CODEBOT '25 Workshop, Columbia, MD , 25-26 February 2025
dc.description.abstractImagine analyzing a piece of code that uses the function ConnectToServer() withan encrypted string as its argument. A large language model (LLM), trained onextensive programming data, might flag the use of encryption as suspicious andgenerate an explanation suggesting that the function likely connects to a maliciousserver. While this explanation might seem plausible, it can often be unfaithful—itovergeneralizes based on statistical patterns from its training data without trulyunderstanding the context or validating its claims [8]. A REACT (Reasoning andActing) framework, which combines reasoning with action steps, is likely a betterapproach because it allows the LLM to propose actions—such as decrypting the stringor examining server connections—while reasoning about the results [7]. However,REACT still lacks a feedback mechanism to evaluate the effectiveness of thoseactions or iteratively refine the sequence based on empirical observations. Without such feedback, it risks falling short in dynamic scenarios, where the validation of predictions and adaptation to new evidence are critical [10].
dc.description.urihttps://ebiquity.umbc.edu/_file_directory_/papers/1428.pdf
dc.format.extent4 pages
dc.genreconference papers and proceedings
dc.genrepreprints
dc.identifier.urihttp://hdl.handle.net/11603/42288
dc.language.isoen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Center for Cybersecurity
dc.relation.ispartofUMBC Information Systems Department
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectUMBC Ebiquity Researh Group
dc.subjectUMBC Accelerated Cognitive Cybersecurity Laboratory
dc.subjectUMBC Cybersecurity Institute
dc.titleEnhancing Trustworthiness in LLM-Generated Code: A Reinforcement Learning and Domain-Knowledge Constrained Approach
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-6354-1686
dcterms.creatorhttps://orcid.org/0000-0002-6593-1792
dcterms.creatorhttps://orcid.org/0000-0002-8641-3193

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1428.pdf
Size:
188.95 KB
Format:
Adobe Portable Document Format