CellularSpecSec-Bench: A Staged Benchmark for Evidence-Grounded Interpretation and Security Reasoning over 3GPP Specifications
| dc.contributor.author | Xie, Ke | |
| dc.contributor.author | Zhao, Xingyi | |
| dc.contributor.author | Hu, Yiwen | |
| dc.contributor.author | Yuan, Shuhan | |
| dc.contributor.author | Xie, Tian | |
| dc.date.accessioned | 2026-02-12T16:43:44Z | |
| dc.date.issued | 2026-01-19 | |
| dc.description.abstract | Cellular networks are critical infrastructure supporting billions of worldwide users and safety- and mission-critical services. Vulnerabilities in cellular networks can therefore cause service disruption, privacy breaches, and broad societal harm, motivating growing efforts to analyze 3GPP specifications that define required device and operator behavior. While large language models (LLMs) have demonstrated the capability for reading technical documents, cellular specifications impose unique challenges: faithful interpretation of normative language, reasoning across cross-referenced clauses, and verifiable conclusions grounded in multimodal evidence such as tables and figures. To address these challenges, we propose CellSpecSec-ARI, a unified Adapt-Retrieve-Integrate framework for systematic understanding and standard-driven security analysis of 3GPP specifications; CellularSpecSec-Bench, a staged benchmark, containing newly constructed high-quality datasets with expert-verified and corrected subsets from prior open-source resources. Together, they establish an accessible and reproducible foundation for quantifying progress in specification understanding and security reasoning in the cellular network security domain. | |
| dc.description.uri | http://arxiv.org/abs/2601.12716 | |
| dc.format.extent | 25 pages | |
| dc.genre | journal articles | |
| dc.genre | preprints | |
| dc.identifier | doi:10.13016/m2nmgv-73ue | |
| dc.identifier.uri | https://doi.org/10.48550/arXiv.2601.12716 | |
| dc.identifier.uri | http://hdl.handle.net/11603/41847 | |
| dc.language.iso | en | |
| dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
| dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department | |
| dc.relation.ispartof | UMBC Faculty Collection | |
| dc.rights | This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. | |
| dc.subject | Computer Science - Cryptography and Security | |
| dc.subject | UMBC Cyber Defense Lab (CDL) | |
| dc.subject | UMBC Cybersecruity Institute | |
| dc.subject | UMBC Cybersecruity Institute | |
| dc.subject | UMBC Cyber Defense Lab (CDL) | |
| dc.title | CellularSpecSec-Bench: A Staged Benchmark for Evidence-Grounded Interpretation and Security Reasoning over 3GPP Specifications | |
| dc.type | Text | |
| dcterms.creator | https://orcid.org/0000-0002-8790-5579 |
Files
Original bundle
1 - 1 of 1
