DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for Efficient Large Reasoning Models

dc.contributor.author: Jiang, Yuxuan
dc.contributor.author: Li, Dawei
dc.contributor.author: Ferraro, Francis
dc.date.accessioned: 2025-06-17T14:45:32Z
dc.date.available: 2025-06-17T14:45:32Z
dc.date.issued: 2025-05-20
dc.description.abstract: While Large Reasoning Models (LRMs) have demonstrated success in complex reasoning tasks through long chain-of-thought (CoT) reasoning, their inference often involves excessively verbose reasoning traces, resulting in substantial inefficiency. To address this, we propose Distilled Reasoning Pruning (DRP), a hybrid framework that combines inference-time pruning with tuning-based distillation, two widely used strategies for efficient reasoning. DRP uses a teacher model to perform skill-aware step decomposition and content pruning, and then distills the pruned reasoning paths into a student model, enabling it to reason both efficiently and accurately. Across several challenging mathematical reasoning datasets, we find that models trained with DRP achieve substantial improvements in token efficiency without sacrificing accuracy. Specifically, DRP reduces average token usage on GSM8K from 917 to 328 while improving accuracy from 91.7% to 94.1%, and achieves a 43% token reduction on AIME with no performance drop. Further analysis shows that aligning the reasoning structure of training CoTs with the student's reasoning capacity is critical for effective knowledge transfer and performance gains.
dc.description.uri: http://arxiv.org/abs/2505.13975
dc.format.extent: 14 pages
dc.genre: journal articles
dc.genre: preprints
dc.identifier: doi:10.13016/m2g343-km2p
dc.identifier.uri: https://doi.org/10.48550/arXiv.2505.13975
dc.identifier.uri: http://hdl.handle.net/11603/38907
dc.language.iso: en_US
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Faculty Collection
dc.relation.ispartof: UMBC Computer Science and Electrical Engineering Department
dc.relation.ispartof: UMBC Student Collection
dc.rights: Attribution 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: Computer Science - Computation and Language
dc.subject: UMBC Interactive Robotics and Language Lab
dc.title: DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for Efficient Large Reasoning Models
dc.type: Text
dcterms.creator: https://orcid.org/0009-0007-8488-3056
dcterms.creator: https://orcid.org/0000-0003-2413-9368
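
For orientation, the abstract's two-stage pipeline (teacher-side skill-aware step decomposition and pruning, followed by distillation into a student) can be pictured with a short sketch. This is a minimal illustration assuming a generic text-generation interface; the function names, prompt wording, and the generate method are hypothetical and are not taken from the authors' released code.

    from typing import Protocol

    class TextGenerator(Protocol):
        # Hypothetical interface: any model wrapper exposing generate(prompt) -> str.
        def generate(self, prompt: str) -> str: ...

    def decompose_into_skill_steps(cot: str, teacher: TextGenerator) -> list[str]:
        # Teacher splits a long chain-of-thought into discrete steps,
        # each labeled with the reasoning skill it exercises.
        prompt = ("Split this reasoning trace into numbered steps and "
                  "label each with the skill it uses:\n" + cot)
        return teacher.generate(prompt).splitlines()

    def prune_steps(steps: list[str], teacher: TextGenerator) -> str:
        # Teacher drops redundant or verbose steps, keeping only those
        # needed to reach the final answer.
        prompt = ("Keep only the steps essential to the final answer, "
                  "rewritten concisely:\n" + "\n".join(steps))
        return teacher.generate(prompt)

    def build_distillation_set(problems: list[str], long_cots: list[str],
                               teacher: TextGenerator) -> list[dict]:
        # Pair each problem with its pruned reasoning path; these pairs
        # are what the student model would be fine-tuned on.
        dataset = []
        for problem, cot in zip(problems, long_cots):
            steps = decompose_into_skill_steps(cot, teacher)
            pruned = prune_steps(steps, teacher)
            dataset.append({"prompt": problem, "completion": pruned})
        return dataset

The resulting (problem, pruned-CoT) pairs would then feed an ordinary supervised fine-tuning loop for the student model, which is the stage the abstract credits for the reported token savings (e.g., 917 to 328 average tokens on GSM8K).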

Files

Original bundle

Name: 2505.13975v2.pdf
Size: 2.17 MB
Format: Adobe Portable Document Format