Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains

dc.contributor.authorSaxena, Yash
dc.contributor.authorPadia, Ankur
dc.contributor.authorChaudhary, Mandar S.
dc.contributor.authorGunaratna, Kalpa
dc.contributor.authorParthasarathy, Srinivasan
dc.contributor.authorGaur, Manas
dc.date.accessioned2025-06-17T14:45:37Z
dc.date.available2025-06-17T14:45:37Z
dc.date.issued2025-05-23
dc.description.abstractTraditional Retrieval-Augmented Generation (RAG) pipelines rely on similarity-based retrieval and re-ranking, which depend on heuristics such as top-k, and lack explainability, interpretability, and robustness against adversarial content. To address this gap, we propose a novel method METEORA that replaces re-ranking in RAG with a rationale-driven selection approach. METEORA operates in two stages. First, a general-purpose LLM is preference-tuned to generate rationales conditioned on the input query using direct preference optimization. These rationales guide the evidence chunk selection engine, which selects relevant chunks in three stages: pairing individual rationales with corresponding retrieved chunks for local relevance, global selection with elbow detection for adaptive cutoff, and context expansion via neighboring chunks. This process eliminates the need for top-k heuristics. The rationales are also used for consistency check using a Verifier LLM to detect and filter poisoned or misleading content for safe generation. The framework provides explainable and interpretable evidence flow by using rationales consistently across both selection and verification. Our evaluation across six datasets spanning legal, financial, and academic research domains shows that METEORA improves generation accuracy by 33.34% while using approximately 50% fewer chunks than state-of-the-art re-ranking methods. In adversarial settings, METEORA significantly improves the F1 score from 0.10 to 0.44 over the state-of-the-art perplexity-based defense baseline, demonstrating strong resilience to poisoning attacks. Code available at: https://anonymous.4open.science/r/METEORA-DC46/README.md
dc.description.urihttp://arxiv.org/abs/2505.16014
dc.format.extent24 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m2ezhq-b5pl
dc.identifier.urihttps://doi.org/10.48550/arXiv.2505.16014
dc.identifier.urihttp://hdl.handle.net/11603/38921
dc.language.isoen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/deed.en
dc.subjectUMBC Ebiquity Research Group
dc.subjectComputer Science - Computation and Language
dc.subjectUMBC Accelerated Cognitive Cybersecurity Laboratory
dc.titleRanking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-5411-2230

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2505.16014v3.pdf
Size:
1.38 MB
Format:
Adobe Portable Document Format