RFEval: Benchmarking Reasoning Faithfulness under Counterfactual Reasoning Intervention in Large Reasoning Models
Paper β’ 2602.17053 β’ Published β’ 1
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
A Neuro-Symbolic Architecture for Faithful Multi-Hop Reasoning via Differentiable Graph Traversal and Discrete DAG Extraction
This repository contains the complete whitepaper for the Graph-Inference Engine (GIE), a neuro-symbolic model that navigates and agentically evolves a continuous world-graph to resolve queries by outputting discrete Directed Acyclic Graph (DAG) reasoning traces.
π Read the full whitepaper β
| Phase | Innovation | Mathematical Foundation |
|---|---|---|
| Phase 1 | Differentiable Graph Traversal | Neural Bellman-Ford over learned semirings (NBFNet) + NOTEARS DAG extraction |
| Phase 2 | Hybrid RL Training | GFlowNet Trajectory Balance with path-refined reward (reachability + parsimony + integrity) |
| Phase 3 | Agentic Graph Evolution | Edge Proposal Network + Sheaf Laplacian pre-filter + Datalog/PSL symbolic gatekeeper |
| Phase 4 | Reasoning Faithfulness Metrics | Counterfactual Intervention Testing (CIT) + DAG-Consistency Index (DCI) |
| Phase 5 | Computational Advantage | O(kΒ·MΒ·dΒ²) linear vs O(kΒ²Β·tΒ²Β·dΒ²_model) quadratic; ~4,000Γ speedup at 20-hop |
Query Q β [Entity Grounding] β [NBFNet Traversal (T iterations)]
β
[Edge Relevance Scores Ο_q(e)]
β
[NOTEARS DAG Extraction] β D_Q
β
[Dead-End?] β YES β [Edge Proposal Network]
β β
NO [Gatekeeper: Sheaf + Datalog + PSL]
β β
[Answer Decode] [Expand G, Re-traverse]
β β
(v_answer, D_Q) ββββββββββ
@techreport{gie2026,
title={The Graph-Inference Engine: A Neuro-Symbolic Architecture for Faithful Multi-Hop Reasoning},
year={2026},
note={Technical whitepaper}
}