Qwen3-8B-Eagle3-NeMoRL-RedhatAI-thinking
EAGLE-3 drafter for Qwen/Qwen3-8B, converted from the speculators format to the NeMo-RL-compatible raw-eagle3 format. Private research artifact for the EfficientRollout (NeurIPS 2026) EAGLE-3 baseline reproduction in native NeMo-RL.
- Upstream: RedHatAI/Qwen3-8B-Thinking-speculator.eagle3, trained by RedHat with the vLLM/speculators library on Magpie-Pro-300K-Filtered + UltraChat-200k, thinking enabled.
- Conversion: tensor values unchanged.
  - config: nested speculators config -> flat raw-eagle3 config + `model_type: llama`.
  - weight keys: `layers.0.*` -> `midlayer.*`. `embed_tokens.weight` dropped because NeMo-RL/vLLM reuse the target model embeddings.
  - `eagle_aux_hidden_state_layer_ids: [2, 18, 33]` injected. The upstream config contains the key but sets it to `null`; this mirror makes the Qwen3 36-layer speculators/vLLM convention explicit: `(2, n//2, n-3)`.
  - `norm_before_residual: true` preserved from upstream. This is required for speculators-format RedHat EAGLE3 drafters; omitting it caused acceptance collapse in earlier sanity checks.
  - `rope_theta: 10000.0` preserved from upstream `transformer_layer_config`. This differs from the non-thinking RedHat Qwen3 drafter mirror but matches this thinking drafter's public config.
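
The key renaming and the auxiliary layer-id convention described above can be sketched in a few lines of Python. This is an illustration only, not the script used to produce this mirror: the helper names `aux_layer_ids` and `remap_keys` are hypothetical, and the demo dictionary uses placeholder values and an illustrative `fc.weight` key in place of real tensors.

```python
def aux_layer_ids(num_layers: int) -> tuple[int, int, int]:
    """Return the (2, n//2, n-3) auxiliary hidden-state layer ids for an n-layer target."""
    return (2, num_layers // 2, num_layers - 3)


def remap_keys(state_dict: dict) -> dict:
    """Rename layers.0.* to midlayer.* and drop embed_tokens.weight.

    Values are passed through untouched, matching "tensor values unchanged".
    """
    prefix = "layers.0."
    out = {}
    for key, value in state_dict.items():
        if key == "embed_tokens.weight":
            # Dropped: the target model's embeddings are reused instead.
            continue
        if key.startswith(prefix):
            key = "midlayer." + key[len(prefix):]
        out[key] = value
    return out


# Placeholder values stand in for tensors; key names are illustrative.
demo = {
    "layers.0.self_attn.q_proj.weight": "q_proj tensor",
    "embed_tokens.weight": "embedding tensor",
    "fc.weight": "fc tensor",
}
converted = remap_keys(demo)
ids = aux_layer_ids(36)  # Qwen3-8B has 36 layers
```

For a 36-layer target, `aux_layer_ids(36)` evaluates to `(2, 18, 33)`, which is the value injected into the config.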
Reported upstream k=3 acceptance lengths:
| dataset | k=3 AL |
|---|---|
| HumanEval | 2.90 |
| math_reasoning | 3.04 |
| qa | 2.67 |
| question | 2.78 |
| rag | 2.69 |
Usage (NeMo-RL):
    policy:
      draft:
        model_name: minseokim25/Qwen3-8B-Eagle3-NeMoRL-RedhatAI-thinking