gemma-4-31b-it-heretic-ara-eagle3-ko
This is Eagle-3 draft model for Korean conversation only. For other languages, please use other model.
You may expect 1.5x speed boost at maximum on Korean workload.
νκ΅μ΄ λνμ μ΅μ ν λ Eagle-3 Draft λͺ¨λΈμ λλ€. ν μΈμ΄μ κ²½μ° λ€λ₯Έ λͺ¨λΈμ μ¬μ©νμΈμ.
νκ΅μ΄ μμ μμ μ΅λ 1.5x μ λ μλ κ°μ μ΄ μμ΅λλ€.
Model Overview
- Verifier: hell0ks/gemma-4-31b-it-heretic-ara-FP8
- Speculative Decoding Algorithm: EAGLE-3
- Model Architecture: Eagle3Speculator
- Release Date: 2026/05/05
How it was made
- Training framework: Speculators
- Datasets: Private (Korean 8: English 2, 60k, No Reasoning)
- Training hardware: 1 DGX Spark
| Name | Value |
|---|---|
| Learning Rate | 1e-4 |
| Scheduler Type | Cosine |
| Warmup steps | 50 |
| Sequence length | 4096 |
| Epochs | 4 |
| Vocab size | 32000 |
Usage
Tested with vLLM on DGX Spark (sm121)
vllm serve hell0ks/gemma-4-31b-it-heretic-ara-FP8 --port 8000 --reasoning-parser gemma4 --enable-auto-tool-choice --tool-call-parser gemma4 --speculative-config '{"model": "hell0ks/gemma-4-31b-it-heretic-ara-eagle3-ko", "num_speculative_tokens": 3, "method": "eagle3"}'
- Downloads last month
- 61
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support