base_model:
  - griffith-bigdata/Qwen3-4B-SQL-Writer

FINER-SQL-4B-BIRD

Fine-tuned from griffith-bigdata/Qwen3-4B-SQL-Writer using GRPO (Group Relative Policy Optimization) with two additional dense rewards from the FINER-SQL paper:

- 🧠 Memory Reward: aligns reasoning with verified traces
- ⚙️ Atomic Reward: measures operation-level SQL overlap
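
To give a feel for what "operation-level SQL overlap" could look like, here is a minimal sketch that splits each query into coarse (clause, token) pairs and scores a prediction by Jaccard overlap with the reference. This is an illustration only, not the reward defined in the FINER-SQL paper: the clause list `SQL_CLAUSES`, the helpers `atomic_ops` and `atomic_overlap`, and the choice of Jaccard similarity are all assumptions made for this example.

```python
import re

# Hypothetical clause vocabulary; the paper's actual atomic units may differ.
SQL_CLAUSES = ("SELECT", "FROM", "WHERE", "JOIN", "GROUP BY",
               "ORDER BY", "HAVING", "LIMIT")


def atomic_ops(sql: str) -> set[tuple[str, str]]:
    """Break a query into a set of (clause, token) pairs."""
    # Normalize whitespace, drop a trailing semicolon, and ignore case.
    text = re.sub(r"\s+", " ", sql.strip().rstrip(";")).upper()
    pattern = "|".join(re.escape(c) for c in SQL_CLAUSES)
    # With a capture group, re.split keeps the clause keywords:
    # [prefix, clause1, body1, clause2, body2, ...]
    parts = re.split(rf"\b({pattern})\b", text)
    ops: set[tuple[str, str]] = set()
    for clause, body in zip(parts[1::2], parts[2::2]):
        for token in re.findall(r"[A-Z0-9_.*]+|[<>=!]+", body):
            ops.add((clause, token))
    return ops


def atomic_overlap(predicted_sql: str, gold_sql: str) -> float:
    """Jaccard overlap of atomic operations, in [0, 1] (assumed metric)."""
    pred, gold = atomic_ops(predicted_sql), atomic_ops(gold_sql)
    union = pred | gold
    if not union:
        return 0.0
    return len(pred & gold) / len(union)


if __name__ == "__main__":
    gold = "SELECT name FROM users WHERE age > 30"
    pred = "SELECT name FROM users WHERE age > 25"
    print(round(atomic_overlap(pred, gold), 3))
```

A score like this is dense in the sense that a query that is almost right (here, only the literal differs) still earns partial credit, whereas execution accuracy alone would give it zero.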

✅ 68.4% execution accuracy (EX) on BIRD when trained only on the BIRD training set; inference fits on a 24 GB GPU
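
As a rough sanity check on the 24 GB figure, the weights-only footprint can be estimated from the parameter count and the numeric precision. The sketch below assumes the "4B" in the model name means about 4 billion parameters; the helper `weight_memory_gib` is hypothetical, and the estimate ignores the KV cache, activations, and framework overhead, which add to the total.

```python
def weight_memory_gib(n_params: float, bytes_per_param: int) -> float:
    """Memory needed for the weights alone, in GiB."""
    return n_params * bytes_per_param / 1024**3


# Assumption: "4B" is read as roughly 4e9 parameters.
N_PARAMS = 4e9

if __name__ == "__main__":
    for dtype, nbytes in [("float32", 4), ("bfloat16", 2), ("int8", 1)]:
        print(f"{dtype}: {weight_memory_gib(N_PARAMS, nbytes):.2f} GiB")
```

Under these assumptions, 16-bit weights take roughly 7.5 GiB, which leaves headroom on a 24 GB card for the KV cache and long reasoning outputs.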