InfoSFT: Learn More and Forget Less with Information-Aware Token Weighting
Paper • 2605.14967 • Published
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
InfoSFT: Learn More and Forget Less with Information-Aware Token Weighting:
Trained on Tooluse dataset, split released with Self-Distillation Fine-tuning (SDFT): https://github.com/idanshen/Self-Distillation/tree/main
Model scored 66.27% on the evaluation set