DFKI-SLT/few-nerd
Viewer • Updated • 514k • 2.66k • 21
How to use AurelPx/lfm25-1.2b-grpo-fewnerd-intra with Unsloth Studio:
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for AurelPx/lfm25-1.2b-grpo-fewnerd-intra to start chatting
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for AurelPx/lfm25-1.2b-grpo-fewnerd-intra to start chatting
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for AurelPx/lfm25-1.2b-grpo-fewnerd-intra to start chatting
pip install unsloth
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
model_name="AurelPx/lfm25-1.2b-grpo-fewnerd-intra",
max_seq_length=2048,
)LoRA adapter fine-tuned with Group Relative Policy Optimization (GRPO) on the FewNERD INTRA benchmark.
Evaluation on the FewNERD INTRA test split (n=200).
| Metric | Base model | Fine-tuned |
|---|---|---|
| Span F1 | 0.4288 | 0.4655 |
| JSON validity rate | 1.0 | 1.0 |
| Schema validity rate | 0.92 | 1.0 |
| Function | Weight | Signal |
|---|---|---|
reward_valid_json |
0.5 | Valid JSON schema |
reward_valid_schema |
0.5 | Valid entity types |
reward_span_f1 |
2.0 | Exact text + type match |
reward_recall_bonus |
0.3 | Encourages extracting more entities |
reward_partial_text |
0.5 | Credit for partial span overlaps |
@inproceedings{ding2021fewnerd,
title = {Few-NERD: A Few-Shot Named Entity Recognition Dataset},
author = {Ding, Ning and Xu, Guangwei and Chen, Yulin and others},
booktitle = {Proceedings of ACL 2021},
year = {2021}
}
Base model
LiquidAI/LFM2.5-1.2B-Base