asturian-nllb-lora-chrfpp

LoRA adapter for Spanish→Asturian (spa_Latn → ast_Latn) machine translation, fine-tuned on top of NLLB-200-distilled-600M. The released checkpoint is the one with the highest chrF++ score on the FLORES+ dev set.

Training Data

Training Details

  • Base model: facebook/nllb-200-distilled-600M
  • Method: LoRA (r=16, alpha=32, target modules: q_proj, k_proj, v_proj, out_proj)
  • Batch size: 32 (16 per device × 2 gradient accumulation steps)
  • Learning rate: 2e-4 (cosine schedule, 5% warmup)
  • Epochs: 1.5
  • Validation: chrF++ on FLORES+ dev set
  • Precision: bfloat16
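
The hyperparameters above imply a few derived quantities that are easy to check by hand. The following is a minimal sketch in plain Python, not the author's training script. It assumes a single device (so the effective batch is 16 × 2), a linear warmup, and a cosine decay to zero, which is the shape of the common Hugging Face cosine scheduler. The card does not state the total step count, so `total_steps` is a hypothetical value.

```python
import math

# Values stated in the model card.
LORA_R = 16
LORA_ALPHA = 32
PER_DEVICE_BATCH = 16
GRAD_ACCUM_STEPS = 2
PEAK_LR = 2e-4
WARMUP_RATIO = 0.05

# Standard LoRA scales the low-rank update by alpha / r.
lora_scaling = LORA_ALPHA / LORA_R

# Effective batch size, assuming a single device (assumption, see lead-in).
effective_batch = PER_DEVICE_BATCH * GRAD_ACCUM_STEPS


def lr_at(step: int, total_steps: int) -> float:
    """Learning rate at `step`: linear warmup, then cosine decay to zero."""
    warmup_steps = round(WARMUP_RATIO * total_steps)
    if step < warmup_steps:
        return PEAK_LR * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return PEAK_LR * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    total_steps = 1000  # hypothetical; the card does not report this
    print("LoRA scaling:", lora_scaling)
    print("Effective batch size:", effective_batch)
    for step in (0, 25, 50, 525, 1000):
        print(step, f"{lr_at(step, total_steps):.2e}")
```

With these settings the LoRA update is scaled by a factor of 2, and the learning rate reaches its 2e-4 peak after the first 5% of steps before decaying.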

Results on FLORES+ devtest

  • BLEU: 16.26
  • chrF++: 45.48
  • COMET: 66.95
  • BLEURT: 42.70

Usage

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
from peft import PeftModel

model_name = "facebook/nllb-200-distilled-600M"
adapter_name = "ikergf/asturian-nllb-lora-chrfpp"

# Load the tokenizer from the adapter repo and set Spanish as the source language.
tokenizer = AutoTokenizer.from_pretrained(adapter_name)
tokenizer.src_lang = "spa_Latn"

# Load the base NLLB model, then attach the LoRA adapter weights on top of it.
base_model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
model = PeftModel.from_pretrained(base_model, adapter_name)
model.eval()

# NLLB picks the target language from the first generated token (Asturian here).
forced_bos_token_id = tokenizer.convert_tokens_to_ids("ast_Latn")

inputs = tokenizer("Hola, ¿cómo estás?", return_tensors="pt")
outputs = model.generate(
    **inputs,
    forced_bos_token_id=forced_bos_token_id,
    max_new_tokens=128,
    num_beams=5,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))