Model Card for Model ID

This is a saved checkpoint from fine-tuning a Qwen/Qwen2.5-0.5B model via LoRA for the digit-sequence transformation task the paper "Skill Neologisms: Towards Skill-based Continual Learning" (ICML 2026).

Model Details

Model Description

This checkpoint was fine-tuned on a synthetic digit-sequence transformation task (see paper for full details).

Model Sources

Uses

Please refer to the official repo for example use.

Training Details

The LoRA adapter is trained over two phases with the configuration below (please refer to the paper for more details).

Training configuration.
Parameter Phase 1 Phase 2
Base Model Qwen/Qwen2.5-0.5B
PEFT Method LoRA (r=32, α=32)
Target Modules q, k, v, o, gate, up, down
Training Samples 100,000 500,000
Test Samples 500 500
Operations per Sample 1 1–3
Epochs 3 3
Batch Size 64 64
Learning Rate 2e-4 2e-4
Warmup Steps 500 500
Operations [ASC], [DESC], [ADD], [SUB], [POL], [REV], [ID]
Sequence Lengths 2, 3, 4, 6, 8 (held-out: 5, 7, 9)
Held-out 3-op combinations -- 25%

Technical Specifications

Hardware

This model was trained on a NVIDIA RTX 6000 GPU (48GB VRAM).

Citation

BibTeX:

@article{berthon2026skill,
  title={Skill Neologisms: Towards Skill-based Continual Learning},
  author={Berthon, Antonin and Astorga, Nicolas and van der Schaar, Mihaela},
  journal={arXiv preprint arXiv:2605.04970},
  year={2026}
}

Model Card Contact

Antonin Berthon (berthon/dot\antonin/at\gmail/dot\com)

Framework versions

  • transformers 5.0.0rc1
  • peft 0.17.1
  • trl 0.26.1
Downloads last month
212
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for antoninbrthn/skill-neologisms-qwen2.5-digitseq-base

Adapter
(409)
this model

Paper for antoninbrthn/skill-neologisms-qwen2.5-digitseq-base