Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning Paper • 2303.15647 • Published Mar 28, 2023 • 5
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • 20 days ago • 27
ablation-models Collection 1.8B models trained on 350BT to compare different pretraining datasets • 7 items • Updated 13 days ago • 20
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 12 days ago • 76
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper • 2309.10400 • Published Sep 19, 2023 • 22
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 5 days ago • 170
SLIM Models Collection Structured Language Instruction Models (SLIMs) • 17 items • Updated Mar 19 • 24
zephyr-7b-sft-full-SPIN Collection Models fine-tuned with SPIN across iterations 0,1,2,3 • 4 items • Updated Feb 7 • 7
datasets-SPIN Collection Generated synthetic data used to finetune SPIN. • 8 items • Updated Feb 9 • 10
Synatra Family - Korean Model Collection Synatra(Mistral Finetuned model) model collection. • 12 items • Updated 23 days ago • 3