🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 6 days ago • 98
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity Paper • 2401.17072 • Published Jan 30 • 24
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 27
InternVL 1.5 Collection A Pioneering Open-Source Alternative to GPT-4V • 8 items • Updated 15 days ago • 5
Lumos : Empowering Multimodal LLMs with Scene Text Recognition Paper • 2402.08017 • Published Feb 12 • 24
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Paper • 2405.12130 • Published May 20 • 44
Article 🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware Feb 10, 2023 • 25
Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 34
Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 • 14
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA May 24, 2023 • 54
Article How to train a new language model from scratch using Transformers and Tokenizers Feb 14, 2020 • 12
Article Generating Human-level Text with Contrastive Search in Transformers 🤗 Nov 8, 2022 • 4
Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 63