SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages Paper • 2407.19672 • Published Jul 29 • 54
Article Fine-Tuning LLMs: Supervised Fine-Tuning and Reward Modelling By rishiraj • Dec 4, 2023 • 2
Article Unleashing the Power of Unsloth and QLora: Redefining Language Model Fine-Tuning By Andyrasika • Jan 19 • 8
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 8 days ago • 585
Llama 3.1 Evals Collection This collection provides detailed information on how we derived the reported benchmark metrics for the Llama 3.1 models, including the configurations, • 6 items • Updated 8 days ago • 15
Article Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning By damjan-k • Feb 20 • 11
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 174
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity Paper • 2401.17072 • Published Jan 30 • 25
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 28
InternVL 1.5 Collection A Pioneering Open-Source Alternative to GPT-4V • 8 items • Updated Aug 23 • 8
Lumos: Empowering Multimodal LLMs with Scene Text Recognition Paper • 2402.08017 • Published Feb 12 • 24
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Paper • 2405.12130 • Published May 20 • 45
Article 🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware Feb 10, 2023 • 35
Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 57
Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 • 26
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA May 24, 2023 • 82
Article How to train a new language model from scratch using Transformers and Tokenizers Feb 14, 2020 • 17
Article Generating Human-level Text with Contrastive Search in Transformers 🤗 Nov 8, 2022 • 6
Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 95