Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published 3 days ago • 47
view article Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • 6 days ago • 45
Configurable Safety Tuning ⚙️ Collection CST allows for configurable inference-time control of LLM safety levels, so users can dictate model behavior based on the system prompt • 7 items • Updated about 6 hours ago • 1
Configurable Safety Tuning of Language Models with Synthetic Preference Data Paper • 2404.00495 • Published Mar 30 • 1
Quantized Models (GGUF, IQ, Imatrix) Collection Various quantizations of models in the GGUF format. Models with a "checkmark" are personal favorites. An "orange arrow" means it's being uploaded. • 72 items • Updated about 1 hour ago • 29
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs Paper • 2402.08005 • Published Feb 12 • 1
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models Paper • 2402.03749 • Published Feb 6 • 9
🛰️🌍 Geospatial Datasets Collection A curated collections of diverse geospatial and satellite imagery datasets. • 54 items • Updated Mar 6 • 10
Exotic Frankenmerges 🥨 Collection Merges of models of different architectures and sizes that end up working surprisingly well • 1 item • Updated Jan 21 • 1
Upscaled Models ⏫ Collection A collection of my frankenmerges, upscaling several models. All of them have the corresponding GGUF variants. • 4 items • Updated Jan 20 • 2
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 28 items • Updated Mar 23 • 172
Distilled Self-Critique of LLMs with Synthetic Data: a Bayesian Perspective Paper • 2312.01957 • Published Dec 4, 2023 • 1
Optimised Translation Models 🌍 Collection A collection of optimised and quantised multilingual translation models • 6 items • Updated Nov 7, 2023 • 3
Fast Adaptation with Bradley-Terry Preference Models in Text-To-Image Classification and Generation Paper • 2308.07929 • Published Jul 15, 2023 • 1
Personalizing Text-to-Image Generation via Aesthetic Gradients Paper • 2209.12330 • Published Sep 25, 2022 • 1
LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition Paper • 2307.13269 • Published Jul 25, 2023 • 29