view article Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x • 6 days ago • 24
view article Article Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia? By davanstrien • 5 days ago • 6
view article Article Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner By abhishek • 3 days ago • 5
view article Article makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • 5 days ago • 14
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • 14 days ago • 27
view article Article ⚗️ 🧑🏼🌾 Let's grow some Domain Specific Datasets together By burtenshaw • 13 days ago • 25
view article Article A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI By AmelieSchreiber • 11 days ago • 13
view article Article Fish Speech V1 - New Multilingual Open Source TTS Model By lengyue233 • 9 days ago • 3
view article Article Token Merging for fast LLM inference : Background and first trials with Mistral By samchain • 12 days ago • 1
view article Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation 14 days ago • 67
view article Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram • 18 days ago • 34
view article Article Fine Tuning a LLM Using Kubernetes with Intel® Xeon® Scalable Processors By dmsuehir • 18 days ago • 2
view article Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • about 3 hours ago • 41
view article Article Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM By Pclanglais • 16 days ago • 10
view article Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • 16 days ago • 53
view article Article Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+ By Andyrasika • 16 days ago • 4
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 24 days ago • 500
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent 21 days ago • 68
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published 20 days ago • 228
view article Article Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data By Pclanglais • 24 days ago • 20
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12 • 54
view article Article Ryght’s Journey to Empower Healthcare and Life Sciences with Expert Support from Hugging Face 27 days ago • 5
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models Paper • 2404.02258 • Published Apr 2 • 99
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent Paper • 2404.03648 • Published Apr 4 • 22
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community 28 days ago • 119
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper • 2307.01952 • Published Jul 4, 2023 • 72
ClassPruning: Speed Up Image Restoration Networks by Dynamic N:M Pruning Paper • 2211.05488 • Published Nov 10, 2022 • 1
Multi-Curve Translator for High-Resolution Photorealistic Image Translation Paper • 2203.07756 • Published Mar 15, 2022 • 1
Modular Degradation Simulation and Restoration for Under-Display Camera Paper • 2209.11455 • Published Sep 23, 2022 • 1
StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement Paper • 2107.12898 • Published Jul 27, 2021 • 2
Rethinking Performance Gains in Image Dehazing Networks Paper • 2209.11448 • Published Sep 23, 2022 • 1
Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models Paper • 2310.17086 • Published Oct 26, 2023 • 1
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations Paper • 2404.01266 • Published Apr 1 • 1
Plug-In Inversion: Model-Agnostic Inversion for Vision with Data Augmentations Paper • 2201.12961 • Published Jan 31, 2022 • 1
Canary in a Coalmine: Better Membership Inference with Ensembled Adversarial Queries Paper • 2210.10750 • Published Oct 19, 2022 • 1
Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise Paper • 2208.09392 • Published Aug 19, 2022 • 1
What do Vision Transformers Learn? A Visual Exploration Paper • 2212.06727 • Published Dec 13, 2022 • 1
DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback Paper • 2311.17946 • Published Nov 29, 2023 • 1
CodecLM: Aligning Language Models with Tailored Synthetic Data Paper • 2404.05875 • Published Apr 8 • 15
QueryForm: A Simple Zero-shot Form Entity Query Framework Paper • 2211.07730 • Published Nov 14, 2022 • 1
Compositional Semantic Parsing with Large Language Models Paper • 2209.15003 • Published Sep 29, 2022 • 1
RULER: What's the Real Context Size of Your Long-Context Language Models? Paper • 2404.06654 • Published Apr 9 • 31
JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks Paper • 2404.03027 • Published Apr 3 • 2
Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New Speakers Paper • 2211.00585 • Published Nov 1, 2022 • 1