MUSCLE: A Model Update Strategy for Compatible LLM Evolution Paper • 2407.09435 • Published 10 days ago • 19
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ By xhluca • 14 days ago • 29
AgentInstruct: Toward Generative Teaching with Agentic Flows Paper • 2407.03502 • Published 19 days ago • 35
Step-DPO Collection Resources for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs" • 11 items • Updated 21 days ago • 3
SpeechVerse: A Large-scale Generalizable Audio Language Model Paper • 2405.08295 • Published May 14 • 12
TaskMeAnything Collection A collection of TaskMeAnything resources [https://github.com/JieyuZ2/TaskMeAnything] • 10 items • Updated Jun 14 • 2
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models 29 days ago • 142
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering Paper • 2406.10208 • Published Jun 14 • 21
Block Transformer: Global-to-Local Language Modeling for Fast Inference Paper • 2406.02657 • Published Jun 4 • 36
Article Fish Speech V1 - New Multilingual Open Source TTS Model By lengyue233 • May 3 • 10
PM-pair Collection This is a collection of materials for training a pairwise preference model. • 3 items • Updated May 10 • 1
Eurus Collection Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Apr 15 • 24
Article DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive By bpan • Apr 9 • 28
Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 149
An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models Paper • 2309.09958 • Published Sep 18, 2023 • 18
A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale Paper • 2309.06497 • Published Sep 12, 2023 • 4