view article Article Adapt custom AI models to the trainer API and to 🤗 By not-lain • 8 days ago • 15
view article Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x • 16 days ago • 24
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study Paper • 2404.14047 • Published 30 days ago • 37
view article Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • 10 days ago • 41
view article Article Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data By Pclanglais • Apr 18 • 20
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 129
view article Article DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive By bpan • Apr 9 • 26
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper • 2404.00399 • Published Mar 30 • 39
Journal Club Collection Candidate papers to read in the H4 journal club • 54 items • Updated about 1 month ago • 23