Load 4bit models 4x faster Collection Native bitsandbytes 4bit pre-quantized models • 25 items • Updated 3 days ago • 49
Article makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • May 7 • 39
BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 96
Foundation Models for Vision 🧩 Collection Foundation models for computer vision. • 24 items • Updated Mar 11 • 17
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution Paper • 2307.06304 • Published Jul 12, 2023 • 27