DynMoE Family Collection DynMoE model checkpoints and paper on huggingface • 4 items • Updated 3 days ago • 3
Scaling Diffusion Transformers to 16 Billion Parameters Paper • 2407.11633 • Published 6 days ago • 21
Training language models to follow instructions with human feedback Paper • 2203.02155 • Published Mar 4, 2022 • 13
Qwen2 Collection Qwen2 language models, both pretrained and instruction-tuned, in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 29 items • Updated Jun 6 • 260
Article Introducing Ghost 8B Beta: A Game-Changing Language Model By lamhieu • 5 days ago • 6
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions Paper • 2407.06723 • Published 13 days ago • 9
Article MInference 1.0: 10x Faster Million Context Inference with a Single GPU By liyucheng • 12 days ago • 9
Transformers.js demos Collection A collection of my favorite WebML demos, built with Transformers.js! • 30 items • Updated 11 days ago • 60
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models Paper • 2407.02687 • Published 20 days ago • 20
Perturbed Attention Guidance pipelines Collection Pipelines for Perturbed Attention Guidance with 🧨 library • 8 items • Updated 26 days ago • 5
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated 25 days ago • 140
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published 27 days ago • 75
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data Paper • 2406.18790 • Published 26 days ago • 32
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation Paper • 2406.16855 • Published 28 days ago • 53
VideoTetris: Towards Compositional Text-to-Video Generation Paper • 2406.04277 • Published Jun 6 • 21
abliterated-v3 Collection Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3 • 74
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 5 days ago • 32
Universal token classification Collection Collection of universal token classification (UTC) models capable of solving many information extraction tasks in a prompt-tuned manner. • 11 items • Updated 19 days ago • 12
Article Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI. By KingNish • May 21 • 28