Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated about 21 hours ago • 70
🏟️ Long Code Arena Collection All the resources for our Long Code Arena benchmark! • 13 items • Updated Jun 19, 2024 • 5
OLMoE Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated 21 days ago • 29
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 127
Article Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick By cxdu • Oct 24, 2024 • 10