Speculative Decoding Draft Models Collection Collection of OpenVINO optimized efficient draft models for speculative decoding • 2 items • Updated 12 days ago • 6
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 6 days ago • 24
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated 11 days ago • 26
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration Paper • 2411.10958 • Published 17 days ago • 47
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published 12 days ago • 54
Vortex Collection ModelCloud optimized and validated quants that pass/meet strict quality assurance on multiple benchmarks. • 6 items • Updated 1 day ago • 6
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 11 items • Updated 3 days ago • 32
Drowning in Documents: Consequences of Scaling Reranker Inference Paper • 2411.11767 • Published 15 days ago • 17
Rombos-Coder-V2.5 Collection Collection of coding models made by rombo based on qwen 2.5 • 6 items • Updated 21 days ago • 6
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 29 days ago • 93
Qwen 2.5 Coder Collection Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. • 35 items • Updated 13 days ago • 19
jina-embeddings-v3 Collection Multilingual multi-task general text embedding model • 6 items • Updated Sep 19 • 19
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 6 days ago • 240
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 6 days ago • 346