Cosmos Tokenizer Collection A suite of image and video tokenizers • 10 items • Updated 15 days ago • 17
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP models more SOTA than ever. • 7 items • Updated 1 day ago • 35
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 17 days ago • 171
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3 • 76
L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects Paper • 2402.09052 • Published Feb 14 • 17
IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation Paper • 2402.08682 • Published Feb 13 • 12
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis Paper • 2401.17093 • Published Jan 30 • 19
Kosmos-G: Generating Images in Context with Multimodal Large Language Models Paper • 2310.02992 • Published Oct 4, 2023 • 4
RealFill: Reference-Driven Generation for Authentic Image Completion Paper • 2309.16668 • Published Sep 28, 2023 • 14