c

ab77c

·

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

Comfy-Org/Krea-2

liked a model 8 days ago

RecursiveMAS/Deliberation-Reflector-Qwen3.5-4B

liked a model 10 days ago

RecursiveMAS/Distillation-Expert-Qwen3.5-9B

View all activity

Organizations

upvoted an article about 2 months ago

Article

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

nvidia

•

Apr 28

• 62

upvoted 2 collections 2 months ago

Ternary Bonsai

1.58-bit Bonsai models • 9 items • Updated 25 days ago • 96

Gemma 4

15 items • Updated 19 days ago • 1k

upvoted 4 collections 3 months ago

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 18 days ago • 168

NeMo Gym

Collection of RL verifiable data for NeMo Gym • 32 items • Updated 18 days ago • 62

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 18 days ago • 173

Bonsai

1-bit Bonsai models • 7 items • Updated 25 days ago • 208

upvoted a collection 4 months ago

Tiny Aya

Bridging Scale and Multilingual Depth • 10 items • Updated Feb 17 • 76

upvoted 2 collections 5 months ago

Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability

A compilation of sparse auto-encoders trained on large language models. • 37 items • Updated Dec 16, 2025 • 24

FLUX.2

Our second generation of FLUX • 21 items • Updated Apr 6 • 248

upvoted a collection 9 months ago

Qwen3-Omni

6 items • Updated Dec 31, 2025 • 204

upvoted a collection about 1 year ago

MiMo-VL

6 items • Updated Dec 17, 2025 • 45

upvoted a paper about 1 year ago

CodeFusion: A Pre-trained Diffusion Model for Code Generation

Paper • 2310.17680 • Published Oct 26, 2023 • 74

upvoted a collection about 1 year ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.82k

upvoted an article over 1 year ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

+2

ariG23498, merve, pcuenq, reach-vb

•

Mar 12, 2025

• 497

upvoted a paper over 1 year ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22, 2025 • 131

upvoted a collection over 1 year ago

Cosmos-Preidct1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3 • 14 items • Updated 18 days ago • 304

upvoted a paper over 1 year ago

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 115

upvoted 2 collections over 1 year ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 675

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 18 days ago • 54