x

antmanler

AI & ML interests

None yet

Recent Activity

liked a model 21 days ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

liked a Space about 2 months ago

nanotron/ultrascale-playbook

View all activity

Organizations

antmanler's activity

liked a model 21 days ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

Image-Text-to-Text • Updated 6 days ago • 119k • 1.07k

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated Feb 24 • 1.68M • • 1.12k

liked a Space about 2 months ago

2.42k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked 4 models about 2 months ago

liked 3 models 3 months ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 11 days ago • 6.98k • 889

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated Jan 13 • 39.5k • 542

microsoft/phi-4

Text Generation • Updated Feb 24 • 558k • • 1.97k

liked a dataset 3 months ago

HuggingFaceFW/fineweb

Viewer • Updated Jan 31 • 25B • 186k • 2.08k

liked a model 4 months ago

RLHFlow/Llama3.1-8B-PRM-Deepseek-Data

Text Generation • Updated Nov 9, 2024 • 22.4k • 33

liked a Space 4 months ago

544

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

liked 2 models 4 months ago

recursal/QRWKV6-32B-Instruct-Preview-v0.1

Text Generation • Updated 25 days ago • 250 • 77

VITA-MLLM/Freeze-Omni

Updated Nov 26, 2024 • 15

liked a dataset 4 months ago

HuggingFaceTB/smoltalk

Viewer • Updated Feb 10 • 2.2M • 6.13k • 318

liked a Space 5 months ago

Omni Mini

🌖

liked 3 models 5 months ago

kinetical/llama3.2-3b-simulMT-et-en

Updated Nov 9, 2024 • 7 • 3

kyutai/mimi

Feature Extraction • Updated Sep 18, 2024 • 302k • 182

fishaudio/fish-agent-v0.1-3b

Audio-to-Audio • Updated Nov 1, 2024 • 602 • 256