antmanler

AI & ML interests

None yet

antmanler's activity

liked a model 18 days ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

Image-Text-to-Text • Updated 3 days ago • 115k • 1.05k

liked a model about 1 month ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated Feb 24 • 1.77M • 1.12k

liked a Space about 1 month ago

nanotron/ultrascale-playbook

The Ultra-Scale Playbook 🌌 • 2.4k

The ultimate guide to training LLM on large GPU Clusters

liked 4 models about 2 months ago

liked a model 2 months ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 9 days ago • 7.52k • 886

liked 2 models 3 months ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated Jan 13 • 39.6k • 541

microsoft/phi-4

Text Generation • Updated Feb 24 • 596k • 1.96k

liked a dataset 3 months ago

HuggingFaceFW/fineweb

Viewer • Updated Jan 31 • 25B • 193k • 2.08k

liked a model 4 months ago

RLHFlow/Llama3.1-8B-PRM-Deepseek-Data

Text Generation • Updated Nov 9, 2024 • 23.9k • 33

liked a Space 4 months ago

Scaling test-time compute 📈 • 544

Enhance math problem solving by scaling test-time compute

liked 2 models 4 months ago

recursal/QRWKV6-32B-Instruct-Preview-v0.1

Text Generation • Updated 22 days ago • 267 • 77

VITA-MLLM/Freeze-Omni

Updated Nov 26, 2024 • 15

liked a dataset 4 months ago

HuggingFaceTB/smoltalk

Viewer • Updated Feb 10 • 2.2M • 6.45k • 318

liked a Space 5 months ago

Omni Mini

🌖

liked 3 models 5 months ago

kinetical/llama3.2-3b-simulMT-et-en

Updated Nov 9, 2024 • 17 • 3

kyutai/mimi

Feature Extraction • Updated Sep 18, 2024 • 295k • 179

fishaudio/fish-agent-v0.1-3b

Audio-to-Audio • Updated Nov 1, 2024 • 618 • 256