3 20 21

Metal Whale

metalwhale

https://blog.metalwhale.dev/

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago

microsoft/Phi-4-multimodal-instruct

liked a model 8 days ago

Wan-AI/Wan2.1-T2V-14B

upvoted an article 30 days ago

Open-source DeepResearch – Freeing our search agents

View all activity

Organizations

None yet

metalwhale's activity

liked 2 models 8 days ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 3 days ago • 113k • 976

Wan-AI/Wan2.1-T2V-14B

Text-to-Video • Updated 9 days ago • 179k • • 906

upvoted an article 30 days ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.14k

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 792

liked a model about 1 month ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 12 days ago • 4.16M • • 11k

liked a model about 2 months ago

vikhyatk/moondream2

Image-Text-to-Text • Updated Jan 9 • 131k • 1.06k

upvoted a paper 3 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 93

liked a model 3 months ago

tencent/HunyuanVideo

Text-to-Video • Updated 1 day ago • 6.45k • • 1.73k

upvoted a collection 3 months ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 25 days ago • 298

upvoted an article 4 months ago

Article

Releasing the largest multilingual open pretraining dataset

and 2 others •

Nov 13, 2024

• 100

upvoted a paper 5 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 171

liked 5 models 5 months ago

liked a model 6 months ago

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • Updated Feb 4 • 78.6k • 1.41k

upvoted a collection 6 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 10 days ago • 549

liked 2 models 6 months ago

Qwen/Qwen2.5-7B-Instruct

Text Generation • Updated Jan 12 • 1.54M • • 540

deepseek-ai/DeepSeek-V2.5

Text Generation • Updated Dec 11, 2024 • 3.98k • 700