dex man's picture

11 77

dex man

user2212

·

AI & ML interests

None yet

Recent Activity

liked a Space 9 days ago

lerobot/visualize_dataset

liked a model 10 days ago

Qwen/Qwen2.5-Omni-7B

liked a model 12 days ago

deepseek-ai/DeepSeek-V3-0324

View all activity

Organizations

None yet

user2212's activity

upvoted a collection 24 days ago

Gemma 3 Release

17 items • Updated 3 days ago • 310

upvoted an article 24 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

25 days ago

• 371

upvoted a collection about 2 months ago

DeepSeek-VL2

5 items • Updated Feb 9 • 72

upvoted a collection 6 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 23 days ago • 300

upvoted 7 collections 7 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26 • 580

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 226

DeepSeek-V2.5

2 items • Updated Dec 10, 2024 • 40

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 121

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 3 days ago • 85

Extended Mind Transformers

8 items • Updated Jun 5, 2024 • 6

Husky-v1

A unified language agent that addresses numerical, tabular and knowledge-based reasoning tasks. • 6 items • Updated Jun 11, 2024 • 8