Stefano Antonioni

diddl1970

AI & ML interests

NLP, LLM

Recent Activity

upvoted a collection about 20 hours ago

diddl1970's activity

upvoted 12 collections about 20 hours ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 199

Qwen2-Audio

Audio-language model series based on Qwen2 • 4 items • Updated Nov 28, 2024 • 50

Qwen2-Math

Math-specific model series based on Qwen2 • 8 items • Updated Nov 28, 2024 • 48

Qwen2

Qwen2 language models: pretrained and instruction-tuned models in 5 sizes (0.5B, 1.5B, 7B, 57B-A14B, and 72B) • 39 items • Updated Nov 28, 2024 • 355

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated 17 days ago • 66

Qwen2.5

Qwen2.5 language models: pretrained and instruction-tuned models in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B) • 45 items • Updated Nov 28, 2024 • 488

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 269

QwQ

Qwen with Questions • 2 items • Updated Nov 28, 2024 • 56

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated 30 days ago • 43

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 5 days ago • 89

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 4 days ago • 290

Llama 3.3

This collection hosts the transformers and original repos of Llama 3.3 • 1 item • Updated Dec 6, 2024 • 124

upvoted an article 23 days ago

Article

Use Models from the Hugging Face Hub in LM Studio

Nov 28, 2024 • 134

upvoted 3 collections about 1 month ago

Recent highlights

Some recent models worth checking out • 18 items • Updated Nov 1, 2024 • 47

Recommended small models

Recent models smaller than ~25B parameters that are high quality and reputable • 19 items • Updated Nov 30, 2024 • 45

Recommended large models

This collection contains some of the recent models larger than ~25B parameters that should be high quality and reliable • 15 items • Updated Nov 27, 2024 • 15

upvoted 4 collections 3 months ago

OPT

OPT (Open Pretrained Transformer) is a series of open-sourced large causal language models that perform similarly to GPT-3. • 12 items • Updated Nov 21, 2024 • 4

Sapiens

Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens • 72 items • Updated Sep 18, 2024 • 53

Chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. • 2 items • Updated Jul 9, 2024 • 28

MelodyFlow

MelodyFlow: High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching • 7 items • Updated Oct 23, 2024 • 16