Hristo Panev's picture

91 677

Hristo Panev

hppdqdq

·

AI & ML interests

None yet

Recent Activity

liked a model about 12 hours ago

microsoft/Phi-4-multimodal-instruct

liked a model about 21 hours ago

Comfy-Org/Wan_2.1_ComfyUI_repackaged

liked a model about 22 hours ago

cyberdelia/CyberIllustrious

View all activity

Organizations

None yet

hppdqdq's activity

upvoted a collection 9 days ago

Deepseek Papers

Deepseek papers collection • 18 items • Updated 9 days ago • 158

upvoted a paper 9 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 11 days ago • 134

upvoted a collection 10 days ago

Step-Audio

Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 10 days ago • 28

upvoted an article 23 days ago

Article

Open-source DeepResearch – Freeing our search agents

24 days ago

• 1.11k

upvoted an article 25 days ago

Article

Open-R1: Update #1

By

and 7 others •

26 days ago

• 288

upvoted a paper about 1 month ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 51

upvoted a paper about 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 258

upvoted 2 papers 3 months ago

Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts

Paper • 2411.10669 • Published Nov 16, 2024 • 10

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 114

upvoted a collection 4 months ago

LongVU

7 items • Updated Oct 31, 2024 • 29

upvoted an article 4 months ago

Article

Allegro: Advanced Video Generation Model

By

•

Oct 22, 2024

• 58

upvoted 3 papers 4 months ago

FlatQuant: Flatness Matters for LLM Quantization

Paper • 2410.09426 • Published Oct 12, 2024 • 14

Retrospective Learning from Interactions

Paper • 2410.13852 • Published Oct 17, 2024 • 9

Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities

Paper • 2410.11190 • Published Oct 15, 2024 • 22

upvoted an article 4 months ago

Article

How to build a custom text classifier without days of human labeling

By

and 4 others •

Oct 17, 2024

• 55

upvoted a collection 4 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Jan 17 • 153

upvoted a paper 5 months ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 32

upvoted a collection 5 months ago

🍓 Ichigo v0.3

The experimental family designed to train LLMs to understand sound natively. • 6 items • Updated Nov 11, 2024 • 17

upvoted an article 5 months ago

Article

Accelerate 1.0.0

Sep 13, 2024

• 52

upvoted a paper 5 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 145