Artur Daveyan's picture

71 587

Artur Daveyan

ArturD

·

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago

qihoo360/Light-R1-32B

liked a model 25 days ago

Zyphra/Zonos-v0.1-hybrid

liked a model 25 days ago

Zyphra/Zonos-v0.1-transformer

View all activity

Organizations

None yet

ArturD's activity

upvoted a paper about 1 month ago

Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise

Paper • 2501.08331 • Published Jan 14 • 20

upvoted 2 collections about 1 month ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 8 items • Updated 18 days ago • 396

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 16 days ago • 107

upvoted a collection about 2 months ago

AceMath

We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated Jan 17 • 11

upvoted an article 3 months ago

Article

EuroLLM-9B

By

and 5 others •

Dec 2, 2024

• 113

upvoted 2 collections 4 months ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated Feb 10 • 88

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 17 days ago • 96

upvoted a collection 5 months ago

Pangea

A Fully Open Multilingual Multimodal LLM for 39 Languages • 26 items • Updated Feb 1 • 18

upvoted a paper 5 months ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21, 2024 • 44

upvoted 2 collections 5 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Jan 17 • 153

INTELLECT-1 Dataset

INTELLECT-1 Training dataset • 5 items • Updated Oct 8, 2024 • 22

upvoted an article 5 months ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13, 2024

• 130

upvoted 3 papers 5 months ago

Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning

Paper • 2410.00255 • Published Sep 30, 2024 • 5

From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging

Paper • 2410.01215 • Published Oct 2, 2024 • 32

TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices

Paper • 2410.00531 • Published Oct 1, 2024 • 31

upvoted 2 collections 5 months ago

Yi-Coder

4 items • Updated Sep 4, 2024 • 31

BRAG-v0.1

BRAG is a series of SLMs (Small Language Models) specifically trained for RAG tasks. We release models with size 1.5b, 7b and 8b. • 4 items • Updated Aug 4, 2024 • 13

upvoted a collection 6 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 576

upvoted a paper 6 months ago

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24, 2024 • 42

upvoted an article 6 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 184