9 100 20

Dhruv Diddi

ddiddi

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

GetSoloTech/gemma-3-1b-it-GGUF

liked a model 7 days ago

deepseek-ai/DeepSeek-V3-0324

upvoted an article 12 days ago

Transformers.js v3: WebGPU support, new models & tasks, and more…

View all activity

Organizations

ddiddi's activity

liked a model 5 days ago

GetSoloTech/gemma-3-1b-it-GGUF

Updated 6 days ago • 39 • 2

liked a model 7 days ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • Updated 6 days ago • 86.6k • • 2.18k

upvoted an article 12 days ago

Article

Transformers.js v3: WebGPU support, new models & tasks, and more…

Oct 22, 2024

• 72

upvoted a collection 17 days ago

Gemma 3 Release

Collection

17 items • Updated 5 days ago • 302

upvoted 7 papers 20 days ago

LocAgent: Graph-Guided LLM Agents for Code Localization

Paper • 2503.09089 • Published 21 days ago • 8

Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol

Paper • 2503.05860 • Published 25 days ago • 9

AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models

Paper • 2503.08417 • Published 21 days ago • 8

"Principal Components" Enable A New Language of Images

Paper • 2503.08685 • Published 21 days ago • 11

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 22 days ago • 95

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published 22 days ago • 40

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published 22 days ago • 83

liked 2 models 27 days ago

bartowski/Qwen_QwQ-32B-GGUF

Text Generation • Updated 27 days ago • 214k • 150

Qwen/QwQ-32B

Text Generation • Updated 21 days ago • 766k • • 2.6k

published a Space 27 days ago

Solo Qwen QwQ 32B

💬

liked a Space about 1 month ago

Llasa 1b Multilingual TTS

🌍

Generate speech from text with or without cloning a voice

reacted to JingzeShi's post with 🚀 about 1 month ago

Post

2972

🤗Welcome to the Doge Edge Device Small language Model.

SmallDoge/Doge-160M-Instruct

upvoted 2 papers about 2 months ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published Jan 30 • 21

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 114

liked a model 2 months ago

bartowski/Mistral-Small-24B-Instruct-2501-GGUF

Text Generation • Updated Jan 30 • 34k • 111

upvoted a paper 2 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 90