27 17 79

Susnato Dhar

susnato

susnato

AI & ML interests

CV, NLP

Recent Activity

updated a dataset 8 days ago

susnato/synthetic-data-code-ir

published a dataset 8 days ago

susnato/synthetic-data-code-ir

updated a dataset 8 days ago

susnato/synthetic-data-code-ir-test

View all activity

Organizations

None yet

susnato's activity

updated a dataset 8 days ago

susnato/synthetic-data-code-ir

Viewer • Updated 8 days ago • 6.39k • 77

published a dataset 8 days ago

susnato/synthetic-data-code-ir

Viewer • Updated 8 days ago • 6.39k • 77

updated a dataset 8 days ago

susnato/synthetic-data-code-ir-test

Viewer • Updated 8 days ago • 16 • 74

published a dataset 8 days ago

susnato/synthetic-data-code-ir-test

Viewer • Updated 8 days ago • 16 • 74

updated a dataset 8 days ago

susnato/synthetic-dataset-test-2

Viewer • Updated 8 days ago • 121 • 77

published a dataset 8 days ago

susnato/synthetic-dataset-test-2

Viewer • Updated 8 days ago • 121 • 77

updated a dataset 8 days ago

susnato/synthetic-dataset-test

Viewer • Updated 8 days ago • 239 • 76

published a dataset 8 days ago

susnato/synthetic-dataset-test

Viewer • Updated 8 days ago • 239 • 76

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • Updated Mar 27 • 312k • • 2.79k

upvoted a paper about 1 month ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 61

liked a dataset about 1 month ago

ServiceNow-AI/R1-Distill-SFT

Viewer • Updated Feb 8 • 1.85M • 2.14k • 295

upvoted a paper about 2 months ago

ReAct: Synergizing Reasoning and Acting in Language Models

Paper • 2210.03629 • Published Oct 6, 2022 • 25

liked 2 models about 2 months ago

open-r1/OlympicCoder-32B

Text Generation • Updated Mar 17 • 1.87k • 150

Qwen/QwQ-32B

Text Generation • Updated Mar 11 • 645k • • 2.73k

liked a dataset about 2 months ago

reasoning-course/supervised-finetuning_quiz_student_responses

Viewer • Updated Feb 26 • 10 • 54 • 2

liked a model 2 months ago

Qwen/Qwen2.5-32B-Instruct

Text Generation • Updated Sep 25, 2024 • 421k • • 269

liked 2 Spaces 2 months ago

Predict Memory

🧮

Calculate memory usage from model configurations

2.53k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters