Gurumurthi V Ramanan's picture

135 435

Gurumurthi V Ramanan

GVR

·

https://surasys.co

AI & ML interests

ML

Recent Activity

liked a model 5 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

liked a model 16 days ago

deepseek-ai/Janus-Pro-7B

liked a model 16 days ago

deepseek-ai/DeepSeek-R1

View all activity

Organizations

GVR's activity

upvoted a paper 27 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 90

upvoted 2 articles 27 days ago

Article

Visual Document Retrieval Goes Multilingual

Jan 10

• 68

Article

Mastering Tensor Dimensions in Transformers

By

•

Jan 12

• 44

upvoted a paper 27 days ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 59

upvoted 2 papers about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 255

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 88

upvoted an article about 1 month ago

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

By

•

Jan 3

• 33

upvoted 2 papers about 1 month ago

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 46

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 97

upvoted 2 collections about 1 month ago

HuatuoGPT-o1

4 items • Updated Dec 30, 2024 • 15

QVQ-72B-Preview

5 items • Updated Dec 24, 2024 • 7

upvoted a collection about 2 months ago

InternVL2.5-MPO

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 18 days ago • 26

upvoted a paper about 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

upvoted 3 collections 2 months ago

DeepSeek-VL2

5 items • Updated 7 days ago • 67

Llama 3.3

5 items • Updated Dec 6, 2024 • 7

Llama 3.3 (All Versions)

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 12 days ago • 36

upvoted an article 2 months ago

Article

Use Models from the Hugging Face Hub in LM Studio

By

•

Nov 28, 2024

• 139

upvoted a collection 3 months ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 4 days ago • 90

upvoted a paper 3 months ago

Multi-Granularity Prediction for Scene Text Recognition

Paper • 2209.03592 • Published Sep 8, 2022 • 2

upvoted a collection 3 months ago

OpenScholar_V1

The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 33