Shravankumar Parunandula's picture

6 7 28

Shravankumar Parunandula

shravankumar147

·

https://shravankumar147.substack.com/

AI & ML interests

Computer Vision, NLP, Graph ML

Recent Activity

liked a model 10 days ago

agents-course/notebooks

liked a Space 18 days ago

nanotron/ultrascale-playbook

new activity 18 days ago

acidtib/Travel-Planning-Agent:Inquires About Prompts YAML Design

View all activity

Organizations

shravankumar147's activity

upvoted an article about 1 month ago

Article

We now support VLMs in smolagents!

Jan 24

• 92

upvoted a paper 3 months ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 61

upvoted a collection 6 months ago

Video

Stability AI's suite of image-to-video models • 5 items • Updated Jan 9 • 78

upvoted 2 papers 6 months ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 83

ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model

Paper • 2408.16767 • Published Aug 29, 2024 • 31

upvoted 2 articles 9 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24, 2024

• 189

Article

🧨 Diffusers welcomes Stable Diffusion 3

Jun 12, 2024

• 94