VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models Paper • 2312.00845 • Published 10 days ago • 34
HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting Paper • 2312.03461 • Published 5 days ago • 12
DiffiT: Diffusion Vision Transformers for Image Generation Paper • 2312.02139 • Published 7 days ago • 9
HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models Paper • 2312.00079 • Published 11 days ago • 11
PG-Video-LLaVA: Pixel Grounding Large Video-Language Models Paper • 2311.13435 • Published 19 days ago • 12
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model Paper • 2311.13231 • Published 19 days ago • 22
ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs Paper • 2311.13600 • Published 19 days ago • 37
Diffusion Model Alignment Using Direct Preference Optimization Paper • 2311.12908 • Published 20 days ago • 32
LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes Paper • 2311.13384 • Published 19 days ago • 45
3D Gaussian Splatting for Real-Time Radiance Field Rendering Paper • 2308.04079 • Published Aug 8 • 136
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation Paper • 2311.12229 • Published 20 days ago • 17
SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering Paper • 2311.12775 • Published 20 days ago • 22
Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression Paper • 2311.10794 • Published 24 days ago • 20
Solving The Travelling Salesmen Problem using HNN and HNN-SA algorithms Paper • 2202.13746 • Published Feb 8, 2022 • 1
Assorted text-to-image diffusion models Collection This collection contains my favorite text-to-image diffusion models. • 8 items • Updated about 1 month ago • 5
Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model Paper • 2311.06214 • Published about 1 month ago • 23
CodeFusion: A Pre-trained Diffusion Model for Code Generation Paper • 2310.17680 • Published Oct 26 • 63
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding Paper • 2310.15308 • Published Oct 23 • 18
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images Paper • 2310.16825 • Published Oct 25 • 19
SILC: Improving Vision Language Pretraining with Self-Distillation Paper • 2310.13355 • Published Oct 20 • 2
BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17 • 68
Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency Paper • 2310.03734 • Published Oct 5 • 11
Aligning Text-to-Image Diffusion Models with Reward Backpropagation Paper • 2310.03739 • Published Oct 5 • 20
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion Paper • 2310.03502 • Published Oct 5 • 73
Enable Language Models to Implicitly Learn Self-Improvement From Data Paper • 2310.00898 • Published Oct 2 • 18
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning Paper • 2309.15091 • Published Sep 26 • 30
Foundation Models for Vision 🧩 Collection Foundation models for computer vision. • 23 items • Updated Sep 29 • 12
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation Paper • 2309.15818 • Published Sep 27 • 16
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models Paper • 2309.14509 • Published Sep 25 • 12
DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion Paper • 2309.12424 • Published Sep 21 • 10
Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model Paper • 2309.03550 • Published Sep 7 • 10
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models Paper • 2309.05793 • Published Sep 11 • 48
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation Paper • 2309.06380 • Published Sep 12 • 28
Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts Paper • 2309.04354 • Published Sep 8 • 12
ProPainter: Improving Propagation and Transformer for Video Inpainting Paper • 2309.03897 • Published Sep 7 • 23
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation Paper • 2309.03549 • Published Sep 7 • 3
Hierarchical Masked 3D Diffusion Model for Video Outpainting Paper • 2309.02119 • Published Sep 5 • 9
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation Paper • 2309.00398 • Published Sep 1 • 17
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 19
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Paper • 2308.05734 • Published Aug 10 • 32
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models Paper • 2308.01825 • Published Aug 3 • 18
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models Paper • 2308.00675 • Published Aug 1 • 33
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning Paper • 2308.00436 • Published Aug 1 • 17
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17 • 162
TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT Paper • 2307.08674 • Published Jul 17 • 42
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models Paper • 2307.02421 • Published Jul 5 • 31