Adithya S K's picture

Adithya S K PRO

AdithyaSK

·

https://adithyask.com/

AI & ML interests

finetuning models to perform specific task and deploying them into production

Recent Activity

updated a collection 2 days ago

updated a model 3 days ago

AdithyaSK/ViViDLayout3b_v1

published a model 3 days ago

AdithyaSK/ViViDLayout3b_v1

View all activity

Organizations

AdithyaSK's activity

upvoted a paper 2 months ago

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Paper • 2412.19326 • Published Dec 26, 2024 • 18

upvoted a collection 3 months ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 141

upvoted a paper 3 months ago

ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Paper • 2411.18363 • Published Nov 27, 2024 • 10

upvoted a paper 4 months ago

LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2, 2024 • 26

upvoted a paper 5 months ago

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

Paper • 2109.10282 • Published Sep 21, 2021 • 6

upvoted a collection 5 months ago

Medical Multimodal Datasets

Datasets that can be used to train and/or evaluate medical multimodal models. • 3 items • Updated Dec 9, 2023 • 2

upvoted a paper 10 months ago

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published May 2, 2024 • 55

upvoted a paper about 1 year ago

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12, 2024 • 47

upvoted 2 papers over 1 year ago

VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

Paper • 2309.15091 • Published Sep 26, 2023 • 33

Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 50