Reza Sayar PRO

Reza2kn

Recent Activity

liked a Space about 11 hours ago

MCILAB/LLM_Alignment_Evaluation

Reza2kn's activity

upvoted 2 papers about 11 hours ago

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Paper • 2504.16030 • Published 2 days ago • 20

Kuwain 1.5B: An Arabic SLM via Language Injection

Paper • 2504.15120 • Published 3 days ago • 102

upvoted 4 papers 1 day ago

A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis

Paper • 2504.12322 • Published 13 days ago • 27

FocusedAD: Character-centric Movie Audio Description

Paper • 2504.12157 • Published 8 days ago • 9

MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space

Paper • 2504.13835 • Published 6 days ago • 35

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

Paper • 2504.15280 • Published 3 days ago • 18

upvoted an article 2 days ago

17 Reasons Why Gradio Isn't Just Another UI Library

Article • 9 days ago • 19

upvoted a collection 7 days ago

Perception Encoder

Collection • 9 items • Updated 7 days ago • 23

upvoted an article 11 days ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Article • Feb 4 • 144

upvoted a collection 13 days ago

InternVL3

Collection • 34 items • Updated 4 days ago • 55

upvoted a paper 14 days ago

OmniCaptioner: One Captioner to Rule Them All

Paper • 2504.07089 • Published 15 days ago • 20

upvoted 2 papers 15 days ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published 16 days ago • 61

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published 16 days ago • 151

upvoted a paper 16 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 17 days ago • 171

upvoted a collection 17 days ago

Black Swan (Abductive and Defeasible Reasoning)

Collection • Data for CVPR 2025 paper, "Black Swan: Abductive and Defeasible Video Reasoning in Unpredictable Events" • 3 items • Updated Mar 22 • 2

upvoted a paper 17 days ago

MedSAM2: Segment Anything in 3D Medical Images and Videos

Paper • 2504.03600 • Published 20 days ago • 8