Starstrek

Stars321123

Stars321

AI & ML interests

Recent Activity

liked a model about 16 hours ago

trashpanda-org/QwQ-32B-Snowdrop-v0


Organizations

None yet

Stars321123's activity

upvoted 2 papers 1 day ago

CoSTA*: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing

Paper • 2503.10613 • Published 3 days ago • 64

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published 7 days ago • 37

upvoted 2 papers 2 days ago

DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation

Paper • 2503.10618 • Published 3 days ago • 16

Distilling Diversity and Control in Diffusion Models

Paper • 2503.10637 • Published 3 days ago • 12

upvoted a paper 3 days ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published 4 days ago • 53

upvoted 3 collections 4 days ago

upvoted 2 papers 5 days ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published 10 days ago • 61

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published 7 days ago • 53

upvoted a paper 7 days ago

Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published 10 days ago • 42

upvoted a paper 12 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 14 days ago • 72

upvoted a paper 19 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 19 days ago • 69

upvoted an article 24 days ago

SigLIP 2: A better multilingual vision language encoder

Article • 24 days ago • 137

upvoted a collection 24 days ago

PaliGemma 2 Release

Collection • Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 5 days ago • 145

upvoted an article 25 days ago

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Article • 26 days ago • 65

upvoted 3 papers 25 days ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published 26 days ago • 56

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 25 days ago • 164

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published 28 days ago • 52

upvoted a paper 27 days ago

DarwinLM: Evolutionary Structured Pruning of Large Language Models

Paper • 2502.07780 • Published Feb 11 • 17