Starstrek

Stars321123

Stars321

Organizations

None yet

Stars321123's activity

upvoted 2 papers about 23 hours ago

CoSTA*: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing

Paper • 2503.10613 • Published 3 days ago • 60

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published 6 days ago • 37

liked a dataset 1 day ago

huggingface/documentation-images

Viewer • Updated 1 day ago • 50 • 4.84M • 54

upvoted 2 papers 1 day ago

DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation

Paper • 2503.10618 • Published 3 days ago • 16

Distilling Diversity and Control in Diffusion Models

Paper • 2503.10637 • Published 3 days ago • 12

liked a model 2 days ago

NousResearch/DeepHermes-3-Llama-3-3B-Preview-GGUF

Updated 3 days ago • 387 • 19

upvoted a paper 2 days ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published 4 days ago • 49

liked a Space 3 days ago

Wan2.1

Space • 💻 Wan: Open and Advanced Large-Scale Video Generative Models • 1.21k

upvoted a collection 3 days ago

Gemma 2 2B Release

Collection

The 2.6B parameter version of Gemma 2. • 6 items • Updated 4 days ago • 79

liked 2 models 3 days ago

google/gemma-3-27b-it

Image-Text-to-Text • Updated 4 days ago • 190k • 651

google/gemma-3-4b-it

Image-Text-to-Text • Updated 4 days ago • 79.8k • 204

upvoted a collection 3 days ago

Gemma 3 Release

Collection

9 items • Updated 3 days ago • 249

liked a model 3 days ago

google/shieldgemma-2b

Text Generation • Updated Aug 28, 2024 • 1.13k • 62

upvoted a collection 3 days ago

ShieldGemma Release

Collection

A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated 4 days ago • 12

upvoted a paper 4 days ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published 10 days ago • 61

liked a dataset 5 days ago

agents-course/certificates

Viewer • Updated 2 minutes ago • 13 • 57.1k • 37

upvoted a paper 5 days ago

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published 6 days ago • 53

upvoted a paper 6 days ago

Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published 9 days ago • 42

liked a Space 6 days ago

The Ultra-Scale Playbook

Space • 🌌 The ultimate guide to training LLM on large GPU Clusters • 2.26k

upvoted a paper 11 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 13 days ago • 72