1 31 3

Jonathan LYS

jonathan-lys

jonathanlys01

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

upvoted a paper 29 days ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

upvoted a collection 29 days ago

Generalized Interpolating Discrete Diffusion

View all activity

Organizations

jonathan-lys's activity

upvoted a paper 2 days ago

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published 6 days ago • 20

upvoted a paper 29 days ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 94

upvoted a collection 29 days ago

Generalized Interpolating Discrete Diffusion

Collection

6 items • Updated Mar 4 • 4

upvoted 2 papers 2 months ago

Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 47

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 137

upvoted a paper 4 months ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 54

liked a Space 4 months ago

547

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

upvoted 2 papers 4 months ago

SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance

Paper • 2412.02687 • Published Dec 3, 2024 • 114

TinyFusion: Diffusion Transformers Learned Shallow

Paper • 2412.01199 • Published Dec 2, 2024 • 14

upvoted 2 papers 5 months ago

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 16

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 82

upvoted 3 papers 6 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 94

Pixtral 12B

Paper • 2410.07073 • Published Oct 9, 2024 • 66

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 150

liked a Space 8 months ago

ASR Comparaison

🦀

upvoted 2 papers 9 months ago

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 115

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 115

upvoted a paper 11 months ago

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31, 2024 • 68