Turbo Pascal

TurboPascal

AI & ML interests

None yet

Recent Activity

updated a collection 23 days ago

LLM

3 items • Updated 23 days ago

upvoted a paper 29 days ago

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

Paper • 2603.25562 • Published Mar 26 • 19

updated a collection about 1 month ago

LLM

3 items • Updated 23 days ago

upvoted a collection about 2 months ago

Marco-MoE

A suite of multilingual MoE models with highly-sparse architectures • 5 items • Updated Apr 8 • 17

liked a model 2 months ago

AIDC-AI/Marco-Mini-Global-Base

Text Generation • 17B • Updated Apr 3 • 20 • 7

liked a model 3 months ago

AIDC-AI/Marco-Nano-Base

Text Generation • 8B • Updated Apr 3 • 20 • 15

upvoted 2 papers 3 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 113

liked 4 datasets 3 months ago

MaziyarPanahi/Nemotron-Cascade-2-SFT-Data-Small

Viewer • Updated Mar 22 • 4.9M • 607 • 4

nvidia/Nemotron-Cascade-2-SFT-Data

Viewer • Updated Mar 19 • 15.9M • 9.99k • 68

stepfun-ai/Step-3.5-Flash-SFT

Viewer • Updated Mar 14 • 1.62M • 6.68k • 338

nohurry/Opus-4.6-Reasoning-3000x-filtered

Viewer • Updated Mar 31 • 2.33k • 2.05k • 622

upvoted 2 papers 3 months ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 161

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

liked a model 3 months ago

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21, 2025 • 1.01M • 1.98k

New activity in Alibaba-NLP/new-impl 4 months ago

torch.AcceleratorError: CUDA error: device-side assert triggered

#14 opened 4 months ago by

liked a model 8 months ago

HuggingFaceTB/SmolVLM-256M-Instruct

Image-Text-to-Text • 0.3B • Updated Apr 8, 2025 • 744k • 366

upvoted an article 9 months ago

Training and Finetuning Reranker Models with Sentence Transformers

Article • tomaarsen • Mar 26, 2025 • 195

liked a model 10 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • 36B • Updated Aug 26, 2025 • 30.4k • 502

upvoted a collection 10 months ago

BGE

31 items • Updated Feb 4 • 163