The ultimate guide to RL environments: building and scaling them in the LLM era 📝 Building and scaling RL environments for LLM training • Running • 196
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 94
HUMAN-WRITTEN & LEGALLY-SOURCED* Collection • Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis. *...mostly. • 168 items • Updated 26 days ago • 6
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF Image-Text-to-Text • 27B • Updated Apr 6 • 47.2k • 610