Image / Video Gen Collection Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 36 items • Updated 1 day ago • 9
Scaling LLM Pre-training with Vocabulary Curriculum Paper • 2502.17910 • Published 5 days ago • 1 • 2
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published 5 days ago • 54
Cognition Collection Perception and abstraction. Each modality is tokenized and embedded into vectors for the model to comprehend. • 185 items • Updated 5 days ago • 4
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published 11 days ago • 58
You Do Not Fully Utilize Transformer's Representation Capacity Paper • 2502.09245 • Published 17 days ago • 33
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published 12 days ago • 63
ActionPiece: Contextually Tokenizing Action Sequences for Generative Recommendation Paper • 2502.13581 • Published 11 days ago • 5
Autellix: An Efficient Serving Engine for LLM Agents as General Programs Paper • 2502.13965 • Published 11 days ago • 18
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper • 2501.16975 • Published Jan 28 • 26