yamayou
's Collections
Beyond A*: Better Planning with Transformers via Search Dynamics
Bootstrapping
Paper
•
2402.14083
•
Published
•
47
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
•
2402.17764
•
Published
•
605
Genie: Generative Interactive Environments
Paper
•
2402.15391
•
Published
•
70
Humanoid Locomotion as Next Token Prediction
Paper
•
2402.19469
•
Published
•
26
ViTAR: Vision Transformer with Any Resolution
Paper
•
2403.18361
•
Published
•
52
Simulating Classroom Education with LLM-Empowered Agents
Paper
•
2406.19226
•
Published
•
30
MIRAI: Evaluating LLM Agents for Event Forecasting
Paper
•
2407.01231
•
Published
•
16
Prithvi WxC: Foundation Model for Weather and Climate
Paper
•
2409.13598
•
Published
•
40
Selective Attention Improves Transformer
Paper
•
2410.02703
•
Published
•
23
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Paper
•
2411.17465
•
Published
•
77
Chimera: Improving Generalist Model with Domain-Specific Experts
Paper
•
2412.05983
•
Published
•
9
Multimodal Latent Language Modeling with Next-Token Diffusion
Paper
•
2412.08635
•
Published
•
41
Large Action Models: From Inception to Implementation
Paper
•
2412.10047
•
Published
•
31
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
•
2412.09871
•
Published
•
81
AnySat: An Earth Observation Model for Any Resolutions, Scales, and
Modalities
Paper
•
2412.14123
•
Published
•
11