- Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
  Paper • 2309.10150 • Published • 25
- In-Context Pretraining: Language Modeling Beyond Document Boundaries
  Paper • 2310.10638 • Published • 30
- Farzi Data: Autoregressive Data Distillation
  Paper • 2310.09983 • Published • 10
- LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
  Paper • 2311.05437 • Published • 50
Mat Miller
matdmiller
AI & ML interests
None yet
Recent Activity
published a Space 9 days ago: cluster-of-stars/TinyStoriesHackathonLeaderboard
updated a Space 9 days ago: matdmiller/ClusterOfStarsTinyHackathon
published a Space 9 days ago: matdmiller/ClusterOfStarsTinyHackathon