-
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 105 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 78 -
In-Context Learning Creates Task Vectors
Paper • 2310.15916 • Published • 43 -
Matryoshka Diffusion Models
Paper • 2310.15111 • Published • 44
Jacopo Parvizi
neuraloverflow
·
AI & ML interests
Deep Learning, Time series forecasting, Recommender systems
Recent Activity
upvoted
a
paper
11 days ago
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds
upvoted
a
paper
11 days ago
V-Thinker: Interactive Thinking with Images
upvoted
a
paper
11 days ago
Thinking with Video: Video Generation as a Promising Multimodal
Reasoning Paradigm
Organizations
None yet