-
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Paper • 2402.10176 • Published • 33 -
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Paper • 2402.19427 • Published • 50 -
Beyond Language Models: Byte Models are Digital World Simulators
Paper • 2402.19155 • Published • 46 -
Matryoshka Representation Learning
Paper • 2205.13147 • Published • 7
Vinit
vinit97
AI & ML interests
None yet
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet