- Can LLMs Follow Simple Rules?
  Paper • 2311.04235 • Published • 10
- The Unreasonable Ineffectiveness of the Deeper Layers
  Paper • 2403.17887 • Published • 78
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
  Paper • 2403.03507 • Published • 183
- Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
  Paper • 2402.17177 • Published • 88
Kevin-Brian N'Diaye
kevin-nd
AI & ML interests
- Computer Vision
- Vision-Language-Action Models
Organizations
Collections
1
Models
None public yet
Datasets
None public yet