- HuggingFaceH4/zephyr-7b-alpha
  Text Generation • Updated • 71.8k • 1.08k
- Exponentially Faster Language Modelling
  Paper • 2311.10770 • Published • 117
- Orca 2: Teaching Small Language Models How to Reason
  Paper • 2311.11045 • Published • 69
- MultiLoRA: Democratizing LoRA for Better Multi-Task Learning
  Paper • 2311.11501 • Published • 32
Collections
Collections including paper arxiv:2311.11501
- BitNet: Scaling 1-bit Transformers for Large Language Models
  Paper • 2310.11453 • Published • 96
- Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
  Paper • 2310.11511 • Published • 68
- In-Context Learning Creates Task Vectors
  Paper • 2310.15916 • Published • 39
- Matryoshka Diffusion Models
  Paper • 2310.15111 • Published • 39

- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 84
- LMDX: Language Model-based Document Information Extraction and Localization
  Paper • 2309.10952 • Published • 63
- Table-GPT: Table-tuned GPT for Diverse Table Tasks
  Paper • 2310.09263 • Published • 38
- BitNet: Scaling 1-bit Transformers for Large Language Models
  Paper • 2310.11453 • Published • 96

- OmnimatteRF: Robust Omnimatte with 3D Background Modeling
  Paper • 2309.07749 • Published • 6
- AudioSR: Versatile Audio Super-resolution at Scale
  Paper • 2309.07314 • Published • 23
- Generative Image Dynamics
  Paper • 2309.07906 • Published • 51
- MagiCapture: High-Resolution Multi-Concept Portrait Customization
  Paper • 2309.06895 • Published • 27