-
microsoft/Phi-3-mini-4k-instruct
Text Generation • Updated • 1.14M • 776 -
microsoft/Phi-3-mini-128k-instruct
Text Generation • Updated • 2.05M • 1.41k -
microsoft/Phi-3-small-8k-instruct
Text Generation • Updated • 142k • 115 -
microsoft/Phi-3-small-128k-instruct
Text Generation • Updated • 20.8k • 135
Collections
Discover the best community collections!
Collections including paper arxiv:2404.14219
-
stabilityai/stable-diffusion-3-medium
Text-to-Image • Updated • 2.9M • 3.11k -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 237 -
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper • 2404.14219 • Published • 240 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 255
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 20 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 75 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 135 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 23
-
Attention Is All You Need
Paper • 1706.03762 • Published • 39 -
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 10 -
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
Paper • 2305.13245 • Published • 5 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 237