- mistralai/Mixtral-8x7B-Instruct-v0.1
  Text Generation • Updated • 787k • 3.99k
- HuggingFaceM4/WebSight
  Viewer • Updated • 2.75M • 7.16k • 303
- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
  Paper • 2312.11514 • Published • 255
- Llama 2: Open Foundation and Fine-Tuned Chat Models
  Paper • 2307.09288 • Published • 238
Collections
Collections including paper arxiv:2312.11514
- Llama 2: Open Foundation and Fine-Tuned Chat Models
  Paper • 2307.09288 • Published • 238
- GAIA: a benchmark for General AI Assistants
  Paper • 2311.12983 • Published • 176
- DocLLM: A layout-aware generative language model for multimodal document understanding
  Paper • 2401.00908 • Published • 178
- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
  Paper • 2312.11514 • Published • 255
- TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ
  Text Generation • Updated • 3.8k • 310
- Isonium/WhiteRabbitNeo-33B-v1-GGUF
  Updated • 116 • 7
- Masterjp123/SnowyRP-FinalV1-L2-13B-GPTQ
  Text Generation • Updated • 1 • 3
- GAIA: a benchmark for General AI Assistants
  Paper • 2311.12983 • Published • 176
- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
  Paper • 2312.11514 • Published • 255
- Magicoder: Source Code Is All You Need
  Paper • 2312.02120 • Published • 78
- Mixtral of Experts
  Paper • 2401.04088 • Published • 156
- Chain-of-Thought Reasoning Without Prompting
  Paper • 2402.10200 • Published • 92