Collections
Discover the best community collections!
Collections including paper arxiv:2312.11514
-
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
Paper • 2402.14905 • Published • 128 -
Sensor-based Multi-Robot Search and Coverage with Spatial Separation in Unstructured Environments
Paper • 2403.01710 • Published • 2 -
EdgeMoE: Fast On-Device Inference of MoE-based Large Language Models
Paper • 2308.14352 • Published -
Slimmable Encoders for Flexible Split DNNs in Bandwidth and Resource Constrained IoT Systems
Paper • 2306.12691 • Published • 2
-
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation • Updated • 557k • • 4.35k -
HuggingFaceM4/WebSight
Viewer • Updated • 2.75M • 19.8k • 348 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 259 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 244
-
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 244 -
GAIA: a benchmark for General AI Assistants
Paper • 2311.12983 • Published • 192 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 180 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 259
-
TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ
Text Generation • Updated • 4.23k • 318 -
Isonium/WhiteRabbitNeo-33B-v1-GGUF
Updated • 326 • 8 -
Masterjp123/SnowyRP-FinalV1-L2-13B-GPTQ
Text Generation • Updated • 21 • 3 -
GAIA: a benchmark for General AI Assistants
Paper • 2311.12983 • Published • 192