view article Article Ο0 and Ο0-FAST: Vision-Language-Action Models for General Robot Control 26 days ago β’ 109
DynaSaur: Large Language Agents Beyond Predefined Actions Paper β’ 2411.01747 β’ Published Nov 4, 2024 β’ 28
Executable Code Actions Elicit Better LLM Agents Paper β’ 2402.01030 β’ Published Feb 1, 2024 β’ 83
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 β’ 793
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram β’ Dec 4, 2024 β’ 77
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published Dec 4, 2024 β’ 129
ColPali: Efficient Document Retrieval with Vision Language Models Paper β’ 2407.01449 β’ Published Jun 27, 2024 β’ 44
Transformer Explainer: Interactive Learning of Text-Generative Models Paper β’ 2408.04619 β’ Published Aug 8, 2024 β’ 159
view article Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 β’ 44
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & VLLM π€ β’ 9 items β’ Updated Sep 26, 2024 β’ 56
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models β’ 11 items β’ Updated Dec 6, 2024 β’ 650
view article Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context Jul 23, 2024 β’ 228
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models Paper β’ 2407.12327 β’ Published Jul 17, 2024 β’ 78
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize β’ 7 items β’ Updated 19 days ago β’ 75