
Collections

Collections including paper arxiv:2403.16971

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 62
AutoDev: Automated AI-Driven Development

Paper • 2403.08299 • Published Mar 13 • 1

Papers - Agent - Operating Systems

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 62
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11 • 40

Papers - Agent - Memory

Cognitive Architectures for Language Agents

Paper • 2309.02427 • Published Sep 5, 2023 • 2
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 62

Papers - Agent - Architecture

Cognitive Architectures for Language Agents

Paper • 2309.02427 • Published Sep 5, 2023 • 2
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 62
Scaling Instructable Agents Across Many Simulated Worlds

Paper • 2404.10179 • Published Mar 13 • 23

To read... eventually

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 119
Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19 • 44
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

Paper • 2402.03766 • Published Feb 6 • 9
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 62

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Paper • 2403.07816 • Published Mar 12 • 37
microsoft/phi-1_5

Text Generation • Updated 21 days ago • 123k • 1.28k
Language models scale reliably with over-training and on downstream tasks

Paper • 2403.08540 • Published Mar 13 • 13
Akashpb13/Swahili_xlsr

Automatic Speech Recognition • Updated Aug 27, 2023 • 503 • 7

SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6 • 66
Character-LLM: A Trainable Agent for Role-Playing

Paper • 2310.10158 • Published Oct 16, 2023 • 1
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 62
RakutenAI-7B: Extending Large Language Models for Japanese

Paper • 2403.15484 • Published Mar 21 • 12

ibm/AttaQ

Viewer • Updated Jan 26 • 1.2k • 4
ibm/merlinite-7b

Text Generation • Updated Mar 5 • 14.8k • 99
microsoft/Orca-2-13b

Text Generation • Updated Nov 22, 2023 • 21.2k • 649
snorkelai/snorkel-curated-instruction-tuning

Preview • Updated Mar 11 • 2 • 9

Papers - Training Research

Measuring the Effects of Data Parallelism on Neural Network Training

Paper • 1811.03600 • Published Nov 8, 2018 • 2
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost

Paper • 1804.04235 • Published Apr 11, 2018 • 2
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

Paper • 1905.11946 • Published May 28, 2019 • 2
Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7 • 59

Evaluating Very Long-Term Conversational Memory of LLM Agents

Paper • 2402.17753 • Published Feb 27 • 17
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Paper • 2402.16671 • Published Feb 26 • 26
Do Large Language Models Latently Perform Multi-Hop Reasoning?

Paper • 2402.16837 • Published Feb 26 • 24
Divide-or-Conquer? Which Part Should You Distill Your LLM?

Paper • 2402.15000 • Published Feb 22 • 22
