Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
williamcstanford 's Collections
video segmentation
diffusion
RL
robotics
LLMs
video gen
Autonomous agents
Transformer improvements
Music gen
video understanding
brain
MUST FOLLOWS
relighting
singing portraits
Depth Estimation
Cellular Automata DL
Code Understanding
datasets

Autonomous agents

updated Jun 20, 2024
Upvote
-

  • WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

    Paper • 2401.13919 • Published Jan 25, 2024 • 32

  • Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

    Paper • 2401.14405 • Published Jan 25, 2024 • 13

  • Design2Code: How Far Are We From Automating Front-End Engineering?

    Paper • 2403.03163 • Published Mar 5, 2024 • 98

  • LLM Agent Operating System

    Paper • 2403.16971 • Published Mar 25, 2024 • 69

  • AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology

    Paper • 2406.11912 • Published Jun 16, 2024 • 28
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs