Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

multimodal-transfer

Activity Feed

AI & ML interests

None defined yet.

Grace Luo's profile picture Amir Bar's profile picture

amirbar1 
authored 8 papers 3 months ago

A Cookbook of Self-Supervised Learning

Paper • 2304.12210 • Published Apr 24, 2023 • 4

Sequential Modeling Enables Scalable Learning for Large Vision Models

Paper • 2312.00785 • Published Dec 1, 2023 • 1

Visual Prompting via Image Inpainting

Paper • 2209.00647 • Published Sep 1, 2022 • 1

EgoPet: Egomotion and Interaction Data from an Animal's Perspective

Paper • 2404.09991 • Published Apr 15, 2024

Task Vectors are Cross-Modal

Paper • 2410.22330 • Published Oct 29, 2024 • 11

Navigation World Models

Paper • 2412.03572 • Published Dec 4, 2024 • 1

Forgotten Polygons: Multimodal Large Language Models are Shape-Blind

Paper • 2502.15969 • Published Feb 21 • 2

Scaling Language-Free Visual Representation Learning

Paper • 2504.01017 • Published Apr 1 • 31
g-luo 
authored 3 papers 8 months ago

Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence

Paper • 2305.14334 • Published May 23, 2023 • 1

Readout Guidance: Learning Control from Diffusion Features

Paper • 2312.02150 • Published Dec 4, 2023 • 3

Task Vectors are Cross-Modal

Paper • 2410.22330 • Published Oct 29, 2024 • 11
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs