Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2310.16764
common
Collection by Nov 14, 2023
-
  • ConvNets Match Vision Transformers at Scale

    Paper • 2310.16764 • Published Oct 25, 2023 • 21
Transformers
Collection by Oct 27, 2023
-
  • ConvNets Match Vision Transformers at Scale

    Paper • 2310.16764 • Published Oct 25, 2023 • 21
CNN
Collection by Oct 26, 2023
-
  • ConvNets Match Vision Transformers at Scale

    Paper • 2310.16764 • Published Oct 25, 2023 • 21
Generic
Collection by Oct 26, 2023
-
  • ConvNets Match Vision Transformers at Scale

    Paper • 2310.16764 • Published Oct 25, 2023 • 21
Multimodal
Collection by 23 days ago
19
  • Woodpecker: Hallucination Correction for Multimodal Large Language Models

    Paper • 2310.16045 • Published Oct 24, 2023 • 17
  • HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

    Paper • 2310.14566 • Published Oct 23, 2023 • 27
  • SILC: Improving Vision Language Pretraining with Self-Distillation

    Paper • 2310.13355 • Published Oct 20, 2023 • 9
  • Conditional Diffusion Distillation

    Paper • 2310.01407 • Published Oct 2, 2023 • 20
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs