Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2404.08639

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27

Audio Reading - 2404.08639 - COCONut

Read by Bark: https://drive.google.com/file/d/1qltkY31-013JDQn-u2pmnjPyCaUcOqsV/view?usp=sharing

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27

Papers - Image - Coco - Panoptic

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27
Efficient Transformer Encoders for Mask2Former-style models

Paper • 2404.15244 • Published Apr 23 • 1

Papers - Image - Coco - Annotation RLHF

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27

Papers - Image - Mask - box-kMaX over kMaX-DeepLab

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27

Papers - Image - Coco - Annotation Pipeline

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27

Papers - Image - Dataset - LVIS

Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models

Paper • 2404.07973 • Published Apr 11 • 30
COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27
GLIGEN: Open-Set Grounded Text-to-Image Generation

Paper • 2301.07093 • Published Jan 17, 2023 • 3
Grounded Language-Image Pre-training

Paper • 2112.03857 • Published Dec 7, 2021 • 3

Papers - Image - Coco Testing

Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion

Paper • 2310.03502 • Published Oct 5, 2023 • 77
Transferable and Principled Efficiency for Open-Vocabulary Segmentation

Paper • 2404.07448 • Published Apr 11 • 11
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models

Paper • 2404.07973 • Published Apr 11 • 30
COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27

Papers - ByteDance

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

Paper • 2404.02905 • Published Apr 3 • 64
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

Paper • 2404.07987 • Published Apr 11 • 47
COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Paper • 2402.15627 • Published Feb 23 • 34

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs