Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models Paper • 2405.20541 • Published 12 days ago • 15
Terminus XL Collection v-prediction SDXL clone with zero-terminal SNR noise schedule • 8 items • Updated Apr 24 • 6
🎭 Avatars Collection The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 41 items • Updated 7 days ago • 52
Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning Paper • 2305.18424 • Published May 28, 2023 • 1
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy Paper • 2310.01334 • Published Oct 2, 2023 • 3
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models Paper • 2310.02998 • Published Oct 4, 2023 • 1
Unraveling the Key Components of OOD Generalization via Diversification Paper • 2312.16313 • Published Dec 26, 2023 • 1
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization Paper • 2401.15914 • Published Jan 29 • 7
Ferret: Refer and Ground Anything Anywhere at Any Granularity Paper • 2310.07704 • Published Oct 11, 2023 • 10
ConjNorm: Tractable Density Estimation for Out-of-Distribution Detection Paper • 2402.17888 • Published Feb 27 • 1
Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes Paper • 2310.01840 • Published Oct 3, 2023 • 1
Protein Design & Protein Structure Prediction Collection Interactive Demos that can be used for protein structure prediction using AlphaFold2 or RoseTTAfold2, prediction of small metal ions • 7 items • Updated Sep 18, 2023 • 4
Spaces of the Week Collection My Spaces, or Spaces I worked on, that were featured on Spaces of the Week! The oldest are at the top, the newest at the bottom 🤗 • 6 items • Updated Apr 29 • 2
🚂 SD-XL Training Suite Collection All the steps to train your own SD-XL custom model • 5 items • Updated Feb 20 • 14
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated Apr 26 • 23
Experimental Projects Collection Spaces that are too early or cutting edge for mainstream usage 🙂 • 4 items • Updated Nov 16, 2023 • 5
〽️MistralAI Collection A collection of MistralAI models that you can trust in production! • 10 items • Updated 8 days ago • 7
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 84
GoMVS: Geometrically Consistent Cost Aggregation for Multi-View Stereo Paper • 2404.07992 • Published Apr 11 • 2
InfMLLM: A Unified Framework for Visual-Language Tasks Paper • 2311.06791 • Published Nov 12, 2023 • 2
MobileVLM: A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices Paper • 2312.16886 • Published Dec 28, 2023 • 18
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models Paper • 2312.02949 • Published Dec 5, 2023 • 8
Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models Paper • 2312.06109 • Published Dec 11, 2023 • 19
Small Language Model Meets with Reinforced Vision Vocabulary Paper • 2401.12503 • Published Jan 23 • 30
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments Paper • 2404.07972 • Published Apr 11 • 41
Transformers.js demos Collection A collection of my favorite WebML demos, built with Transformers.js! • 23 items • Updated May 8 • 41
SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing Paper • 2404.05717 • Published Apr 8 • 23
Open-source speech datasets annotated using Data-Speech Collection Open-source annotated speech datasets ranging from 1,000 hours to, soon, 50,000 hours. • 7 items • Updated 27 days ago • 3
RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS Paper • 2403.13806 • Published Mar 20 • 18
Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering Paper • 2403.14554 • Published Mar 21 • 12
EndoGSLAM: Real-Time Dense Reconstruction and Tracking in Endoscopic Surgeries using Gaussian Splatting Paper • 2403.15124 • Published Mar 22 • 1
My NLP Spaces Collection Hugging Face transformers fine-tuned for various NLP tasks using TensorFlow. • 13 items • Updated Apr 9 • 1
Getting it Right: Improving Spatial Consistency in Text-to-Image Models Paper • 2404.01197 • Published Apr 1 • 29
StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control Paper • 2403.09055 • Published Mar 14 • 24
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper • 2403.13372 • Published Mar 20 • 58
Trending 3D and Depth Demos Collection One place to keep track of all 3D and Depth demos • 14 items • Updated Apr 17 • 16
Generic 3D Diffusion Adapter Using Controlled Multi-View Editing Paper • 2403.12032 • Published Mar 18 • 14
Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts Paper • 2403.08268 • Published Mar 13 • 15
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 28 days ago • 312
BERT release Collection Groups the original BERT models released by the Google team. Unless marked otherwise, the checkpoints support English. • 8 items • Updated 28 days ago • 15