MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data Paper • 2406.18790 • Published 25 days ago • 32
Text-to-Image History Collection How Text-to-Image evolved on HF and inspired the Community • 49 items • Updated Jun 6 • 10
view article Article Sentiment Classification with Fully Homomorphic Encryption using Concrete ML Nov 17, 2022 • 2
view article Article CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models May 24 • 21
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1 • 60
MaPa: Text-driven Photorealistic Material Painting for 3D Shapes Paper • 2404.17569 • Published Apr 26 • 11
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated Apr 26 • 23
— UI is a good thing 💅 — Collection cool spaces with a cool UI, what could be better? • 5 items • Updated Jun 18 • 13
view article Article SVGDreamer: Text Guided Vector Graphics Generation with Diffusion Model By xingxm • Apr 19 • 4
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing Paper • 2404.09990 • Published Apr 15 • 12
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 149
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 87
Multimodal Models Collection Multimodal models with leading performance. • 8 items • Updated May 26 • 7
RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS Paper • 2403.13806 • Published Mar 20 • 18
Latent Consistency Model Demos Collection Latent Consistency Models for Stable Diffusion • 8 items • Updated Nov 12, 2023 • 24
VLMs for 3D reconstructions and their evaluation Collection List of papers to help with developing a model that reviews a photogrammetry scan and evaluates its quality • 11 items • Updated Dec 5, 2023 • 2
Biomedical NLP papers Collection Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) • 131 items • Updated 6 days ago • 28
Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey Paper • 2403.01528 • Published Mar 3 • 1
LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 264 items • Updated 29 days ago • 352
Pretrained Text-Generation Models Below 250M Parameters Collection Great candidates for fine-tuning targeting Transformers.js, ordered by number of parameters. • 7 items • Updated May 13 • 7
Soft Prompts Collection Ordered List of Resources to understand soft prompting while covering the basics of discrete prompting as well. • 4 items • Updated Mar 22 • 2
based Collection These language model checkpoints are trained at the 360M and 1.3Bn parameter scales for up to 50Bn tokens on the Pile corpus, for research purposes. • 14 items • Updated May 14 • 8
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs Paper • 2402.11753 • Published Feb 19 • 5
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 87
OWL-series 🦉 Collection Models and applications of OWL-ViT and OWLv2. • 13 items • Updated Mar 11 • 4
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper • 2311.07069 • Published Nov 13, 2023 • 43
ChatAnything: Facetime Chat with LLM-Enhanced Personas Paper • 2311.06772 • Published Nov 12, 2023 • 33
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Paper • 2310.17994 • Published Oct 27, 2023 • 8
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing Paper • 2306.10012 • Published Jun 16, 2023 • 34