Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints 21 days ago • 51
MaPa: Text-driven Photorealistic Material Painting for 3D Shapes Paper • 2404.17569 • Published 26 days ago • 10
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated 26 days ago • 21
UI is a good thing Collection cool spaces with a cool UI, what could be better? • 4 items • Updated 27 days ago • 10
Article SVGDreamer: Text Guided Vector Graphics Generation with Diffusion Model By xingxm • Apr 19 • 2
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing Paper • 2404.09990 • Published Apr 15 • 11
Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 129
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 16 days ago • 77
Multimodal Models Collection Multimodal models with leading performance. • 5 items • Updated Apr 11 • 4
HyperGraph Datasets Collection Collection of HyperGraph Datasets • 17 items • Updated Apr 4 • 7
RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS Paper • 2403.13806 • Published Mar 20 • 18
Latent Consistency Model Demos Collection Latent Consistency Models for Stable Diffusion • 8 items • Updated Nov 12, 2023 • 24
VLMs for 3D reconstructions and their evaluation Collection List of papers to help with developing a model that reviews a photogrammetry scan and evaluates its quality • 11 items • Updated Dec 5, 2023 • 2
Biomedical NLP papers Collection Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) • 109 items • Updated 1 day ago • 23
Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey Paper • 2403.01528 • Published Mar 3 • 1
TnT-LLM: Text Mining at Scale with Large Language Models Paper • 2403.12173 • Published Mar 18 • 17
LLM Leaderboard best models ❤️‍🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 70 items • Updated 6 days ago • 308
Pretrained Text-Generation Models Below 250M Parameters Collection Great candidates for fine-tuning targeting Transformers.js, ordered by number of parameters. • 7 items • Updated 9 days ago • 6
Soft Prompts Collection Ordered List of Resources to understand soft prompting while covering the basics of discrete prompting as well. • 4 items • Updated Mar 22 • 2
based Collection These language model checkpoints are trained at the 360M and 1.3Bn parameter scales for up to 50Bn tokens on the Pile corpus, for research purposes. • 14 items • Updated 7 days ago • 8
FiT: Flexible Vision Transformer for Diffusion Model Paper • 2402.12376 • Published Feb 19 • 46
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs Paper • 2402.11753 • Published Feb 19 • 4
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 77
OWL-series 🦉 Collection Models and applications of OWL-ViT and OWLv2. • 13 items • Updated Mar 11 • 3
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper • 2311.07069 • Published Nov 13, 2023 • 43
ChatAnything: Facetime Chat with LLM-Enhanced Personas Paper • 2311.06772 • Published Nov 12, 2023 • 33
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Paper • 2310.17994 • Published Oct 27, 2023 • 7
LP-MusicCaps: LLM-Based Pseudo Music Captioning Paper • 2307.16372 • Published Jul 31, 2023 • 33
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing Paper • 2306.10012 • Published Jun 16, 2023 • 33