view article Article SUPIR Full Tutorial + 1 Click 12GB VRAM Windows & RunPod / Linux Installer + Batch Upscale + Comparison With Magnific - SUPIR Starts A New Era By MonsterMMORPG β’ Feb 28, 2024 β’ 10
iFormer: Integrating ConvNet and Transformer for Mobile Application Paper β’ 2501.15369 β’ Published 21 days ago β’ 12
Molmo Collection Artifacts for open multimodal language models. β’ 5 items β’ Updated 6 days ago β’ 294
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 11 items β’ Updated 4 days ago β’ 90
π§ Abliteration Collection Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration β’ 7 items β’ Updated Nov 18, 2024 β’ 28
Transcription Collection Transcribe interviews for free with Whisper in Spaces. β’ 10 items β’ Updated Oct 1, 2024 β’ 8
Mantis Collection Mantis model family optimized for multi-image reasoning with interleaved text/image format β’ 11 items β’ Updated Jul 2, 2024 β’ 9
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models Paper β’ 2403.02246 β’ Published Mar 4, 2024 β’ 1
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation Paper β’ 2403.16422 β’ Published Mar 25, 2024 β’ 1
Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians Paper β’ 2403.17898 β’ Published Mar 26, 2024 β’ 15
GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers Paper β’ 2409.04196 β’ Published Sep 6, 2024 β’ 14
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper β’ 2409.02634 β’ Published Sep 4, 2024 β’ 93
Sapiens Collection Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens β’ 72 items β’ Updated Sep 18, 2024 β’ 55
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24, 2024 β’ 186