Apolinário from multimodal AI art's picture

Apolinário from multimodal AI art PRO

multimodalart

AI & ML interests

None yet

Recent Activity

Articles

Organizations

Hugging Face's profile picture Google's profile picture Naver Papago's profile picture pix2pix-zero-library's profile picture 🧨Diffusers's profile picture AI FILMS's profile picture Gradio Client Demos's profile picture Adobe Research's profile picture ARC Lab, Tencent PCG's profile picture ControlNet 1.1 Preview's profile picture Augmented Imagination Hackathon's profile picture RWKV's profile picture AutoTrain Projects's profile picture ELITE's profile picture Data Days Zurich's profile picture HuggingFaceM4's profile picture Open-Source AI Meetup's profile picture lora concepts library's profile picture (De)fusing's profile picture Huggingface Projects's profile picture Tune a video concepts library's profile picture CompVis's profile picture Hugging Face H4's profile picture Stability AI's profile picture Hugging Face OSS Metrics's profile picture Weizmann Institute of Science's profile picture Invoke's profile picture CompVis Community's profile picture Stable Diffusion concepts library's profile picture DeepFloyd's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture Diffusers Pipelines Library for Stable Diffusion's profile picture Testing org's profile picture temp-org's profile picture Kandinsky Community's profile picture Blog-explorers's profile picture WARP's profile picture Hands-On Generative AI with Transformers and Diffusion Models's profile picture Editing Images's profile picture ICCV2023's profile picture leditsplusplus's profile picture DeepLearning AI courses's profile picture Enterprise Explorers's profile picture GLITCH's profile picture CommonCanvas's profile picture Editable Dance Generation From Music's profile picture Latent Consistency's profile picture rtemp's profile picture StabilityAI_HuggingFace's profile picture OS Llamas Test's profile picture TTS Eval (OLD)'s profile picture Editing Audio's profile picture EDGE Editable Dance Generation's profile picture InstantX's profile picture Spaces Playground's profile picture Llamas vs Capybaras's profile picture TTS AGI's profile picture Social Post Explorers's profile picture +RAIN film festival's profile picture Top Contributors: Space Likes's profile picture zero gpu hacking's profile picture diffusers-internal-dev's profile picture Tencent Hunyuan's profile picture AuraFlow's profile picture rnri-inversion's profile picture Snapchat Inc.'s profile picture OpenCapybara's profile picture Latent Explorers's profile picture ZP's profile picture Meta Llama's profile picture flux train's profile picture Hugging Face FineVideo's profile picture levelsio LoRAs's profile picture Pyramid Flow's profile picture glitch 2024's profile picture RF Inversion's profile picture HunyuanVideo Community's profile picture

Posts 5

view post
Post
25358
The first open Stable Diffusion 3-like architecture model is JUST out 💣 - but it is not SD3! 🤔

It is Tencent-Hunyuan/HunyuanDiT by Tencent, a 1.5B parameter DiT (diffusion transformer) text-to-image model 🖼️✨, trained with multi-lingual CLIP + multi-lingual T5 text-encoders for english 🤝 chinese understanding

Try it out by yourself here ▶️ https://huggingface.co/spaces/multimodalart/HunyuanDiT
(a bit too slow as the model is chunky and the research code isn't super optimized for inference speed yet)

In the paper they claim to be SOTA open source based on human preference evaluation!