Eungbean Lee

Eungbean

https://eungbean.com

AI & ML interests

Computer Vision

Recent Activity

liked a Space 26 days ago

gokaygokay/Flux-TRELLIS

liked a model 29 days ago

ostris/ip-composition-adapter

liked a model 29 days ago

ostris/OpenFLUX.1

View all activity

Organizations

Eungbean's activity

liked a Space 26 days ago

FLUX TRELLIS

🏢

3D Generation from text prompts

liked 2 models 29 days ago

ostris/ip-composition-adapter

Text-to-Image • Updated Mar 20, 2024 • 174

ostris/OpenFLUX.1

Text-to-Image • Updated Oct 3, 2024 • 6.24k • 648

liked a model 8 months ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 6.12M • • 3.75k

liked a model 9 months ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 22.6k • • 4.72k

updated a collection 9 months ago

T2I

Collection

2 items • Updated Jun 13, 2024

updated a collection 11 months ago

T2I

Collection

2 items • Updated Jun 13, 2024

liked a model 11 months ago

meta-llama/Meta-Llama-3-8B

Text Generation • Updated Sep 27, 2024 • 420k • 6.08k

reacted to multimodalart's post with 👍 11 months ago

Post

The Stable Diffusion 3 research paper broken down, including some overlooked details! 📝

Model
📏 2 base model variants mentioned: 2B and 8B sizes

📐 New architecture in all abstraction levels:
- 🔽 UNet; ⬆️ Multimodal Diffusion Transformer, bye cross attention 👋
- 🆕 Rectified flows for the diffusion process
- 🧩 Still a Latent Diffusion Model

📄 3 text-encoders: 2 CLIPs, one T5-XXL; plug-and-play: removing the larger one maintains competitiveness

🗃️ Dataset was deduplicated with SSCD which helped with memorization (no more details about the dataset tho)

Variants
🔁 A DPO fine-tuned model showed great improvement in prompt understanding and aesthetics
✏️ An Instruct Edit 2B model was trained, and learned how to do text-replacement

Results
✅ State of the art in automated evals for composition and prompt understanding
✅ Best win rate in human preference evaluation for prompt understanding, aesthetics and typography (missing some details on how many participants and the design of the experiment)

Paper: https://stabilityai-public-packages.s3.us-west-2.amazonaws.com/Stable+Diffusion+3+Paper.pdf

3 replies