InfImagine/imagenet_features_1024_sd_vae_ft_ema Viewer • Updated Nov 6, 2024 • 1.44M • 14 • 2
InfImagine/imagenet1k_features_256_sd_vae_ft_ema Viewer • Updated Nov 6, 2024 • 3.09M • 11 • 2
FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model Paper • 2410.13925 • Published Oct 17, 2024 • 23
Adaptive Rotated Convolution for Rotated Object Detection Paper • 2303.07820 • Published Mar 14, 2023
FiT: Flexible Vision Transformer for Diffusion Model Paper • 2402.12376 • Published Feb 19, 2024 • 48
FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model Paper • 2410.13925 • Published Oct 17, 2024 • 23
PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines Paper • 2407.08418 • Published Jul 11, 2024
PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines Paper • 2407.08418 • Published Jul 11, 2024
Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households Paper • 2404.09001 • Published Apr 13, 2024
Diffusion Models Need Visual Priors for Image Generation Paper • 2410.08531 • Published Oct 11, 2024 • 1
Diffusion Models Need Visual Priors for Image Generation Paper • 2410.08531 • Published Oct 11, 2024 • 1
Seeing is not always believing: Benchmarking Human and Model Perception of AI-Generated Images Paper • 2304.13023 • Published Apr 25, 2023 • 1
LLaMA Pro: Progressive LLaMA with Block Expansion Paper • 2401.02415 • Published Jan 4, 2024 • 53
GLaMa: Joint Spatial and Frequency Loss for General Image Inpainting Paper • 2205.07162 • Published May 15, 2022
GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI Paper • 2409.01392 • Published Sep 2, 2024 • 9