Papers - Audio - Embedding - Time - Sinusoidal Cross Attensi Collection by matlok Apr 18 - Long-form music generation with latent diffusion Paper • 2404.10301 • Published Apr 16 • 24
Papers - Audio - RoPE Collection by matlok Apr 17 - Long-form music generation with latent diffusion Paper • 2404.10301 • Published Apr 16 • 24
Papers - Audio - Decoders - DAC - No tanh activation The DAC decoder tanh caused harmonic distortion Collection by matlok Apr 17 - Long-form music generation with latent diffusion Paper • 2404.10301 • Published Apr 16 • 24
Papers - Audio - Activation - Snake Collection by matlok Apr 17 - Long-form music generation with latent diffusion Paper • 2404.10301 • Published Apr 16 • 24
Papers - Stability AI Collection by matlok Apr 17 - Long-form music generation with latent diffusion Paper • 2404.10301 • Published Apr 16 • 24
Imagen Collection by Alpra12 Sep 1 1 Relightify: Relightable 3D Faces from a Single Image via Diffusion Models Paper • 2305.06077 • Published May 10, 2023 • 2 Runtime error 1 👨🦰🔀👨🦰 FaceFusion DeepFake AI Tool for Videos & Images
Relightify: Relightable 3D Faces from a Single Image via Diffusion Models Paper • 2305.06077 • Published May 10, 2023 • 2
Diffusion models Collection by Salwa-Zeitoun Apr 18 1 UniFL: Improve Stable Diffusion via Unified Feedback Learning Paper • 2404.05595 • Published Apr 8 • 23 Aligning Diffusion Models by Optimizing Human Utility Paper • 2404.04465 • Published Apr 6 • 13
UniFL: Improve Stable Diffusion via Unified Feedback Learning Paper • 2404.05595 • Published Apr 8 • 23
Testing Collection by elchinm Apr 17 - facebook/mbart-large-50-many-to-many-mmt Translation • Updated Sep 28, 2023 • 255k • 317 facebook/mms-tts-azb Text-to-Speech • Updated Sep 1, 2023 • 130 DrishtiSharma/whisper-large-v2-azerbaijani Automatic Speech Recognition • Updated Dec 21, 2022 • 75 hajili/zephyr-7b-beta-azerbaijani-dolly-instruct Updated Nov 22, 2023 • 18 • 2