will's picture

will PRO

wrice

·

AI & ML interests

Interested in the applications of generative models for Speech Synthesis and NLP.

Organizations

wrice's activity

upvoted a collection 27 days ago

Fastest timm models > 80% Top-1 ImageNet-1k

Fastest image classification models with 80% accuracy in ImageNet-1k . • 21 items • Updated Jun 12, 2024 • 2

upvoted a paper 8 months ago

Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data

Paper • 2408.10119 • Published Aug 19, 2024 • 17

upvoted 2 papers about 1 year ago

Elucidating the Design Space of Diffusion-Based Generative Models

Paper • 2206.00364 • Published Jun 1, 2022 • 16

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Paper • 2402.13616 • Published Feb 21, 2024 • 48

upvoted 6 papers over 1 year ago

Holistic Evaluation of Text-To-Image Models

Paper • 2311.04287 • Published Nov 7, 2023 • 16

Improving Sample Quality of Diffusion Models Using Self-Attention Guidance

Paper • 2210.00939 • Published Oct 3, 2022 • 6

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Paper • 2309.15818 • Published Sep 27, 2023 • 18

ProPainter: Improving Propagation and Transformer for Video Inpainting

Paper • 2309.03897 • Published Sep 7, 2023 • 27

SpeechX: Neural Codec Language Model as a Versatile Speech Transformer

Paper • 2308.06873 • Published Aug 14, 2023 • 26

MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies

Paper • 2308.01546 • Published Aug 3, 2023 • 18

upvoted a paper almost 2 years ago

CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training

Paper • 2305.10763 • Published May 18, 2023 • 3