Guy Yariv

GuyYariv

AI & ML interests

Generative AI

Recent Activity

upvoted a paper 4 days ago

Charting and Navigating Hugging Face's Model Atlas

authored a paper 4 days ago

RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling

upvoted a paper 5 days ago

"Principal Components" Enable A New Language of Images

GuyYariv's activity

upvoted a paper 4 days ago

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published 5 days ago • 63

upvoted 2 papers 5 days ago

"Principal Components" Enable A New Language of Images

Paper • 2503.08685 • Published 7 days ago • 11

RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling

Paper • 2503.09601 • Published 6 days ago • 14

upvoted a paper 21 days ago

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published 27 days ago • 66

upvoted a paper about 1 month ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13 • 31

upvoted a paper about 2 months ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 70

upvoted a paper 2 months ago

Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation

Paper • 2501.03059 • Published Jan 6 • 22

upvoted a paper 4 months ago

Edge Weight Prediction For Category-Agnostic Pose Estimation

Paper • 2411.16665 • Published Nov 25, 2024 • 6

upvoted a paper 8 months ago

Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024 • 83

upvoted 2 papers 9 months ago

Dataset Size Recovery from LoRA Weights

Paper • 2406.19395 • Published Jun 27, 2024 • 19

Improving Visual Commonsense in Language Models via Multiple Image Generation

Paper • 2406.13621 • Published Jun 19, 2024 • 13

upvoted a paper 11 months ago

Data-Efficient Multimodal Fusion on a Single GPU

Paper • 2312.10144 • Published Dec 15, 2023 • 6

upvoted 4 papers about 1 year ago

Video Editing via Factorized Diffusion Distillation

Paper • 2403.09334 • Published Mar 14, 2024 • 23

Video as the New Language for Real-World Decision Making

Paper • 2402.17139 • Published Feb 27, 2024 • 20

Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency

Paper • 2310.03734 • Published Oct 5, 2023 • 15

Masked Audio Generation using a Single Non-Autoregressive Transformer

Paper • 2401.04577 • Published Jan 9, 2024 • 43

upvoted 2 papers over 1 year ago

Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Paper • 2309.16429 • Published Sep 28, 2023 • 11

AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

Paper • 2305.13050 • Published May 22, 2023 • 3