LAB-Bench: Measuring Capabilities of Language Models for Biology Research • Paper • 2407.10362 • Published Jul 14, 2024
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound • Paper • 2406.06612 • Published Jun 6, 2024
🎠 Avatars Collection • The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 69 items • Updated Oct 21
Video as the New Language for Real-World Decision Making • Paper • 2402.17139 • Published Feb 27, 2024
Learning Continuous 3D Words for Text-to-Image Generation • Paper • 2402.08654 • Published Feb 13, 2024
MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models • Paper • 2402.06178 • Published Feb 9, 2024
Memory Consolidation Enables Long-Context Video Understanding • Paper • 2402.05861 • Published Feb 8, 2024
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions • Paper • 2402.03040 • Published Feb 5, 2024
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion • Paper • 2402.03162 • Published Feb 5, 2024
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models • Paper • 2401.13919 • Published Jan 25, 2024
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations • Paper • 2401.01885 • Published Jan 3, 2024
Improving Diffusion-Based Image Synthesis with Context Prediction • Paper • 2401.02015 • Published Jan 4, 2024
VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM • Paper • 2401.01256 • Published Jan 2, 2024
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos • Paper • 2312.15770 • Published Dec 25, 2023
I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models • Paper • 2312.16693 • Published Dec 27, 2023
DreamTuner: Single Image is Enough for Subject-Driven Generation • Paper • 2312.13691 • Published Dec 21, 2023