fffiloni (Sylvain Filoni)

upvoted a paper 4 days ago

T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback

Paper • 2405.18750 • Published 5 days ago • 16

upvoted a paper 6 days ago

Implicit Style-Content Separation using B-LoRA

Paper • 2403.14572 • Published Mar 21 • 3

upvoted an article 8 days ago

AI has a problem with objectifying women

Article • 10 days ago • 52

upvoted a paper 9 days ago

ReVideo: Remake a Video with Motion and Content Control

Paper • 2405.13865 • Published 11 days ago • 21

upvoted 3 papers 10 days ago

Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning

Paper • 2405.08054 • Published 20 days ago • 21

Naturalistic Music Decoding from EEG Data via Latent Diffusion Models

Paper • 2405.09062 • Published 19 days ago • 7

Toon3D: Seeing Cartoons from a New Perspective

Paper • 2405.10320 • Published 17 days ago • 19

upvoted a paper 13 days ago

FIFO-Diffusion: Generating Infinite Videos from Text without Training

Paper • 2405.11473 • Published 15 days ago • 53

upvoted 7 papers 26 days ago

MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model

Paper • 2404.19759 • Published Apr 30 • 23

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

Paper • 2404.19427 • Published Apr 30 • 67

Paint by Inpaint: Learning to Add Image Objects by Removing Them First

Paper • 2404.18212 • Published Apr 28 • 24

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published May 2 • 47

upvoted 4 papers about 1 month ago

ZeST: Zero-Shot Material Transfer from a Single Image

Paper • 2404.06425 • Published Apr 9 • 4

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23 • 28

MotionMaster: Training-free Camera Motion Transfer For Video Generation

Paper • 2404.15789 • Published Apr 24 • 10

HiDiffusion: Unlocking High-Resolution Creativity and Efficiency in Low-Resolution Trained Diffusion Models

Paper • 2311.17528 • Published Nov 29, 2023 • 4

upvoted 2 papers about 2 months ago

Audio Dialogues: Dialogues dataset for audio and music understanding

Paper • 2404.07616 • Published Apr 11 • 14

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

Paper • 2404.07987 • Published Apr 11 • 46

upvoted an article about 2 months ago

Finetune Mixtral 8x7B with AutoTrain

Article • Apr 1 • 4

upvoted 3 papers about 2 months ago

SpatialTracker: Tracking Any 2D Pixels in 3D Space

Paper • 2404.04319 • Published Apr 5 • 21

Learning Inclusion Matching for Animation Paint Bucket Colorization

Paper • 2403.18342 • Published Mar 27 • 1

TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models

Paper • 2403.17005 • Published Mar 25 • 13

upvoted 12 papers 2 months ago

CameraCtrl: Enabling Camera Control for Text-to-Video Generation

Paper • 2404.02101 • Published Apr 2 • 17

Video Interpolation with Diffusion Models

Paper • 2404.01203 • Published Apr 1 • 2

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Paper • 2403.17694 • Published Mar 26 • 10

Streaming Dense Video Captioning

Paper • 2404.01297 • Published Apr 1 • 10

FlexiDreamer: Single Image-to-3D Generation with FlexiCubes

Paper • 2404.00987 • Published Apr 1 • 21

Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos

Paper • 2403.13044 • Published Mar 19 • 14

LITA: Language Instructed Temporal-Localization Assistant

Paper • 2403.19046 • Published Mar 27 • 16

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Paper • 2403.14781 • Published Mar 21 • 14

ReNoise: Real Image Inversion Through Iterative Noising

Paper • 2403.14602 • Published Mar 21 • 19

DreamReward: Text-to-3D Generation with Human Preference

Paper • 2403.14613 • Published Mar 21 • 33

MyVLM: Personalizing VLMs for User-Specific Queries

Paper • 2403.14599 • Published Mar 21 • 14

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

Paper • 2403.13745 • Published Mar 20 • 10

upvoted 23 papers 3 months ago

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Paper • 2403.07508 • Published Mar 12 • 71

VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis

Paper • 2403.08764 • Published Mar 13 • 34

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts

Paper • 2403.08268 • Published Mar 13 • 15

StableDrag: Stable Dragging for Point-based Image Editing

Paper • 2403.04437 • Published Mar 7 • 24

VideoMamba: State Space Model for Efficient Video Understanding

Paper • 2403.06977 • Published Mar 11 • 22

DragAnything: Motion Control for Anything using Entity Representation

Paper • 2403.07420 • Published Mar 12 • 11

OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Paper • 2403.01779 • Published Mar 4 • 26

MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies

Paper • 2403.01422 • Published Mar 3 • 24

AtomoVideo: High Fidelity Image-to-Video Generation

Paper • 2403.01800 • Published Mar 4 • 18

RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization

Paper • 2403.00483 • Published Mar 1 • 8

ViewFusion: Towards Multi-View Consistency via Interpolated Denoising

Paper • 2402.18842 • Published Feb 29 • 13

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 125

HuggingFace's Transformers: State-of-the-art Natural Language Processing

Paper • 1910.03771 • Published Oct 9, 2019 • 15

Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation

Paper • 2402.17245 • Published Feb 27 • 10

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

Paper • 2402.17723 • Published Feb 27 • 16

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

Paper • 2402.17177 • Published Feb 27 • 87

DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model

Paper • 2402.17412 • Published Feb 27 • 21

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27 • 182

Multi-LoRA Composition for Image Generation

Paper • 2402.16843 • Published Feb 26 • 28

Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing

Paper • 2402.15151 • Published Feb 23 • 7

Seamless Human Motion Composition with Blended Positional Encodings

Paper • 2402.15509 • Published Feb 23 • 12

Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23 • 67

PALO: A Polyglot Large Multimodal Model for 5B People

Paper • 2402.14818 • Published Feb 22 • 22

Articles

Breaking Barriers: The Critical Role of Art and Design in Advancing AI Capabilities