Sylvain Filoni
fffiloni
AI & ML interests
ML for Animation • Alumni Arts Déco Paris
Articles
Organizations
fffiloni's activity
![](https://cdn-avatars.huggingface.co/v1/production/uploads/61868ce808aae0b5499a2a95/F6BA0anbsoY_Z7M1JrwOe.jpeg)
upvoted
a
paper
1 day ago
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Paper
•
2407.11398
•
Published
•
8
Kinetic Typography Diffusion Model
Paper
•
2407.10476
•
Published
•
1
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
Paper
•
2407.19548
•
Published
•
20
Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models
Paper
•
2407.19474
•
Published
•
22
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture
Paper
•
2407.19593
•
Published
•
12
Artist: Aesthetically Controllable Text-Driven Stylization without Training
Paper
•
2407.15842
•
Published
•
11
AccDiffusion: An Accurate Method for Higher-Resolution Image Generation
Paper
•
2407.10738
•
Published
•
2
DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors
Paper
•
2407.16260
•
Published
•
1
SHIC: Shape-Image Correspondences with no Keypoint Supervision
Paper
•
2407.18907
•
Published
•
37
Text2Place: Affordance-aware Text Guided Human Placement
Paper
•
2407.15446
•
Published
•
1
BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation
Paper
•
2407.17952
•
Published
•
26
Floating No More: Object-Ground Reconstruction from a Single Image
Paper
•
2407.18914
•
Published
•
17
EVLM: An Efficient Vision-Language Model for Visual Understanding
Paper
•
2407.14177
•
Published
•
41
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
Paper
•
2407.01494
•
Published
•
12
PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation
Paper
•
2407.02869
•
Published
•
18
Video-to-Audio Generation with Hidden Alignment
Paper
•
2407.07464
•
Published
•
16
Still-Moving: Customized Video Generation without Customized Video Data
Paper
•
2407.08674
•
Published
•
11
Video Diffusion Alignment via Reward Gradients
Paper
•
2407.08737
•
Published
•
47
Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity
Paper
•
2407.10387
•
Published
•
6
IMAGDressing-v1: Customizable Virtual Dressing
Paper
•
2407.12705
•
Published
•
11
The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Paper
•
2407.12579
•
Published
•
1
Shape of Motion: 4D Reconstruction from a Single Video
Paper
•
2407.13764
•
Published
•
18
Efficient Audio Captioning with Encoder-Level Knowledge Distillation
Paper
•
2407.14329
•
Published
•
3
Stable Audio Open
Paper
•
2407.14358
•
Published
•
20
LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding
Paper
•
2407.15754
•
Published
•
19
Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models
Paper
•
2407.15642
•
Published
•
10
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation
Paper
•
2407.15060
•
Published
•
9
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
Paper
•
2407.16655
•
Published
•
28
OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person
Paper
•
2407.16224
•
Published
•
21
Article
Image-based search engine
By
•
•
21
Article
How I train a LoRA: m3lt style training overview
By
•
•
38
upvoted
a
paper
about 1 month ago
upvoted
an
article
about 1 month ago
Article
Thoughts on LoRA Training #1
By
•
•
29
upvoted
a
collection
about 1 month ago
I4VGen: Image as Stepping Stone for Text-to-Video Generation
Paper
•
2406.02230
•
Published
•
15
CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
Paper
•
2406.02509
•
Published
•
8
Searching Priors Makes Text-to-Video Synthesis Better
Paper
•
2406.03215
•
Published
•
11
V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation
Paper
•
2406.02511
•
Published
•
8
upvoted
an
article
2 months ago
Article
Indexify: Bringing HuggingFace Models to Real-Time Pipelines for Production Applications
By
•
•
7
upvoted
an
article
2 months ago
Article
AI has a problem with objectifying women
By
•
•
54
ReVideo: Remake a Video with Motion and Content Control
Paper
•
2405.13865
•
Published
•
22
Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning
Paper
•
2405.08054
•
Published
•
21
Naturalistic Music Decoding from EEG Data via Latent Diffusion Models
Paper
•
2405.09062
•
Published
•
8
Toon3D: Seeing Cartoons from a New Perspective
Paper
•
2405.10320
•
Published
•
19
FIFO-Diffusion: Generating Infinite Videos from Text without Training
Paper
•
2405.11473
•
Published
•
53
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Paper
•
2404.16022
•
Published
•
17
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance
Paper
•
2401.16465
•
Published
•
10
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models
Paper
•
2404.17672
•
Published
•
18
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
Paper
•
2404.19759
•
Published
•
24
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper
•
2404.19427
•
Published
•
71
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paper
•
2404.18212
•
Published
•
27
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
Paper
•
2405.01434
•
Published
•
50
ZeST: Zero-Shot Material Transfer from a Single Image
Paper
•
2404.06425
•
Published
•
5
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Paper
•
2404.14700
•
Published
•
29
MotionMaster: Training-free Camera Motion Transfer For Video Generation
Paper
•
2404.15789
•
Published
•
10
HiDiffusion: Unlocking High-Resolution Creativity and Efficiency in Low-Resolution Trained Diffusion Models
Paper
•
2311.17528
•
Published
•
4
upvoted
a
paper
4 months ago