Sylvain Filoni
fffiloni
AI & ML interests
ML for Animation • Alumni Arts Déco Paris
fffiloni's activity
upvoted a paper 4 days ago
upvoted a paper 6 days ago
upvoted an article 8 days ago
Article
AI has a problem with objectifying women
52
upvoted a paper 9 days ago
Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning
Paper • 2405.08054 • Published • 21
Naturalistic Music Decoding from EEG Data via Latent Diffusion Models
Paper • 2405.09062 • Published • 7
Toon3D: Seeing Cartoons from a New Perspective
Paper • 2405.10320 • Published • 19
upvoted a paper 13 days ago
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Paper • 2404.16022 • Published • 16
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance
Paper • 2401.16465 • Published • 9
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models
Paper • 2404.17672 • Published • 18
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
Paper • 2404.19759 • Published • 23
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper • 2404.19427 • Published • 67
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paper • 2404.18212 • Published • 24
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
Paper • 2405.01434 • Published • 47
ZeST: Zero-Shot Material Transfer from a Single Image
Paper • 2404.06425 • Published • 4
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Paper • 2404.14700 • Published • 28
MotionMaster: Training-free Camera Motion Transfer For Video Generation
Paper • 2404.15789 • Published • 10
HiDiffusion: Unlocking High-Resolution Creativity and Efficiency in Low-Resolution Trained Diffusion Models
Paper • 2311.17528 • Published • 4
upvoted an article about 2 months ago
Article
Finetune Mixtral 8x7B with AutoTrain
4
SpatialTracker: Tracking Any 2D Pixels in 3D Space
Paper • 2404.04319 • Published • 21
Learning Inclusion Matching for Animation Paint Bucket Colorization
Paper • 2403.18342 • Published • 1
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
Paper • 2403.17005 • Published • 13
CameraCtrl: Enabling Camera Control for Text-to-Video Generation
Paper • 2404.02101 • Published • 17
Video Interpolation with Diffusion Models
Paper • 2404.01203 • Published • 2
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Paper • 2403.17694 • Published • 10
Streaming Dense Video Captioning
Paper • 2404.01297 • Published • 10
FlexiDreamer: Single Image-to-3D Generation with FlexiCubes
Paper • 2404.00987 • Published • 21
Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos
Paper • 2403.13044 • Published • 14
LITA: Language Instructed Temporal-Localization Assistant
Paper • 2403.19046 • Published • 16
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Paper • 2403.14781 • Published • 14
ReNoise: Real Image Inversion Through Iterative Noising
Paper • 2403.14602 • Published • 19
DreamReward: Text-to-3D Generation with Human Preference
Paper • 2403.14613 • Published • 33
MyVLM: Personalizing VLMs for User-Specific Queries
Paper • 2403.14599 • Published • 14
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Paper • 2403.13745 • Published • 10
MoAI: Mixture of All Intelligence for Large Language and Vision Models
Paper • 2403.07508 • Published • 71
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
Paper • 2403.08764 • Published • 34
Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts
Paper • 2403.08268 • Published • 15
StableDrag: Stable Dragging for Point-based Image Editing
Paper • 2403.04437 • Published • 24
VideoMamba: State Space Model for Efficient Video Understanding
Paper • 2403.06977 • Published • 22
DragAnything: Motion Control for Anything using Entity Representation
Paper • 2403.07420 • Published • 11
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Paper • 2403.01779 • Published • 26
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Paper • 2403.01422 • Published • 24
AtomoVideo: High Fidelity Image-to-Video Generation
Paper • 2403.01800 • Published • 18
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Paper • 2403.00483 • Published • 8
ViewFusion: Towards Multi-View Consistency via Interpolated Denoising
Paper • 2402.18842 • Published • 13
StarCoder 2 and The Stack v2: The Next Generation
Paper • 2402.19173 • Published • 125
HuggingFace's Transformers: State-of-the-art Natural Language Processing
Paper • 1910.03771 • Published • 15
Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation
Paper • 2402.17245 • Published • 10
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
Paper • 2402.17723 • Published • 16
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Paper • 2402.17177 • Published • 87
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model
Paper • 2402.17412 • Published • 21
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper • 2402.17485 • Published • 182
Multi-LoRA Composition for Image Generation
Paper • 2402.16843 • Published • 28
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing
Paper • 2402.15151 • Published • 7
Seamless Human Motion Composition with Blended Positional Encodings
Paper • 2402.15509 • Published • 12
Genie: Generative Interactive Environments
Paper • 2402.15391 • Published • 67
PALO: A Polyglot Large Multimodal Model for 5B People
Paper • 2402.14818 • Published • 22