Sylvain Filoni
fffiloni
AI & ML interests
ML for Animation • Alumni Arts Déco Paris
fffiloni's activity
upvoted a paper about 19 hours ago
upvoted a paper 6 days ago
upvoted an article 21 days ago
Article • Finetune Mixtral 8x7B with AutoTrain • 2
upvoted a paper 21 days ago
Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos • Paper • 2403.13044 • 13
LITA: Language Instructed Temporal-Localization Assistant • Paper • 2403.19046 • 16
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance • Paper • 2403.14781 • 13
ReNoise: Real Image Inversion Through Iterative Noising • Paper • 2403.14602 • 19
DreamReward: Text-to-3D Generation with Human Preference • Paper • 2403.14613 • 33
MyVLM: Personalizing VLMs for User-Specific Queries • Paper • 2403.14599 • 14
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation • Paper • 2403.13745 • 10
MoAI: Mixture of All Intelligence for Large Language and Vision Models • Paper • 2403.07508 • 68
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis • Paper • 2403.08764 • 33
Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts • Paper • 2403.08268 • 15
StableDrag: Stable Dragging for Point-based Image Editing • Paper • 2403.04437 • 23
VideoMamba: State Space Model for Efficient Video Understanding • Paper • 2403.06977 • 21
DragAnything: Motion Control for Anything using Entity Representation • Paper • 2403.07420 • 11
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on • Paper • 2403.01779 • 25
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies • Paper • 2403.01422 • 24
AtomoVideo: High Fidelity Image-to-Video Generation • Paper • 2403.01800 • 18
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization • Paper • 2403.00483 • 8
ViewFusion: Towards Multi-View Consistency via Interpolated Denoising • Paper • 2402.18842 • 13
StarCoder 2 and The Stack v2: The Next Generation • Paper • 2402.19173 • 120
HuggingFace's Transformers: State-of-the-art Natural Language Processing • Paper • 1910.03771 • 15
Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation • Paper • 2402.17245 • 10
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners • Paper • 2402.17723 • 15
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models • Paper • 2402.17177 • 87
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model • Paper • 2402.17412 • 21
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions • Paper • 2402.17485 • 182
Multi-LoRA Composition for Image Generation • Paper • 2402.16843 • 28
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing • Paper • 2402.15151 • 7
Seamless Human Motion Composition with Blended Positional Encodings • Paper • 2402.15509 • 12
Genie: Generative Interactive Environments • Paper • 2402.15391 • 67
PALO: A Polyglot Large Multimodal Model for 5B People • Paper • 2402.14818 • 22
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement • Paper • 2402.14658 • 77
Music Style Transfer with Time-Varying Inversion of Diffusion Models • Paper • 2402.13763 • 9
Neural Network Diffusion • Paper • 2402.13144 • 92
VideoPrism: A Foundational Visual Encoder for Video Understanding • Paper • 2402.13217 • 18
Video ReCap: Recursive Captioning of Hour-Long Videos • Paper • 2402.13250 • 18
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts • Paper • 2402.09727 • 35
LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing • Paper • 2402.10294 • 19
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation • Paper • 2402.11929 • 9
FiT: Flexible Vision Transformer for Diffusion Model • Paper • 2402.12376 • 46
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively • Paper • 2401.02955 • 16
Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors • Paper • 2401.06126 • 2
FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring • Paper • 2401.03707 • 1
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis • Paper • 2401.08503 • 3
Lumiere: A Space-Time Diffusion Model for Video Generation • Paper • 2401.12945 • 82
ActAnywhere: Subject-Aware Video Background Generation • Paper • 2401.10822 • 11