ID Card Recognition
Identify and verify ID documents
Generate Vietnamese speech from text and reference audio
Audio-conditioned lip sync with latent diffusion models
Optical illusions and style transfer with FLUX
Create videos with FFmpeg + Qwen2.5-Coder