Instructions to use Lightricks/LTX-2.3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use Lightricks/LTX-2.3 with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline from diffusers.utils import load_image, export_to_video # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("Lightricks/LTX-2.3", dtype=torch.bfloat16, device_map="cuda") pipe.to("cuda") prompt = "A man with short gray hair plays a red electric guitar." image = load_image( "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/guitar-man.png" ) output = pipe(image=image, prompt=prompt).frames[0] export_to_video(output, "output.mp4") - Notebooks
- Google Colab
- Kaggle
Mobile deployment question β can LTX-2.3 run on phones?
LTX-2.3's efficiency is impressive. Question for the Lightricks team: has anyone tested this on mobile?
We have a 40-phone farm (Snapdragon 865) and we're looking for the first practical mobile video generation model.
Specific questions:
- What's the minimum model size after 4-bit quantization?
- Can it generate 3-second clips at 512x512 on a phone?
- Any plan for a mobile-optimized variant?
- Would you be open to collaborating on a mobile LTX deployment?
We're in Sharjah, UAE β happy to share phone farm benchmarks in exchange!
- Dispatch AI (FZE), Sharjah UAE
Hi,
When running on a phone, you have the advantage of the shared memory architecture, but you are most likely bound on compute.
LTX-2.3 is extremely efficient in that sense, given the low number of tokens allow for less compute in attention. That said, in order to make it useable on SnapDragon or A series processors, it's best to invest in refining it.
My goto would be:
- Weight distillation to a smaller model - 22B parameters is still a lot, even after quantization.
- Quantization Aware Training to go to 4 bits, most likely INT4 as the target since it's mobile chips.
- Finetune on the target resolution, as well as step distillation - The model is geared towards 1080p and up, if you want 512x512 it's best to finetune and while you're at it, distill it to 1/2 steps.
Currently, our focus in mobile edge is for Physical AI, so it's not exactly the same as phones.
