Why is it impossible to control the volume of speaking characters?
Example
I tried every possible combination of "talk really quietly" I could think of, then asked Gemma 26B A4B for more, and tried probably 20 combinations of various takes on "whisper" and "murmur". The result? It is literally impossible to change the volume of spoken audio, and it is just as impossible to change speaking distance.
Example:
A person on the other side of the room sounds like they are right next to the primary camera subject. No "speaks softly" or "from a distance" wording works.
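To put a number on this instead of judging by ear, a rough sketch like the one below could compare the loudness of two generated clips. It assumes the audio track of each video has already been extracted to a 16-bit PCM WAV file (for example with ffmpeg). The function name `rms_dbfs` and the file names in the comments are hypothetical and are not part of LTX-2.3 or Diffusers.

```python
import math
import struct
import wave


def rms_dbfs(path):
    """Return the RMS level of a 16-bit PCM WAV file in dBFS (0 dBFS = full scale)."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit PCM audio")
        raw = wav.readframes(wav.getnframes())

    # Each sample is a little-endian signed 16-bit integer; channels are interleaved.
    count = len(raw) // 2
    if count == 0:
        return float("-inf")
    samples = struct.unpack("<%dh" % count, raw[: count * 2])

    rms = math.sqrt(sum(s * s for s in samples) / count)
    if rms == 0:
        return float("-inf")  # digital silence
    return 20 * math.log10(rms / 32768.0)


# Hypothetical usage, assuming two clips generated from otherwise identical prompts:
#   normal = rms_dbfs("normal.wav")
#   whisper = rms_dbfs("whisper.wav")
#   print("difference in dB:", normal - whisper)
```

If the "whisper" prompt had any effect on volume, the whispered clip should measure noticeably lower than the normal one. A difference close to 0 dB would support the observation that the prompt wording is being ignored. This measures the whole track, so background sound or music in the clip would also count toward the level.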
Please advise, and fix. I couldn't even find a place to report this as a bug, so I guess this will have to do.
For anyone who wants to suggest that I just bring my own audio: yes, I'm aware that can be done, and no, I don't think it's a reasonable ask. This is a pretty glaring bug, so the community is seeking a remedy.