Can anyone tell me the function and usage of LTX-2.3-OmniNFT-RL-Lora_bf16?

#61

by soxon - opened 5 days ago

Discussion

soxon

5 days ago

Sorry, I don't really understand. I want to know the effect after using it and how to use it.

orzechowy3333

4 days ago

•

edited 4 days ago

Google AI asked "what is OmniNFT-RL-Lora" says:

OmniNFT-RL-Lora refers to a cutting-edge AI research framework designed to improve how generative models simultaneously create audio and video. It uses reinforcement learning (RL) and Low-Rank Adaptation (LoRA) to fix alignment and synchronization problems in "Twin-DiT" (Diffusion Transformer) models like LTX-2.

https://www.google.com/search?client=opera&q=what+is+OmniNFT-RL-Lora&sourceid=opera&ie=UTF-8&oe=UTF-8

Kijai

Owner 4 days ago

It's brand new so I don't have that much information.

The source is https://huggingface.co/zghhui/OmniNFT

tldr: it makes the model work better

RuneXX

4 days ago

•

edited 4 days ago

Some examples here https://zghhui.github.io/OmniNFT/ (bottom of page)
Looks like its giving a bit of improvements

ZKong

4 days ago

Not obvious in my tests,but worse when strength set to 1.

RuneXX

4 days ago

It's subtle.. but not an extensive test, just a few test runs... seems to be slightly more natural perhaps

anr2me

4 days ago

Some examples here https://zghhui.github.io/OmniNFT/ (bottom of page)
Looks like its giving a bit of improvements

Most of the examples seems to shows a more accurate speakers, especially when there are multiple characters 🤔 with less background sound too.

But the example where the baseline is a photorealistic girl while the OmniNFT became anime girl feels strange 😅 may be it was trained more on anime/cartoon 🤔

Kijai

Owner 3 days ago

An important note about something I initially missed: They have set alpha to 64 in their config entry, while the lora is rank 32... this means to get the intended default effect the lora strength should be 2.0 in ComfyUI.

giredo

3 days ago

Their adapter lora is 1.2GB while your lora is 617MB. What is the difference?

olivetty

3 days ago

Their adapter lora is 1.2GB while your lora is 617MB. What is the difference?

He has downcast it from fp32 to fp16, it's half the size.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment