About gguf version 0.9.1

#1
by Evados - opened

I believe there might be an issue with GGUF 0.9.1.
This GGUF gives the same results as GGUF 0.9.0.

For example, if I compare GGUF 0.9.1 and GGUF 0.9.0, the results are almost 97% identical.
However, if I compare the original non-GGUF model 0.9.1 and 0.9.0, the results are completely different.

Could there be an issue with GGUF 0.9.1 Q8 and Q4?
Something is not right.

Here is a quick workflow to test and see the differences.
Edited:
Use the V3 version in my other message below, as the OLD workflow is outdated.
https://www.mediafire.com/file/m4atam4pmrvpzw5/test_ltx_gguf.json

Results:
GGUF 0.9.0

GGUF 0.9.1

Original 0.9.0

Original 0.9.1

Did you fix the seed (under the sampler)? What is your prompt? OK, it seems to be the same structure.
Screenshot 2024-12-22.png

Yes, I have given you a simple workflow; you can see the difference.

Thanks. According to the console statistics, the new set is faster than before. I will look into the source file to see whether anything is messed up.

If you can't test my workflow, here is a quick video example.
https://www.youtube.com/watch?v=jCnH3kl1Sic

Thanks for the video review and output comparison. I just re-converted the original safetensors: the general.file_type in the recent file is [1], but it should be [32]. That seems irrelevant to this problem, though, since 1 = F16 and 32 = BF16, and either is fine.
Screenshot 2024-12-22 0.9 f16.png
Screenshot 2024-12-22 0.9.1 bf16.png
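
For anyone who wants to check the general.file_type value on their own download, here is a minimal sketch that reads the metadata header of a GGUF file using only the Python standard library. It assumes a little-endian GGUF file of version 2 or later. The function names (read_gguf_metadata, describe_file_type) are made up for this example, and the name table covers only the two codes mentioned above (1 = F16, 32 = BF16); any other code is reported as "unknown".

```python
import struct

# GGUF scalar metadata value types mapped to little-endian struct formats.
_SCALAR_FORMATS = {
    0: "<B", 1: "<b", 2: "<H", 3: "<h", 4: "<I", 5: "<i",
    6: "<f", 7: "<?", 10: "<Q", 11: "<q", 12: "<d",
}
_TYPE_STRING = 8
_TYPE_ARRAY = 9

# Only the two codes discussed in this thread; everything else stays unnamed.
FILE_TYPE_NAMES = {1: "F16", 32: "BF16"}


def _read_scalar(f, fmt):
    size = struct.calcsize(fmt)
    data = f.read(size)
    if len(data) != size:
        raise ValueError("unexpected end of file")
    return struct.unpack(fmt, data)[0]


def _read_string(f):
    # Strings are stored as a uint64 length followed by UTF-8 bytes.
    length = _read_scalar(f, "<Q")
    return f.read(length).decode("utf-8")


def _read_value(f, value_type):
    if value_type == _TYPE_STRING:
        return _read_string(f)
    if value_type == _TYPE_ARRAY:
        # Arrays carry an element type and a count, then the elements.
        elem_type = _read_scalar(f, "<I")
        count = _read_scalar(f, "<Q")
        return [_read_value(f, elem_type) for _ in range(count)]
    if value_type in _SCALAR_FORMATS:
        return _read_scalar(f, _SCALAR_FORMATS[value_type])
    raise ValueError(f"unsupported metadata value type: {value_type}")


def read_gguf_metadata(path):
    """Return the metadata key/value pairs of a GGUF file as a dict."""
    with open(path, "rb") as f:
        if f.read(4) != b"GGUF":
            raise ValueError("not a GGUF file")
        version = _read_scalar(f, "<I")
        if version < 2:
            raise ValueError("this sketch assumes GGUF version 2 or later")
        _tensor_count = _read_scalar(f, "<Q")  # not needed here
        kv_count = _read_scalar(f, "<Q")
        metadata = {}
        for _ in range(kv_count):
            key = _read_string(f)
            value_type = _read_scalar(f, "<I")
            metadata[key] = _read_value(f, value_type)
    return metadata


def describe_file_type(path):
    """Return (code, name) for general.file_type; code is None if absent."""
    code = read_gguf_metadata(path).get("general.file_type")
    return code, FILE_TYPE_NAMES.get(code, "unknown")
```

Calling describe_file_type on a local copy of the .gguf file returns the stored code together with its name, so the 0.9 and 0.9.1 files can be compared without opening a viewer.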

Good, thank you!

If any users are reading this message, I’ve made some updates to my LTX model test workflow.
This workflow tests both models 0.9.0 and 0.9.1, includes an experimental I2V method, and provides an example demonstrating how to extend the T2V mode video and how to faceswap.
If it’s helpful to you, that makes me happy.

Updated workflow with both VAEs 0.9.0 and 0.9.1:
https://www.mediafire.com/file/n56u2qyyzut1vxk/Dave_Gravel_LTX_TEST_PLUS_LONG_VIDEO_EXEMPLE_V3.zip

Edited:
This video shows how to use my test workflow and how to faceswap with a custom face.
https://www.youtube.com/watch?v=9cfB9sQWLfg

Screenshot 2024-12-23 032258.png

It seems a VAE with a different architecture is needed to make 0.9.1 work differently.

Yes, I see now. I have updated my workflow to test both models with both VAEs and added a method to save the VAE from the original model. It seems to be working now. Thanks a lot!

Version: 0.9.0 + vae 0.9.0

Version: 0.9.1 + vae 0.9.1

Could you upload the VAE from version 0.9.1, please? The one in your workflow is "LTX_VAE_0.9.1_FBFLOAT16". The one posted everywhere from version 0.9 does not give good results with the 0.9.1 GGUF model.

Hi PixelPlayer,
I don't really have enough space to upload a file of this size.
And it's quite simple to obtain on your own. You need to download the 0.9.1 model in its original version, place it in the 'checkpoint' folder of ComfyUI, and load it with a regular checkpoint node in ComfyUI.
At the VAE pin, you need to connect a 'save VAE' node, and this will save the model's VAE into a folder in ComfyUI, which you can then use with GGUF.
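
To illustrate the idea behind those steps outside the UI: a combined checkpoint is one table of named tensors, and saving the VAE amounts to keeping only the VAE entries. The sketch below works on a plain Python dict. The "vae." key prefix and the function name split_vae_weights are assumptions made for this example; this is not ComfyUI's own code, and the checkpoint and 'save VAE' nodes remain the way to produce a usable file.

```python
def split_vae_weights(state_dict, vae_prefix="vae."):
    """Split a name -> tensor mapping into (vae_weights, other_weights).

    The "vae." prefix is an assumption for illustration; inspect the key
    names of your own checkpoint before relying on it.
    """
    vae_weights = {}
    other_weights = {}
    for name, tensor in state_dict.items():
        if name.startswith(vae_prefix):
            # Drop the prefix so the keys read like a standalone VAE.
            vae_weights[name[len(vae_prefix):]] = tensor
        else:
            other_weights[name] = tensor
    return vae_weights, other_weights
```

If the first returned dict comes back empty, the checkpoint does not use the assumed prefix.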

I created a special sampler node for the LTX model and noticed that the VAE doesn't seem to work well with the new updates to the tiled VAE decode node. This appears to cause several glitches or artifacts in the video.
Alternatively, the default settings of the tiled decode node might not be suitable for the LTX model.
When I use a simple decode, I experience almost no glitches or artifacts.

Thanks @Evados, just tested; it works very well. It took only a couple of seconds to generate a short clip (less than one minute) with the q4_0. Great!

Screenshot 2024-12-29.png

Hi @PixelPlayer, you should find the new VAE and the workflow here; credit should be given to @Evados. Merry Christmas and Happy New Year (2025 is coming)!

It makes me happy! Happy holidays and a Happy New Year to you too. Thank you as well for the GGUF.

Thanks guys! @Evados for the explanation, @calcuis for the extraction. Happy New Year!

If you liked my first workflow, here's another version with a simple custom node I created to include a sampler refiner.
You'll find the workflow and the custom node in the video description.
Have fun!
https://www.youtube.com/watch?v=hGLNOXOypRo

I downloaded ltx-video-2b-v0.9.1-bf16.gguf this week and got the same results as with ltx-video-2b-v0.9-f16.gguf. For 0.9 I used the VAE from here, and for 0.9.1, since I already have the full model as well, I used the VAE from ltx-video-2b-v0.9.1.safetensors just to be sure; same result. When I tested the same WF with ltx-video-2b-v0.9.1.safetensors, the results changed.

I downloaded the full ltx-video-2b-v0.9.safetensors and can confirm that a video made with this or with ltx-video-2b-v0.9-f16.gguf is the same, BUT not for 0.9.1, or at least the bf16 (I could see no f16 for 0.9.1). I will try one of the others, maybe the q8 or q4; they seem to be 1 day newer, so maybe?

@Evados nice WF, a lot going on, but great. Not sure if you have one designed for just one model (no comparison); I think with the color coding I was able to strip it back to do just one. Also, did you see any differences between the 0.9 and 0.9.1 GGUF?
