iQ quants

#1
by natalie5 - opened

IQ quants are much more accurate below 4 bits; if possible, please include the IQ quants too πŸ₯°

How are they supposed to calibrate it to include imatrix data ¯\\\_(ツ)\_/¯


tbh I have no idea. https://huggingface.co/calcuis/hunyuanimage-gguf/tree/main — this guy has IQ quants for every model

I am sure they do not contain imatrix data; it is possible to make "fake" IQ quants without using any calibration dataset. Real IQ quants are calibrated, but that workflow is really meant for LLMs rather than image and video models. llama.cpp can't convert such models natively, so we've been using hacky patches.


Oh, I am not sure how he does it, but if what you are saying is true, then there's no point in IQ quants for diffusion models, I guess. I think https://huggingface.co/unsloth/LTX-2-GGUF is the best GGUF version available for now.

Unsloth Dynamic 2.0 is a kind of imatrix too, isn't it? πŸ€” It's claimed to be an improvement over the standard imatrix.

if what you are saying is true

For imatrix calibration of an LLM, just a random bunch of text is needed, but diffusion models would need a different calibration approach because they process different inputs.
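For context, this is roughly what imatrix calibration looks like for an LLM with llama.cpp's stock tools (the file names here are placeholders, not from this thread):

```shell
# Compute an importance matrix from a plain-text calibration file;
# for an LLM, calibration.txt can be any representative text corpus.
llama-imatrix -m model-f16.gguf -f calibration.txt -o imatrix.dat

# Quantize to an IQ type using the importance matrix (a "real" IQ quant).
llama-quantize --imatrix imatrix.dat model-f16.gguf model-IQ2_XS.gguf IQ2_XS

# Some IQ types also work without --imatrix (uncalibrated, lower quality),
# though llama-quantize refuses the smallest ones like IQ2_XS/IQ1_S without one.
llama-quantize model-f16.gguf model-IQ4_NL-nocal.gguf IQ4_NL
```

A diffusion model has no obvious text corpus to feed `llama-imatrix`, which is why a different calibration approach would be needed there.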

...then no point in iQ quants for diffusion models

No, they may still be useful: the NL, XS, and XXS quants are smaller than the S ones.
