ComfyUI-QuantOps

Extended quantization layouts for ComfyUI, enabling loading and inference with models quantized by convert_to_quant.

This is experimental and due to lack of proper support and merging of PR in ComfyUI, do not expect this to work without putting in the effort. I don't have the time or the energy to keep this up and will close entire project if i keep getting bunch of low effort issues posted expecting me go serve a fix up on a silver platter.

tl;dr Go complain at ComfyOrg. Not here.

The following is the last update I make regarding this.

In order to use int8_tensorwise(RTX 30xx-series or newer GPU) you will need the following:

torch 2.10+cu130 or higher
installed the latest of my custom comfy-kitchen fork wheels with the int8-tensorwise support
enable the use of triton backend by using --enable-triton-backend launch argument in ComfyUI

Step 1: Install Triton Activate your virtual environment used by ComfyUI and install triton. For Windows you need to use this but linux can install latest triton as usual.

# for torch 2.10 and 2.11
pip install -U "triton-windows<3.7"
# for torch 2.12
pip install -U "triton-windows<3.8"

Step 3: Install my comfy-kitchen Download the latest uploaded version matching you python of my pre-compiled .whl file from my HuggingFace repository (Latest as of 18 May 2026)

Install it directly pointing to the file path:

pip install --no-deps --force-reinstall --no-cache-dir "path/to/comfy-kitchen.whl"

Step 4: Install/Update ComfyUI-QuantOps You just need to ensure it's fully up to date to read the new model formats. Run these commands:

cd custom_nodes/ComfyUI-QuantOps
git pull

When launching Comfyui add launch argument:

--enable-triton-backend

You can get most of the models here: https://huggingface.co/silveroxides

License

MIT License

Acknowledgements

lyogavin for PR #10864 to ComfyUI.
Clybius for inspiring me to take on quantization and his Learned-Rounding repository.

Name		Name	Last commit message	Last commit date
Latest commit History 143 Commits
.github/workflows		.github/workflows
assets		assets
kernels		kernels
nodes		nodes
quant_layouts		quant_layouts
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
bnb4bit_ops.py		bnb4bit_ops.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
unified_ops.py		unified_ops.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ComfyUI-QuantOps

tl;dr Go complain at ComfyOrg. Not here.

The following is the last update I make regarding this.

License

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ComfyUI-QuantOps

tl;dr Go complain at ComfyOrg. Not here.

The following is the last update I make regarding this.

License

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages