llama_cpp_python Windows CUDA Wheels (0.3.16)

Prebuilt Windows 64‑bit wheels for llama_cpp_python 0.3.16 with CUDA enabled. Includes Python 3.11 and Python 3.12.

CUDA Support

Built with:

  • sm_75 (RTX 20xx)
  • sm_86 (RTX 30xx)
  • sm_89 (RTX 40xx)
  • sm_120 (RTX 50xx)

Wheels

  • llama_cpp_python-0.3.16-cp311-cp311-win_amd64.whl
  • llama_cpp_python-0.3.16-cp312-cp312-win_amd64.whl
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support