Toggle "use_memory_efficient_attention" off for CPU/MPS/Default GPU usage
#15 opened 12 days ago
by
OSalem99
Onnx models are built using an older opset.
👍
2
#11 opened 9 months ago
by
mistborn
fix TEI support
#9 opened 10 months ago
by
GarciaLnk