fp8 (#5), opened by festr2
Hello,
Is an FP8 release planned?
+1, an FP8 release would be good for serving with vLLM and fitting into 4x H200.
Also waiting for FP8. Is there a release schedule for FP8 and other quants?
Will FP8 also run on 2x H200 with vLLM?