fp8

#5
by festr2 - opened

Hello,

is fp8 release planned?

+1 would be good for serving with vllm and fitting into 4xH200

Also waiting for FP8, is there a release schedule for FP8 and other quants?

Will FP8 also run on 2x H200 with vllm?

Sign up or log in to comment