inference-optimization/granite-4.0-h-tiny-FP8-block
Text Generation
•
7B
•
Updated
•
57
FP8-block, FP8-dynamic, NVFP4, w4a16, w8a8 quantized models of ibm-granite/granite-4.0-h-small and ibm-granite/granite-4.0-h-tiny models