Spaces:
Running
Running
metadata
title: README
emoji: 🐠
colorFrom: pink
colorTo: blue
sdk: static
pinned: false
Welcome to the Inference Acceleration Team under Zhejiang Innovation Research Institute. We are dedicated to achieving efficient large model inference on NVIDIA and domestic GPU platforms, with a focus on cutting-edge inference acceleration technologies such as speculative decoding and model quantization. Feel free to explore our space!