This is the KVLink5 model of the paper "KVLink: Accelerating LLMs via Efficient KV Cache Reuse."

Downloads last month
32
Safetensors
Model size
4B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support