nm-testing/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head
Tags: Transformers, kv-cache, fp8
main
1.67 kB
1 contributor
History: 4 commits
Latest commit: krishnateja95, "Upload README.md with huggingface_hub" (e610a52, verified), 14 days ago
.gitattributes (1.52 kB, Safe): initial commit, 14 days ago
README.md (153 Bytes): Upload README.md with huggingface_hub, 14 days ago