steerapi/Llama-2-7b-chat-hf-onnx-awq-w8-g128
Tags: Text Generation, Transformers, ONNX, llama
Branch: main
Path: Llama-2-7b-chat-hf-onnx-awq-w8-g128/onnx
13 GB
1 contributor
History: 2 commits
Latest commit: steerapi, "Upload folder using huggingface_hub" (e4de35d), over 2 years ago
decoder_model_merged_quantized.onnx: 8.81 MB, xet, "Upload folder using huggingface_hub", over 2 years ago
decoder_model_merged_quantized.onnx_data: 13 GB, xet, "Upload folder using huggingface_hub", over 2 years ago
quantize_config.json: 992 Bytes, Safe, "Upload folder using huggingface_hub", over 2 years ago