Qwen
/

Qwen2.5-1.5B-Instruct

Text Generation

text-generation-inference

Model card Files Files and versions

Add link to Neuron-optimized version

#14

by badaoui HF Staff - opened Nov 3

base: refs/heads/main

←

from: refs/pr/14

Discussion Files changed

Files changed (1) hide show

README.md +13 -1

README.md CHANGED Viewed

@@ -107,4 +107,16 @@ If you find our work helpful, feel free to give us a cite.
       journal={arXiv preprint arXiv:2407.10671},
       year={2024}
 }
-```

       journal={arXiv preprint arXiv:2407.10671},
       year={2024}
 }
+```
+---
+## 🚀 AWS Neuron Optimized Version Available
+A Neuron-optimized version of this model is available for improved performance on AWS Inferentia/Trainium instances:
+**[badaoui/Qwen-Qwen2.5-1.5B-Instruct-neuron](https://huggingface.co/badaoui/Qwen-Qwen2.5-1.5B-Instruct-neuron)**
+The Neuron-optimized version provides:
+- Pre-compiled artifacts for faster loading
+- Optimized performance on AWS Neuron devices
+- Same model capabilities with improved inference speed