badaoui HF Staff commited on
Commit
3761afd
·
verified ·
1 Parent(s): fe8a4ea

Add link to Neuron-optimized version

Browse files

🤖 Neuron Export Bot: Adding link to Neuron-optimized version.

A Neuron-optimized version of this model has been created at [badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron](https://huggingface.co/badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron).

The optimized version provides improved performance on AWS Inferentia/Trainium instances with pre-compiled artifacts.

Generated by: [badaoui](https://huggingface.co/badaoui)
Generated using: [Optimum Neuron Compiler Space](https://huggingface.co/spaces/optimum/neuron-export)

Files changed (1) hide show
  1. README.md +13 -1
README.md CHANGED
@@ -63,4 +63,16 @@ print(outputs[0]["generated_text"])
63
  # How many helicopters can a human eat in one sitting?</s>
64
  # <|assistant|>
65
  # ...
66
- ```
 
 
 
 
 
 
 
 
 
 
 
 
 
63
  # How many helicopters can a human eat in one sitting?</s>
64
  # <|assistant|>
65
  # ...
66
+ ```
67
+
68
+ ---
69
+ ## 🚀 AWS Neuron Optimized Version Available
70
+
71
+ A Neuron-optimized version of this model is available for improved performance on AWS Inferentia/Trainium instances:
72
+
73
+ **[badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron](https://huggingface.co/badaoui/TinyLlama-TinyLlama-1.1B-Chat-v1.0-neuron)**
74
+
75
+ The Neuron-optimized version provides:
76
+ - Pre-compiled artifacts for faster loading
77
+ - Optimized performance on AWS Neuron devices
78
+ - Same model capabilities with improved inference speed