Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -7,14 +7,13 @@ This is a quantization of the [phi-4](https://huggingface.co/microsoft/phi-4).
|
|
| 7 |
|
| 8 |
The phi-4 model is a cutting-edge open-source LLM developed using a diverse mix of synthetic datasets, curated public domain web content, and acquired academic resources, including books and Q&A datasets. This deliberate data selection ensures the training of compact yet highly capable models with an emphasis on quality and advanced reasoning. To further enhance its performance, phi-4 underwent a rigorous alignment process that included supervised fine-tuning and direct preference optimization, resulting in precise instruction adherence and robust safety measures.
|
| 9 |
## Evaluations
|
| 10 |
-
This model provides an accuracy recovery of 99.
|
| 11 |
|
| 12 |
| __English__ | __[phi-4](https://huggingface.co/microsoft/phi-4)__ | __[phi-4-FP8-Dynamic (this)](https://huggingface.co/cortecs/phi-4-FP8-Dynamic)__ |
|
| 13 |
|:--------------|:------------------------------------------------------|:-----------------------------------------------------------------------------------|
|
| 14 |
| Avg. | 70.75 | 70.7 |
|
| 15 |
| Arc | 68.7 | 68.7 |
|
| 16 |
| Hellaswag | 72.8 | 72.7 |
|
| 17 |
-
| MMLU | 79.46 | 79.67 |
|
| 18 |
| | | |
|
| 19 |
| __French__ | __[phi-4](https://huggingface.co/microsoft/phi-4)__ | __[phi-4-FP8-Dynamic (this)](https://huggingface.co/cortecs/phi-4-FP8-Dynamic)__ |
|
| 20 |
| Avg. | 68.67 | 68.87 |
|
|
@@ -48,7 +47,7 @@ Install **vLLM** and
|
|
| 48 |
run the [server](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#openai-compatible-server):
|
| 49 |
|
| 50 |
```
|
| 51 |
-
python -m vllm.entrypoints.openai.api_server --model cortecs/phi-4-FP8-Dynamic
|
| 52 |
```
|
| 53 |
Access the model:
|
| 54 |
```
|
|
|
|
| 7 |
|
| 8 |
The phi-4 model is a cutting-edge open-source LLM developed using a diverse mix of synthetic datasets, curated public domain web content, and acquired academic resources, including books and Q&A datasets. This deliberate data selection ensures the training of compact yet highly capable models with an emphasis on quality and advanced reasoning. To further enhance its performance, phi-4 underwent a rigorous alignment process that included supervised fine-tuning and direct preference optimization, resulting in precise instruction adherence and robust safety measures.
|
| 9 |
## Evaluations
|
| 10 |
+
This model provides an accuracy recovery of 99.68%.
|
| 11 |
|
| 12 |
| __English__ | __[phi-4](https://huggingface.co/microsoft/phi-4)__ | __[phi-4-FP8-Dynamic (this)](https://huggingface.co/cortecs/phi-4-FP8-Dynamic)__ |
|
| 13 |
|:--------------|:------------------------------------------------------|:-----------------------------------------------------------------------------------|
|
| 14 |
| Avg. | 70.75 | 70.7 |
|
| 15 |
| Arc | 68.7 | 68.7 |
|
| 16 |
| Hellaswag | 72.8 | 72.7 |
|
|
|
|
| 17 |
| | | |
|
| 18 |
| __French__ | __[phi-4](https://huggingface.co/microsoft/phi-4)__ | __[phi-4-FP8-Dynamic (this)](https://huggingface.co/cortecs/phi-4-FP8-Dynamic)__ |
|
| 19 |
| Avg. | 68.67 | 68.87 |
|
|
|
|
| 47 |
run the [server](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#openai-compatible-server):
|
| 48 |
|
| 49 |
```
|
| 50 |
+
python -m vllm.entrypoints.openai.api_server --model cortecs/phi-4-FP8-Dynamic --max-model-len 16384
|
| 51 |
```
|
| 52 |
Access the model:
|
| 53 |
```
|