v0.38.0
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.38.0 for changelog.
- DeepLabV3-ResNet50_w8a8.dlc +2 -2
- precompiled/qualcomm-snapdragon-x-elite/DeepLabV3-ResNet50_w8a8.bin → DeepLabV3-ResNet50_w8a8.tflite +2 -2
- README.md +15 -14
- precompiled/qualcomm-snapdragon-x-elite/DeepLabV3-ResNet50_w8a8.onnx.zip +0 -3
- precompiled/qualcomm-snapdragon-x-elite/tool-versions.yaml +0 -3
- tool-versions.yaml +3 -2
DeepLabV3-ResNet50_w8a8.dlc
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5395744697d27cfb50fc992df427707f869f14ed84cadd46749c4f68c17f84ee
|
| 3 |
+
size 41458708
|
precompiled/qualcomm-snapdragon-x-elite/DeepLabV3-ResNet50_w8a8.bin → DeepLabV3-ResNet50_w8a8.tflite
RENAMED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:28c3e66c0d9b950f69c8bfe2131b17cfdbd1300246849f429402027e57b18650
|
| 3 |
+
size 40457472
|
README.md
CHANGED
|
@@ -36,19 +36,20 @@ More details on model performance across various devices, can be found
|
|
| 36 |
|
| 37 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 38 |
|---|---|---|---|---|---|---|---|---|
|
| 39 |
-
| DeepLabV3-ResNet50 | w8a8 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC |
|
| 40 |
-
| DeepLabV3-ResNet50 | w8a8 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC |
|
| 41 |
-
| DeepLabV3-ResNet50 | w8a8 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC |
|
| 42 |
-
| DeepLabV3-ResNet50 | w8a8 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC |
|
| 43 |
-
| DeepLabV3-ResNet50 | w8a8 |
|
| 44 |
-
| DeepLabV3-ResNet50 | w8a8 |
|
| 45 |
-
| DeepLabV3-ResNet50 | w8a8 |
|
| 46 |
-
| DeepLabV3-ResNet50 | w8a8 |
|
| 47 |
-
| DeepLabV3-ResNet50 | w8a8 |
|
| 48 |
-
| DeepLabV3-ResNet50 | w8a8 |
|
| 49 |
-
| DeepLabV3-ResNet50 | w8a8 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC |
|
| 50 |
-
| DeepLabV3-ResNet50 | w8a8 |
|
| 51 |
-
| DeepLabV3-ResNet50 | w8a8 |
|
|
|
|
| 52 |
|
| 53 |
|
| 54 |
|
|
@@ -130,7 +131,7 @@ from qai_hub_models.models.deeplabv3_resnet50 import Model
|
|
| 130 |
torch_model = Model.from_pretrained()
|
| 131 |
|
| 132 |
# Device
|
| 133 |
-
device = hub.Device("Samsung Galaxy
|
| 134 |
|
| 135 |
# Trace model
|
| 136 |
input_shape = torch_model.get_input_spec()
|
|
|
|
| 36 |
|
| 37 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 38 |
|---|---|---|---|---|---|---|---|---|
|
| 39 |
+
| DeepLabV3-ResNet50 | w8a8 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 441225.106 ms | 1 - 395 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 40 |
+
| DeepLabV3-ResNet50 | w8a8 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 425275.28 ms | 13 - 565 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 41 |
+
| DeepLabV3-ResNet50 | w8a8 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 264705.408 ms | 10 - 31 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 42 |
+
| DeepLabV3-ResNet50 | w8a8 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 363242.695 ms | 1 - 390 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 43 |
+
| DeepLabV3-ResNet50 | w8a8 | RB5 (Proxy) | Qualcomm® QCS8250 (Proxy) | TFLITE | 3276.866 ms | 0 - 106 MB | NPU | [DeepLabV3-ResNet50.tflite](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.tflite) |
|
| 44 |
+
| DeepLabV3-ResNet50 | w8a8 | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 441225.106 ms | 1 - 395 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 45 |
+
| DeepLabV3-ResNet50 | w8a8 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 260939.61 ms | 12 - 34 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 46 |
+
| DeepLabV3-ResNet50 | w8a8 | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 381873.4 ms | 1 - 394 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 47 |
+
| DeepLabV3-ResNet50 | w8a8 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 261728.526 ms | 1 - 21 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 48 |
+
| DeepLabV3-ResNet50 | w8a8 | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 363242.695 ms | 1 - 390 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 49 |
+
| DeepLabV3-ResNet50 | w8a8 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 254186.213 ms | 1 - 515 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 50 |
+
| DeepLabV3-ResNet50 | w8a8 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | TFLITE | 262956.065 ms | 11 - 422 MB | NPU | [DeepLabV3-ResNet50.tflite](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.tflite) |
|
| 51 |
+
| DeepLabV3-ResNet50 | w8a8 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_DLC | 274680.73 ms | 17 - 324 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 52 |
+
| DeepLabV3-ResNet50 | w8a8 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 358211.043 ms | 132 - 132 MB | NPU | [DeepLabV3-ResNet50.dlc](https://huggingface.co/qualcomm/DeepLabV3-ResNet50/blob/main/DeepLabV3-ResNet50_w8a8.dlc) |
|
| 53 |
|
| 54 |
|
| 55 |
|
|
|
|
| 131 |
torch_model = Model.from_pretrained()
|
| 132 |
|
| 133 |
# Device
|
| 134 |
+
device = hub.Device("Samsung Galaxy S25")
|
| 135 |
|
| 136 |
# Trace model
|
| 137 |
input_shape = torch_model.get_input_spec()
|
precompiled/qualcomm-snapdragon-x-elite/DeepLabV3-ResNet50_w8a8.onnx.zip
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:4950e16b03785942a37b5032198ce6a3958d39df4f130005f2edd397611f9c37
|
| 3 |
-
size 35956025
|
|
|
|
|
|
|
|
|
|
|
|
precompiled/qualcomm-snapdragon-x-elite/tool-versions.yaml
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
tool_versions:
|
| 2 |
-
precompiled_qnn_onnx:
|
| 3 |
-
qairt: 2.36.4.250725200057_123280
|
|
|
|
|
|
|
|
|
|
|
|
tool-versions.yaml
CHANGED
|
@@ -1,3 +1,4 @@
|
|
| 1 |
tool_versions:
|
| 2 |
-
|
| 3 |
-
qairt: 2.
|
|
|
|
|
|
| 1 |
tool_versions:
|
| 2 |
+
tflite:
|
| 3 |
+
qairt: 2.38.0.250901140452_125126
|
| 4 |
+
tflite: 2.17.0
|