qaihm-bot commited on
Commit
10e889c
·
verified ·
1 Parent(s): 223cd5e

See https://github.com/quic/ai-hub-models/releases/v0.42.0 for changelog.

README.md CHANGED
@@ -37,51 +37,53 @@ More details on model performance across various devices, can be found
37
 
38
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
39
  |---|---|---|---|---|---|---|---|---|
40
- | Swin-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 61.017 ms | 0 - 356 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
41
- | Swin-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 55.331 ms | 1 - 314 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
42
- | Swin-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 28.274 ms | 0 - 350 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
43
- | Swin-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 29.627 ms | 1 - 336 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
44
- | Swin-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 21.67 ms | 0 - 33 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
45
- | Swin-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 19.633 ms | 0 - 57 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
46
- | Swin-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | ONNX | 19.133 ms | 0 - 62 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.onnx.zip) |
47
- | Swin-Base | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 25.077 ms | 0 - 355 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
48
- | Swin-Base | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 22.649 ms | 0 - 315 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
49
- | Swin-Base | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 61.017 ms | 0 - 356 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
50
- | Swin-Base | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 55.331 ms | 1 - 314 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
51
- | Swin-Base | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 21.677 ms | 0 - 41 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
52
- | Swin-Base | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 19.703 ms | 0 - 66 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
53
- | Swin-Base | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 31.906 ms | 0 - 347 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
54
- | Swin-Base | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 29.853 ms | 36 - 342 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
55
- | Swin-Base | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 21.816 ms | 0 - 33 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
56
- | Swin-Base | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 19.575 ms | 0 - 70 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
57
- | Swin-Base | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 25.077 ms | 0 - 355 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
58
- | Swin-Base | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 22.649 ms | 0 - 315 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
59
- | Swin-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 15.037 ms | 0 - 354 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
60
- | Swin-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 13.498 ms | 1 - 346 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
61
- | Swin-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 12.924 ms | 1 - 343 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.onnx.zip) |
62
- | Swin-Base | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | TFLITE | 11.895 ms | 0 - 355 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
63
- | Swin-Base | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_DLC | 10.218 ms | 1 - 313 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
64
- | Swin-Base | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | ONNX | 9.635 ms | 1 - 314 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.onnx.zip) |
65
- | Swin-Base | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | TFLITE | 10.164 ms | 0 - 348 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
66
- | Swin-Base | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_DLC | 8.144 ms | 1 - 328 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
67
- | Swin-Base | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | ONNX | 8.104 ms | 1 - 328 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.onnx.zip) |
68
- | Swin-Base | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 20.172 ms | 1019 - 1019 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
69
- | Swin-Base | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 19.541 ms | 175 - 175 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.onnx.zip) |
70
- | Swin-Base | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 39.535 ms | 0 - 277 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
71
- | Swin-Base | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 20.837 ms | 0 - 85 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
72
- | Swin-Base | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 20.981 ms | 0 - 279 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
73
- | Swin-Base | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | ONNX | 552.304 ms | 142 - 170 MB | CPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.onnx.zip) |
74
- | Swin-Base | w8a16 | RB5 (Proxy) | Qualcomm® QCS8250 (Proxy) | ONNX | 620.883 ms | 140 - 156 MB | CPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.onnx.zip) |
75
- | Swin-Base | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 39.535 ms | 0 - 277 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
76
- | Swin-Base | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 20.942 ms | 0 - 87 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
77
- | Swin-Base | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 20.715 ms | 0 - 84 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
78
- | Swin-Base | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 20.981 ms | 0 - 279 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
79
- | Swin-Base | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 13.827 ms | 0 - 283 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
80
- | Swin-Base | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_DLC | 10.808 ms | 0 - 266 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
81
- | Swin-Base | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | ONNX | 210.961 ms | 93 - 166 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.onnx.zip) |
82
- | Swin-Base | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_DLC | 8.134 ms | 0 - 273 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
83
- | Swin-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 21.72 ms | 404 - 404 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
84
- | Swin-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 186.699 ms | 133 - 133 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.onnx.zip) |
 
 
85
 
86
 
87
 
@@ -95,9 +97,9 @@ pip install qai-hub-models
95
  ```
96
 
97
 
98
- ## Configure Qualcomm® AI Hub to run this model on a cloud-hosted device
99
 
100
- Sign-in to [Qualcomm® AI Hub](https://app.aihub.qualcomm.com/) with your
101
  Qualcomm® ID. Once signed in navigate to `Account -> Settings -> API Token`.
102
 
103
  With this API token, you can configure your client to run models on the cloud
@@ -105,7 +107,7 @@ hosted devices.
105
  ```bash
106
  qai-hub configure --api_token API_TOKEN
107
  ```
108
- Navigate to [docs](https://app.aihub.qualcomm.com/docs/) for more information.
109
 
110
 
111
 
@@ -216,7 +218,7 @@ With the output of the model, you can compute like PSNR, relative errors or
216
  spot check the output with expected output.
217
 
218
  **Note**: This on-device profiling and inference requires access to Qualcomm®
219
- AI Hub. [Sign up for access](https://myaccount.qualcomm.com/signup).
220
 
221
 
222
 
 
37
 
38
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
39
  |---|---|---|---|---|---|---|---|---|
40
+ | Swin-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 60.881 ms | 0 - 357 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
41
+ | Swin-Base | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 54.521 ms | 1 - 314 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
42
+ | Swin-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 28.192 ms | 0 - 355 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
43
+ | Swin-Base | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 28.389 ms | 0 - 334 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
44
+ | Swin-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 21.723 ms | 0 - 32 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
45
+ | Swin-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 19.277 ms | 0 - 68 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
46
+ | Swin-Base | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | ONNX | 19.159 ms | 1 - 56 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.onnx.zip) |
47
+ | Swin-Base | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 25.116 ms | 0 - 357 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
48
+ | Swin-Base | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 22.069 ms | 0 - 316 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
49
+ | Swin-Base | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 60.881 ms | 0 - 357 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
50
+ | Swin-Base | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 54.521 ms | 1 - 314 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
51
+ | Swin-Base | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 21.618 ms | 0 - 45 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
52
+ | Swin-Base | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 19.299 ms | 0 - 69 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
53
+ | Swin-Base | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 31.874 ms | 0 - 348 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
54
+ | Swin-Base | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 28.414 ms | 0 - 309 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
55
+ | Swin-Base | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 21.717 ms | 0 - 38 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
56
+ | Swin-Base | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 19.364 ms | 0 - 52 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
57
+ | Swin-Base | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 25.116 ms | 0 - 357 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
58
+ | Swin-Base | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 22.069 ms | 0 - 316 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
59
+ | Swin-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 15.006 ms | 46 - 408 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
60
+ | Swin-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 13.032 ms | 1 - 346 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
61
+ | Swin-Base | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 12.846 ms | 1 - 345 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.onnx.zip) |
62
+ | Swin-Base | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | TFLITE | 11.845 ms | 0 - 351 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
63
+ | Swin-Base | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_DLC | 9.895 ms | 1 - 313 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
64
+ | Swin-Base | float | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | ONNX | 9.591 ms | 1 - 314 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.onnx.zip) |
65
+ | Swin-Base | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | TFLITE | 9.823 ms | 0 - 351 MB | NPU | [Swin-Base.tflite](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.tflite) |
66
+ | Swin-Base | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_DLC | 8.018 ms | 0 - 330 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
67
+ | Swin-Base | float | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | ONNX | 8.042 ms | 0 - 327 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.onnx.zip) |
68
+ | Swin-Base | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 19.906 ms | 1035 - 1035 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.dlc) |
69
+ | Swin-Base | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 19.472 ms | 175 - 175 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base.onnx.zip) |
70
+ | Swin-Base | w8a16 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 38.0 ms | 0 - 264 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
71
+ | Swin-Base | w8a16 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 20.045 ms | 0 - 67 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
72
+ | Swin-Base | w8a16 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 20.221 ms | 0 - 264 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
73
+ | Swin-Base | w8a16 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | ONNX | 553.678 ms | 142 - 170 MB | CPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.onnx.zip) |
74
+ | Swin-Base | w8a16 | RB5 (Proxy) | Qualcomm® QCS8250 (Proxy) | ONNX | 562.489 ms | 135 - 156 MB | CPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.onnx.zip) |
75
+ | Swin-Base | w8a16 | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 38.0 ms | 0 - 264 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
76
+ | Swin-Base | w8a16 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 19.967 ms | 0 - 67 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
77
+ | Swin-Base | w8a16 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 19.984 ms | 0 - 74 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
78
+ | Swin-Base | w8a16 | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 20.221 ms | 0 - 264 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
79
+ | Swin-Base | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 13.369 ms | 0 - 270 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
80
+ | Swin-Base | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | QNN_DLC | 10.371 ms | 0 - 259 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
81
+ | Swin-Base | w8a16 | Samsung Galaxy S25 | Snapdragon® 8 Elite For Galaxy Mobile | ONNX | 207.429 ms | 93 - 166 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.onnx.zip) |
82
+ | Swin-Base | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | QNN_DLC | 23.508 ms | 0 - 321 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
83
+ | Swin-Base | w8a16 | Snapdragon 7 Gen 4 QRD | Snapdragon® 7 Gen 4 Mobile | ONNX | 595.51 ms | 119 - 141 MB | CPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.onnx.zip) |
84
+ | Swin-Base | w8a16 | Snapdragon 8 Elite Gen 5 QRD | Snapdragon® 8 Elite Gen5 Mobile | QNN_DLC | 8.022 ms | 4 - 277 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
85
+ | Swin-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 21.391 ms | 415 - 415 MB | NPU | [Swin-Base.dlc](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.dlc) |
86
+ | Swin-Base | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 197.49 ms | 133 - 133 MB | NPU | [Swin-Base.onnx.zip](https://huggingface.co/qualcomm/Swin-Base/blob/main/Swin-Base_w8a16.onnx.zip) |
87
 
88
 
89
 
 
97
  ```
98
 
99
 
100
+ ## Configure Qualcomm® AI Hub Workbench to run this model on a cloud-hosted device
101
 
102
+ Sign-in to [Qualcomm® AI Hub Workbench](https://workbench.aihub.qualcomm.com/) with your
103
  Qualcomm® ID. Once signed in navigate to `Account -> Settings -> API Token`.
104
 
105
  With this API token, you can configure your client to run models on the cloud
 
107
  ```bash
108
  qai-hub configure --api_token API_TOKEN
109
  ```
110
+ Navigate to [docs](https://workbench.aihub.qualcomm.com/docs/) for more information.
111
 
112
 
113
 
 
218
  spot check the output with expected output.
219
 
220
  **Note**: This on-device profiling and inference requires access to Qualcomm®
221
+ AI Hub Workbench. [Sign up for access](https://myaccount.qualcomm.com/signup).
222
 
223
 
224
 
Swin-Base_float.dlc CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3a04ddaad71b81e4b7f05f6d0adbc20dde9dc3451c430c36b608900a0f0273ce
3
- size 356148212
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cd06b1be79cc3104c1f424c97aec537b832c32a8988d7cb9f6d10cb8ccaddf60
3
+ size 356456956
Swin-Base_float.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f0be41b37ff37d768940b39c5f569562bf9912a77321a56f50a88117406c3656
3
- size 326769810
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6ffe2f157faff91fcb6c940c111ce455637fffce1463b75f511b35861206d943
3
+ size 326781102
Swin-Base_w8a16.dlc CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:91b61a4688cc78239c29f95d20ab6fbf8424475fde1b1dc676d5beb07fbbeb1d
3
- size 94599332
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:694fc43fc5424fd5ef7a6fc917f39efb8d98f23df10c9bf2b6a8bfb469d203e0
3
+ size 94754604
Swin-Base_w8a16.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:43d40c42dd747efc0082d5b82348d8a03b0225edf8ccd73254f963415aba34dc
3
- size 307813684
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f84d3ff08f25b6c921ca2d424bfc9f3c005fbd1151d026c31ad99c7c5ccee656
3
+ size 307825135