Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -36,7 +36,6 @@ accross various devices, can be found [here](https://aihub.qualcomm.com/models/b
|
|
36 |
- Token generator output: 1 output token + KVCache for next iteration
|
37 |
- Decoding length: 1024 (1 output token + 1023 from KVCache)
|
38 |
- Use: Initiate conversation with prompt-processor and then token generator for subsequent iterations.
|
39 |
-
- QNN-SDK: 2.19
|
40 |
|
41 |
|
42 |
| Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
|
|
|
36 |
- Token generator output: 1 output token + KVCache for next iteration
|
37 |
- Decoding length: 1024 (1 output token + 1023 from KVCache)
|
38 |
- Use: Initiate conversation with prompt-processor and then token generator for subsequent iterations.
|
|
|
39 |
|
40 |
|
41 |
| Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
|