qualcomm
/

Baichuan-7B

@@ -36,7 +36,6 @@ accross various devices, can be found [here](https://aihub.qualcomm.com/models/b
   - Token generator output: 1 output token + KVCache for next iteration
   - Decoding length: 1024 (1 output token + 1023 from KVCache)
   - Use: Initiate conversation with prompt-processor and then token generator for subsequent iterations.
-  - QNN-SDK: 2.19
 | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model

   - Token generator output: 1 output token + KVCache for next iteration
   - Decoding length: 1024 (1 output token + 1023 from KVCache)
   - Use: Initiate conversation with prompt-processor and then token generator for subsequent iterations.
 | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model