Update README.md
Browse files
README.md
CHANGED
@@ -156,7 +156,7 @@ Below are DeciCoder's pass@1 on MultiPL HumanEval scores
|
|
156 |
| Infery LLM | 3,889.3 | 3.075 | 11,676.8 | 1.729 |
|
157 |
|
158 |
- Latency - Total generation time of batch size 1 (prefill+generate)
|
159 |
-
- Throughput (tokens/sec) - Measured with optimal
|
160 |
|
161 |
## Documentation
|
162 |
|
|
|
156 |
| Infery LLM | 3,889.3 | 3.075 | 11,676.8 | 1.729 |
|
157 |
|
158 |
- Latency - Total generation time of batch size 1 (prefill+generate)
|
159 |
+
- Throughput (tokens/sec) - Measured with optimal batch size per hardware - A10 on BS 128, A100 on BS 512
|
160 |
|
161 |
## Documentation
|
162 |
|