adding nvidia l4 inference speed info
Browse files
README.md
CHANGED
@@ -40,6 +40,8 @@ The model was trained and tested in the following languages:
|
|
40 |
| NVIDIA A10 | FP32 | 4 ms | 84 ms |
|
41 |
| NVIDIA T4 | FP16 | 3 ms | 65 ms |
|
42 |
| NVIDIA T4 | FP32 | 15 ms | 362 ms |
|
|
|
|
|
43 |
|
44 |
**Note that the Answer Finder models are only used at query time.**
|
45 |
|
|
|
40 |
| NVIDIA A10 | FP32 | 4 ms | 84 ms |
|
41 |
| NVIDIA T4 | FP16 | 3 ms | 65 ms |
|
42 |
| NVIDIA T4 | FP32 | 15 ms | 362 ms |
|
43 |
+
| NVIDIA L4 | FP16 | 2 ms | 38 ms |
|
44 |
+
| NVIDIA L4 | FP32 | 5 ms | 124 ms |
|
45 |
|
46 |
**Note that the Answer Finder models are only used at query time.**
|
47 |
|