Commit 5c362f5 (1 parent: ca180a7), committed by markoarnauto
Upload README.md with huggingface_hub
README.md CHANGED
@@ -1,3 +1,4 @@
+---
 
 license: other
 license_link: https://llama.meta.com/llama3/license/
@@ -63,4 +64,4 @@ curl http://localhost:8000/v1/completions -H "Content-Type: application/json
 "prompt": "San Francisco is a"
 } '
 ```
-This model is optimized to handle heavy workloads providing a total throughput of ️**1485 tokens per second** using one NVIDIA H100 ⚡
+⚡ This model is optimized to handle heavy workloads providing a total throughput of ️**1485 tokens per second** using one NVIDIA H100 ⚡