r0ymanesco
commited on
Commit
·
9076472
1
Parent(s):
ae806f6
Update README.md
Browse files
README.md
CHANGED
@@ -3,9 +3,11 @@ license: apache-2.0
|
|
3 |
---
|
4 |
# notdiamond-0001
|
5 |
|
6 |
-
notdiamond-0001 automatically determines whether to send queries to GPT-3.5 or GPT-4, depending on which model is best-suited for your task. We've trained notdiamond-0001 on hundreds of thousands of data points from robust, cross-domain evaluation benchmarks.
|
7 |
|
8 |
-
The model is
|
|
|
|
|
9 |
|
10 |
To use notdiamond-0001, format your queries using the following prompt with your query appended at the end
|
11 |
``` python
|
@@ -31,6 +33,4 @@ You can then determine the model to call as follows
|
|
31 |
model_id = logits.argmax().item()
|
32 |
model_to_call = id2label[model_id]
|
33 |
```
|
34 |
-
|
35 |
-
For more details on how you can integrate this into your techstack and have notdiamond-0001 help you reduce latency and cost, check out our [documentation](https://notdiamond.readme.io/reference/introduction-1).
|
36 |
-
You can also access the model through our free [API](https://www.notdiamond.ai/notdiamond-0001).
|
|
|
3 |
---
|
4 |
# notdiamond-0001
|
5 |
|
6 |
+
notdiamond-0001 automatically determines whether to send queries to GPT-3.5 or GPT-4, depending on which model is best-suited for your task. We've trained notdiamond-0001 on hundreds of thousands of data points from robust, cross-domain evaluation benchmarks.
|
7 |
|
8 |
+
The router model is free to use under the Apache 2.0 license. You can also access the model through our free [API](https://www.notdiamond.ai/notdiamond-0001).
|
9 |
+
|
10 |
+
This router model is a classifier and will return either GPT-3.5 or GPT-4. You determine which version of each model you want to use and make the calls client-side with your own keys.
|
11 |
|
12 |
To use notdiamond-0001, format your queries using the following prompt with your query appended at the end
|
13 |
``` python
|
|
|
33 |
model_id = logits.argmax().item()
|
34 |
model_to_call = id2label[model_id]
|
35 |
```
|
36 |
+
For more details on how you can integrate this into your techstack and have notdiamond-0001 help you reduce latency and cost, check out our [documentation](https://notdiamond.readme.io/reference/introduction-1).
|
|
|
|