Update README.md
Browse files
README.md
CHANGED
@@ -51,9 +51,7 @@ We benchmarked our model against a few other options, on [three datasets](https:
|
|
51 |
|
52 |
- Multi-Turn Dataset: Designed to simulate a complex real-world environment, such as a healthcare appointment booking system, the model navigates between natural conversation, initiating function calls, asking clarifying questions, and, when necessary, transferring to customer service. The assessment focuses on the accuracy of intent classification and the correctness of function calls.
|
53 |
|
54 |
-
|
55 |
-
|
56 |
-
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6424a49f12ba34f9894ab9b7/_jBEMv9vN30kz3m9auJWz.png)
|
57 |
|
58 |
## Demo App
|
59 |
Check our healthcare appointment booking [demo](https://app.empower.dev/chat-demo)
|
|
|
51 |
|
52 |
- Multi-Turn Dataset: Designed to simulate a complex real-world environment, such as a healthcare appointment booking system, the model navigates between natural conversation, initiating function calls, asking clarifying questions, and, when necessary, transferring to customer service. The assessment focuses on the accuracy of intent classification and the correctness of function calls.
|
53 |
|
54 |
+
For more detailed evaluation result, please refer to our [github repo](https://github.com/empower-ai/empower-functions)
|
|
|
|
|
55 |
|
56 |
## Demo App
|
57 |
Check our healthcare appointment booking [demo](https://app.empower.dev/chat-demo)
|