empower-dev/llama3-empower-functions-small

liuylhf committed on May 17

Commit 785c488 • 1 Parent(s): bf19110

Update README.md

Files changed (1)

README.md +1 -3

README.md CHANGED

@@ -51,9 +51,7 @@ We benchmarked our model against a few other options, on [three datasets](https:
 - Multi-Turn Dataset: Designed to simulate a complex real-world environment, such as a healthcare appointment booking system, the model navigates between natural conversation, initiating function calls, asking clarifying questions, and, when necessary, transferring to customer service. The assessment focuses on the accuracy of intent classification and the correctness of function calls.
-In the benchmark, we compared the model against other function-calling models including GPT-4, GPT-3.5, Firefunctions, Together.ai, and Anyscale. For Together.ai and Anyscale, we used mistralai/Mixtral-8x7B-Instruct-v0.1, as it represents their best offering. empower-functions consistently deliver superior performance in all scenarios, especially in the multi-turn dataset and the parallel-calling dataset, which are closer to real-world use cases.
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/6424a49f12ba34f9894ab9b7/_jBEMv9vN30kz3m9auJWz.png)
+For more detailed evaluation result, please refer to our [github repo](https://github.com/empower-ai/empower-functions)
 ## Demo App
 Check our healthcare appointment booking [demo](https://app.empower.dev/chat-demo)