Huanzhi Mao commited on
Commit
2b538b7
1 Parent(s): 383da93

update description

Browse files
Files changed (1) hide show
  1. app.py +4 -4
app.py CHANGED
@@ -1029,7 +1029,7 @@ with gr.Blocks() as demo:
1029
  "**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
1030
  )
1031
  gr.Markdown(
1032
- """**AST means evaluation through Abstract Syntax Tree and Exec means evaluation through execution.**
1033
 
1034
  **FC = native support for function/tool calling.**
1035
 
@@ -1046,7 +1046,7 @@ with gr.Blocks() as demo:
1046
  "**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
1047
  )
1048
  gr.Markdown(
1049
- """**AST means evaluation through Abstract Syntax Tree and Exec means evaluation through execution.**
1050
 
1051
  **FC = native support for function/tool calling.**
1052
 
@@ -1064,8 +1064,8 @@ with gr.Blocks() as demo:
1064
 
1065
  We provide a short summary here. For more details, please refer to our release [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html):
1066
 
1067
- **AST** means evaluation through Abstract Syntax Tree, and **Exec** means evaluation through execution.
1068
-
1069
  **Cost** is calculated as an estimate of the cost per 1000 function calls, in USD.
1070
 
1071
  **Latency** is measured in seconds.
 
1029
  "**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
1030
  )
1031
  gr.Markdown(
1032
+ """**AST means evaluation through Abstract Syntax Tree and Exec means evaluation by executing all the API calls the LLM generates.**
1033
 
1034
  **FC = native support for function/tool calling.**
1035
 
 
1046
  "**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
1047
  )
1048
  gr.Markdown(
1049
+ """**AST means evaluation through Abstract Syntax Tree and Exec means evaluation by executing all the API calls the LLM generates.**
1050
 
1051
  **FC = native support for function/tool calling.**
1052
 
 
1064
 
1065
  We provide a short summary here. For more details, please refer to our release [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html):
1066
 
1067
+ **AST** means evaluation through Abstract Syntax Tree, and **Exec** means evaluation by executing all the API calls the LLM generates.
1068
+
1069
  **Cost** is calculated as an estimate of the cost per 1000 function calls, in USD.
1070
 
1071
  **Latency** is measured in seconds.