Commit History

Leaderboard Update, in sync with PR#437 (Fixes For NexusHandler)
f84c9c3

Huanzhi Mao committed on

update data.csv, in sync with BFCL PR#407
2424829

Huanzhi Mao committed on

BFCL May 14th Release
4675558
verified

HuanzhiMao committed on

BFCL April 27th Release
8b1abb6

Huanzhi Mao committed on

BFCL April 24th Release
6525881

Huanzhi Mao committed on

update data.csv, in sync with BFCL April 16th Release
0b85412

Huanzhi Mao committed on

update data.csv
bdd55a2

Huanzhi Mao committed on

add 95th percentile latency stats
73e032d

Huanzhi Mao committed on

update data.csv. April 9
ac0ce57

Huanzhi Mao committed on

change column order
7957873

Huanzhi Mao committed on

Update leaderboard with data from April 3 release
cd106c2

Huanzhi Mao committed on

update description
2b538b7

Huanzhi Mao committed on

update description
383da93

Huanzhi Mao committed on

update data and update links
c49d028

Huanzhi Mao committed on

add description
027abe2

Huanzhi Mao committed on

update data.csv
8a12377

Huanzhi Mao committed on

update data.csv
64cee2d

Huanzhi Mao committed on

update description
c94dd2f

Huanzhi Mao committed on

update data to include more models
23ba85c

Huanzhi Mao committed on

Update with Leaderboard V2
d4c7482

Huanzhi Mao committed on

gemma name change
7580374

Huanzhi Mao committed on

update data.csv
8920689

Huanzhi Mao committed on

typo fix
715b354

Huanzhi Mao committed on

update leaderboard to support new data.csv format
1d221f5

Huanzhi Mao committed on

update data.csv to include column name.
3ef6cec

Huanzhi Mao committed on

Support auto-populated leaderboard from csv file.
67249b1

Huanzhi Mao committed on

update leaderboard data to include new models
57013a0

Huanzhi Mao committed on

remove (FC) to be consistant
469201a

Huanzhi Mao committed on

Update model names to be more precise
e6dd0b3

Huanzhi Mao committed on

Fix: Cascading error resulting from silent error in Vertex AI API innovation in Leaderboard pipeline
04b3735

Huanzhi Mao committed on

Fix: Clerical Error in Simple Function (Evaluation by execution)
bb0e697

Huanzhi Mao committed on

update leaderboard data
9a12753
verified

HuanzhiMao committed on

make alert prettier
aad7801

Huanzhi Mao committed on

add alert for feedback
b9ab8a4

Huanzhi Mao committed on

add app.py
6767c79

Huanzhi Mao committed on