Commit History

updated with new data
e05c716
Running

CoreyMorris commited on

Updated data and added notes about the site.
36799a9

CoreyMorris commited on

Updated with new results 11-21
3ebf7a7

CoreyMorris commited on

updated dashboard with new data
443052d

CoreyMorris commited on

check for URL and full model name
2037152

Corey Morris commited on

Added clickable links (#1)
59c6dd2
unverified

Corey commited on

updated description
cb2c32e

Corey Morris commited on

Added new results
f839734

Corey Morris commited on

removed example table with link
5d3a9b2

Corey Morris commited on

changed default scatter plot index
25bce6d

Corey Morris commited on

Loading new csv with updated data
a0c39f5

Corey Morris commited on

Moved moral scenarios information higher on page
41d7691

Corey Morris commited on

changed the wording of moral scenarios
e7c50af

Corey Morris commited on

stripping the whitespace from the input so that the filtering works with or without whitespace
e1345be

Corey Morris commited on

Support for multi column filtering using comma seperated values (#2)
246a992

CoreyMorris imdatta0 commited on

Updated
b601bef

Corey Morris commited on

Updated description and data
383dc16

Corey Morris commited on

Completed loaded form csv
8ef77e5

Corey Morris commited on

loading from csv instead of processing data each time
28e8799

Corey Morris commited on

WIP. Loading data from csv
1a1910c

Corey Morris commited on

updated date and model count
0c07f8b

Corey Morris commited on

Added new hugging face results
3f507e0

Corey Morris commited on

Updated to reflect number of models. Previously, I think there were duplicates
d396c1e

Corey Morris commited on

Show a random question from the moral scenarios evaluation
19c7c67

Corey Morris commited on

Updated model count
4f20e65

Corey Morris commited on

Added statement of removal of models
96ffe12

Corey Morris commited on

removed commented code
7fc9618

Corey Morris commited on

updated update data
280db99

Corey Morris commited on

Fixed type error
e79bcf3

Corey Morris commited on

WIP commit. Currently have nlargest error
d506f10

Corey Morris commited on

Updated the last updated date to 18Aug
42ff7b9

Corey Morris commited on

Updated description with more models
7f24726

Corey Morris commited on

fixed error
d7b89ce

Corey Morris commited on

Added google analytics snippet
9444cd2

Corey Morris commited on

Increased size of scatter plot
2b16774

Corey Morris commited on

Made the radar plot larger
f52387e

Corey Morris commited on

Moved radar plots to higher in the page
12a9766

Corey Morris commited on

Modified title and explanation to better reflect what the site is
18ec1ba

Corey Morris commited on

Moved radar chart to after analysis
fb25b1e

Corey Morris commited on

Added a default model to compare
7b77065

Corey Morris commited on

Improved clarity of explanation for Radar charts
a450af5

Corey Morris commited on

Fixed some of the diplicate model issue
618dcce

Corey Morris commited on

Table now displays the columns that have the top differences
dc21a69

Corey Morris commited on

removed charts with hardcoded tasks. removed hardcoding of model for other charts
a125eb8

Corey Morris commited on

Finding top differences between tasks from the target model
627e0f9

Corey Morris commited on

Added explanation for the plot and a dataframe of the models
2db58a0

Corey Morris commited on

Added radar chart. Compares a model to the 5 models that have the closest performance on MMLU_average
9695a47

Corey Morris commited on

Added header back for the table
2a7f691

Corey Morris commited on

Added citation for the site
ea8703d

Corey Morris commited on

Changed streamlit to wide layout to see more of the table
1e6b767

Corey Morris commited on