Commit History

Updated the last-updated date to 18 Aug
42ff7b9

Corey Morris committed on

Updated description with more models
7f24726

Corey Morris committed on

Fixed error
d7b89ce

Corey Morris committed on

Added google analytics snippet
9444cd2

Corey Morris committed on

Increased size of scatter plot
2b16774

Corey Morris committed on

Made the radar plot larger
f52387e

Corey Morris committed on

Moved radar plots to higher in the page
12a9766

Corey Morris committed on

Modified title and explanation to better reflect what the site is
18ec1ba

Corey Morris committed on

Moved radar chart to after analysis
fb25b1e

Corey Morris committed on

Added a default model to compare
7b77065

Corey Morris committed on

Improved clarity of explanation for Radar charts
a450af5

Corey Morris committed on

Fixed part of the duplicate model issue
618dcce

Corey Morris committed on

Table now displays the columns that have the top differences
dc21a69

Corey Morris committed on

Removed charts with hardcoded tasks. Removed hardcoding of the model for the other charts
a125eb8

Corey Morris committed on

Finding top differences between tasks from the target model
627e0f9

Corey Morris committed on

Added explanation for the plot and a dataframe of the models
2db58a0

Corey Morris committed on

Added radar chart. Compares a model to the 5 models that have the closest performance on MMLU_average
9695a47

Corey Morris committed on

Added header back for the table
2a7f691

Corey Morris committed on

Added citation for the site
ea8703d

Corey Morris committed on

Changed Streamlit to wide layout to see more of the table
1e6b767

Corey Morris committed on

Updated the last-updated date
28d4d6a

Corey Morris committed on

Added filter for parameter count. Fixed model filter so that it only filters on the Model name (index of the table)
8474e43

Corey Morris committed on

Modified the selection of models and evaluations so that most do not show up by default, for a better user experience with 700+ models
0a33874

Corey Morris committed on

Added search for Model name and Task name
3abc48f

Corey Morris committed on

Added reasoning for having scatter plots
cb21769

Corey Morris committed on

Updated title now that there are over 700 open source models in the dataset
a9f9804

Corey Morris committed on

Added statement and hypothesis about moral scenarios
d97426f

Corey Morris committed on

Plots have a default title
f9a0f38

Corey Morris committed on

Refactor of create_plot
bdad6e6

Corey Morris committed on

Added finding from moral scenarios about threshold
1f8cc2a

Corey Morris committed on

Added bar chart for abstract algebra data.
a79afe8

Corey Morris committed on

Added introduction, links, and reduced the number of plots displayed
a5fb364

Corey Morris committed on

Modified download CSV feature so that the index column now has a title of model name
6a7ad7c

Corey Morris committed on

Added citation
4fbdb10

Corey Morris committed on

Added a dashed line at the appropriate scale of the largest and smallest values on the plot so that Plotly still zooms in to show that
7ed3839

Corey Morris committed on

For the custom charts, removed columns with NaN values
b94ee8f

Corey Morris committed on

Refactoring. Moved ResultDataProcessor class to a separate file to make it easier to use with experimentation in a jupyter notebook
843a5ef

Corey Morris committed on

Renamed class. Removed columns that were not useful.
03ade34

Corey Morris committed on

Moved CSV download to just below the table
a34a60b

Corey Morris committed on

Added download CSV button
c90b29a

Corey Morris committed on

Rename
e854cb9

Corey Morris committed on

Added custom scatterplot creation
ca8d4b9

Corey Morris committed on

Improving clarity. Moved MMLU average column to a more appropriate spot
5129f48

Corey Morris committed on

Hiding filters unless box is selected. Removed model name column because it is the index of the table
8488477

Corey Morris committed on

Added a scatter plot with just the top 50 performing models on MMLU average
ca8e784

Corey Morris committed on

Added MMLU overall average column. Added a few charts comparing more moral reasoning and comparing MMLU overall to other data
c671de9

Corey Morris committed on

Updated data cleanup so that column names are cleaned up appropriately with regex=True
c1a84da

Corey Morris committed on

Added a few charts
ac931c6

Corey Morris committed on

Fixed reversed plot. Extracted chart creation into a method
337b761

Corey Morris committed on

Updated app.py and requirements.txt so that the app works on Hugging Face Streamlit with pandas 1.x
ba99486

Corey Morris committed on