Creating functions for plotting results over time

#295

by chriscanal - opened Sep 22, 2023

base: refs/heads/main ← from: refs/pr/295


+1790

-219

Creating functions for plotting results over time (319b0b79)

chriscanal

Sep 22, 2023

I've added two graphs and human baselines for each metric. I think this should help us track progress over time more easily.

chriscanal

Sep 22, 2023

Once this is approved, there is a minor change to app.py that I will open a PR for so that these graphs actually display.

SaylorTwift

Open LLM Leaderboard org Sep 22, 2023

Hi, thanks for this contribution, it looks great!! However, I think it would be better to have it displayed in another tab (like the About section). What do you think?

chriscanal

Sep 22, 2023

•

edited Sep 22, 2023

Sure, how's this?

Added graphs tab (1d6addaf)

chriscanal

Sep 25, 2023

Any other changes I should make?

SaylorTwift

Open LLM Leaderboard org Sep 26, 2023

Hi! I just looked at the changes locally. Looks great :)
Is there, however, a specific reason why you chose to use non-interactive plots (instead of Plotly)?

Changed to Plotly for interactive graphs! (65fc294d)

Updated main to include title in the graph function parameters (e872e8a1)

Added y-axis range to make graph more aesthetically pleasing (02700b60)

chriscanal

Sep 26, 2023

@SaylorTwift I didn't use Plotly because I'm dumb and ignorant, lmao. I tried it out, and it looks soooo much better!

Great idea!

SaylorTwift

Open LLM Leaderboard org Sep 27, 2023

Oh my god it looks so good!! I will try it out locally ASAP. I think I spotted a reordering of the models in the leaderboard when using your PR; can you maybe check that?

SaylorTwift

Open LLM Leaderboard org Sep 27, 2023

Yes, that's what I thought: the models are reordered. I do not really have time to look into it right now, but tell me when the issue is solved!

Fixing bug that messes up the order of models (75297e78)

chriscanal

Sep 27, 2023

Oops. Yeah, I messed that up. I was reordering by upload date to figure out the timeline of the scores. I now make a copy of the original df so that I don't modify its order.
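
The fix described above can be sketched roughly as follows, assuming the leaderboard is a pandas DataFrame. The function name `scores_by_upload_date`, the column names, and the sample rows are hypothetical and not taken from the PR.

```python
import pandas as pd


def scores_by_upload_date(leaderboard_df, date_col="date"):
    """Return a date-sorted copy, leaving the caller's frame untouched."""
    # Copy first: the column assignment below would otherwise mutate the
    # original frame that the leaderboard table is rendered from.
    timeline = leaderboard_df.copy()
    timeline[date_col] = pd.to_datetime(timeline[date_col])
    return timeline.sort_values(date_col).reset_index(drop=True)


# Illustrative rows only; model names and scores are made up.
leaderboard = pd.DataFrame(
    {
        "model": ["model-a", "model-b", "model-c"],
        "average": [71.2, 65.4, 68.9],
        "date": ["2023-08-01", "2023-05-15", "2023-07-10"],
    }
)
timeline = scores_by_upload_date(leaderboard)
```

Here `timeline` is ordered by date for plotting, while `leaderboard` keeps the row order used for display.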

chriscanal

Oct 2, 2023

Anything else I can do to get this merged?

chriscanal

Oct 5, 2023

@SaylorTwift ?

clefourrier

Open LLM Leaderboard org Oct 9, 2023

Hi!
This looks super neat, thank you for your work!

I have a small nit: the tab name is unclear atm, so it would be good to rename it to something else, maybe "Metrics evolution through time" for example.
Do you also display the scores of flagged models? If yes, I don't think they should be included in the graph.

But congrats, it looks very cool, looking forward to having it merged!

Updated app.py to fix conflict and changed name of tab per Clémentine Fourrier's request (8e478685)

Updated plotted models to exclude flagged models (36bf409e)
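
As an illustration of excluding flagged models before plotting, here is a minimal sketch. It assumes a boolean `flagged` column, which is a hypothetical name; the leaderboard may track flagged models differently, and the rows below are made up.

```python
import pandas as pd


def exclude_flagged(df, flagged_col="flagged"):
    """Return only the rows whose flag column is False (hypothetical helper)."""
    # Boolean mask keeps unflagged rows; copy so later edits do not
    # touch the original frame.
    return df[~df[flagged_col]].copy()


# Illustrative rows only.
models = pd.DataFrame(
    {
        "model": ["model-a", "model-b", "model-c"],
        "average": [71.2, 65.4, 68.9],
        "flagged": [False, True, False],
    }
)
plottable = exclude_flagged(models)
```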

chriscanal

Oct 17, 2023

I updated based on your request, @clefourrier. Unfortunately, I have no way of fixing conflicts. I'm using the Hugging Face web UI to write the code, and I can't run it locally because my local setup no longer has permission to run the code due to the meta-llama/Llama-2-70b-chat-hf permissions. Looking at what's currently in main, though, I think the conflicts would be very minor and take 30 seconds at most to fix.

Merge branch 'main' into pr/295 (81c33130)

clefourrier

Open LLM Leaderboard org Oct 18, 2023

I merged main to your branch to fix the issues :) (info on how to do it in the cmd line is here for a possible next time).

Thank you very much for your work!!!

clefourrier changed pull request status to merged Oct 18, 2023

clefourrier

Open LLM Leaderboard org Nov 9, 2023

Hi @chriscanal,
FYI, we deactivated your cool tab for now because we are updating the front end to make it more maintainable, but we'll add it back as soon as the front end is upgraded (ETA next week most likely) 🤗
