tttoaster committed
Commit 29fa0ea
1 Parent(s): e5f747b

Update constants.py

Files changed (1)
  1. constants.py +13 -2
constants.py CHANGED
@@ -39,11 +39,22 @@ SUBMIT_INTRODUCTION = """# Submit Precautions
  1. Attain JSON file from our [github repository](https://github.com/AILab-CVC/SEED-Bench) after evaluation. For example, you can obtain InstructBLIP's JSON file as results/results.json after running
  ```shell
  python eval.py --model instruct_blip --anno_path SEED-Bench.json --output-dir results
- ```
- 2. If you want to revise a model, please ensure 'Revision Model Name' align with what's in the leaderboard. For example, if you want to modify InstructBLIP's evaluation result, you need to fill in 'InstructBLIP' in 'Revision Model Name'.
+ ```
+ 2. If you want to revise a model, please ensure 'Model Name Revision' align with what's in the leaderboard. For example, if you want to modify InstructBLIP's evaluation result, you need to fill in 'InstructBLIP' in 'Revision Model Name'.
  3. Please ensure the right link for each submission. Everyone could go to the model's repository through the model name in the leaderboard.
  4. If you don't want to evaluate all dimensions, not evaluated dimension performance, and its corresponding average performance will be set to 0.
  5. After clicking 'Submit Eval', you can click 'Refresh' to obtain the latest leaderboard.
+
+ ## Submit Example
+ For example, if you want to revise InstructBLIP's performance in the leaderboard, you need to:
+ (1). Fill in 'InstructBLIP' in 'Revision Model Name'.
+ (2). Select 'ImageLLM' in 'Model Type'.
+ (3). Fill in 'https://github.com/salesforce/LAVIS' in 'Model Link'.
+ (4). Select 'Flan-T5-XL' in 'LLM Type'.
+ (5). Select 'All' in 'Evaluation Dimension'.
+ (6). Upload results.json.
+ (7). Click the 'Submit Eval' button.
+ (8). Click 'Refresh' to obtain the uploaded leaderboard.
  """

  TABLE_INTRODUCTION = """In the table below, we summarize each task performance of all the models.
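
As a usage note on step 1 of the submit instructions, a minimal sketch of a pre-submission check, assuming only the results/results.json path from the example command above; the exact schema of the file is defined by the SEED-Bench repository, so this only verifies that the file parses as JSON before it is uploaded through the form.

```python
import json
from pathlib import Path

# Illustrative sketch: confirm that the file written by eval.py
# (results/results.json in the example command above) exists and parses
# as valid JSON before uploading it via 'Submit Eval'.
# The actual schema is defined by the SEED-Bench repository and is not checked here.
result_path = Path("results/results.json")  # assumed output path from the example

with result_path.open("r", encoding="utf-8") as f:
    data = json.load(f)  # raises json.JSONDecodeError if the file is malformed

print(f"Loaded {result_path}: top-level {type(data).__name__} with {len(data)} entries")
```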