Commit · 3955552
1 Parent(s): 7272259
update links
dabstep_benchmark/content.py CHANGED
@@ -1,10 +1,16 @@
 TITLE = """# 🏅 DABStep Leaderboard"""
 
 INTRODUCTION_TEXT = """
-The Data Agent Benchmark for Multi-step Reasoning (DABStep) is looking to measure and push the state-of-the-art in Data Analysis by LLMs.
+The [Data Agent Benchmark for Multi-step Reasoning (DABStep)](https://huggingface.co/blog/dabstep) is looking to measure and push the state-of-the-art in Data Analysis by LLMs.
 The benchmark is composed of ~450 data analysis questions ([Dataset Link](https://huggingface.co/datasets/adyen/data-agents-benchmark)) centered around 1 or more documents that agents will have to understand and cross reference in order to answer correctly.
 
 We have set up a notebook to quickly get an agent baseline using the free Huggingface Inference API: [Colab Notebook](https://colab.research.google.com/drive/1pXi5ffBFNJQ5nn1111SnIfjfKCOlunxu)
+
+Check out the official technical reports here:
+[Adyen Report](https://www.adyen.com/knowledge-hub/data-agent-benchmark-for-multi-step-reasoning-dabstep)
+[Hugging Face Report](https://huggingface.co/blog/dabstep)
+
+Join the discussion on the [discord server!](https://discord.gg/zJSVKmRy)
 """
 
 SUBMISSION_TEXT = """