Spaces:

adyen
/

DABstep

Running on CPU Upgrade

martinigoyanes commited on Feb 9

Commit

3955552

1 Parent(s): 7272259

update links

Files changed (1) hide show

dabstep_benchmark/content.py CHANGED Viewed

@@ -1,10 +1,16 @@
 TITLE = """# 🏅 DABStep Leaderboard"""
 INTRODUCTION_TEXT = """
-The Data Agent Benchmark for Multi-step Reasoning (DABStep) is looking to measure and push the state-of-the-art in Data Analysis by LLMs.
 The benchmark is composed of ~450 data analysis questions ([Dataset Link](https://huggingface.co/datasets/adyen/data-agents-benchmark)) centered around 1 or more documents that agents will have to understand and cross reference in order to answer correctly.
 We have set up a notebook to quickly get an agent baseline using the free Huggingface Inference API: [Colab Notebook](https://colab.research.google.com/drive/1pXi5ffBFNJQ5nn1111SnIfjfKCOlunxu)
 """
 SUBMISSION_TEXT = """

 TITLE = """# 🏅 DABStep Leaderboard"""
 INTRODUCTION_TEXT = """
+The [Data Agent Benchmark for Multi-step Reasoning (DABStep)](https://huggingface.co/blog/dabstep) is looking to measure and push the state-of-the-art in Data Analysis by LLMs.
 The benchmark is composed of ~450 data analysis questions ([Dataset Link](https://huggingface.co/datasets/adyen/data-agents-benchmark)) centered around 1 or more documents that agents will have to understand and cross reference in order to answer correctly.
 We have set up a notebook to quickly get an agent baseline using the free Huggingface Inference API: [Colab Notebook](https://colab.research.google.com/drive/1pXi5ffBFNJQ5nn1111SnIfjfKCOlunxu)
+Check out the official technical reports here:
+[Adyen Report](https://www.adyen.com/knowledge-hub/data-agent-benchmark-for-multi-step-reasoning-dabstep)
+[Hugging Face Report](https://huggingface.co/blog/dabstep)
+Join the discussion on the [discord server!](https://discord.gg/zJSVKmRy)
 """
 SUBMISSION_TEXT = """