martinigoyanes committed on
Commit
3955552
·
1 Parent(s): 7272259

update links

Files changed (1)
  1. dabstep_benchmark/content.py +7 -1
dabstep_benchmark/content.py CHANGED
@@ -1,10 +1,16 @@
  TITLE = """# 🏅 DABStep Leaderboard"""

  INTRODUCTION_TEXT = """
- The Data Agent Benchmark for Multi-step Reasoning (DABStep) is looking to measure and push the state-of-the-art in Data Analysis by LLMs.
+ The [Data Agent Benchmark for Multi-step Reasoning (DABStep)](https://huggingface.co/blog/dabstep) is looking to measure and push the state-of-the-art in Data Analysis by LLMs.
  The benchmark is composed of ~450 data analysis questions ([Dataset Link](https://huggingface.co/datasets/adyen/data-agents-benchmark)) centered around 1 or more documents that agents will have to understand and cross reference in order to answer correctly.

  We have set up a notebook to quickly get an agent baseline using the free Huggingface Inference API: [Colab Notebook](https://colab.research.google.com/drive/1pXi5ffBFNJQ5nn1111SnIfjfKCOlunxu)
+
+ Check out the official technical reports here:
+ [Adyen Report](https://www.adyen.com/knowledge-hub/data-agent-benchmark-for-multi-step-reasoning-dabstep)
+ [Hugging Face Report](https://huggingface.co/blog/dabstep)
+
+ Join the discussion on the [discord server!](https://discord.gg/zJSVKmRy)
  """

  SUBMISSION_TEXT = """