Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
File size: 1,559 Bytes
4445ad2 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 |
version: 0.0.4
tasks:
oab_exams:
benchmark: oab_exams
col_name: OAB Exams
task_list:
- oab_exams_generate
metric: exact_match
few_shot: 5
limit: null
baseline: 25.0
human_baseline: 50.0
description: OAB Exams is a dataset of 2,000 questions from the Brazilian Bar
Association's exams.
link: https://huggingface.co/datasets/eduagarcia/oab_exams
brazilian_court_decisions_judgment:
benchmark: brazilian_court_decisions_judgment
col_name: BR Court Decisions
task_list:
- brazilian_court_decisions_judgment_generate
metric: f1_macro
few_shot: 5
limit: null
baseline: 33.33
human_baseline: 100.0
description: A classification dataset of court decisions from the Tribunal de
Justiça de Alagoas (TJAL, the State Supreme Court of Alagoas (Brazil).
link: https://huggingface.co/datasets/joelniklaus/brazilian_court_decisions
datalawyer_frases:
benchmark: datalawyer_frases
col_name: DL Frases
task_list:
- datalawyer_frases_generate
metric: f1_macro
few_shot: 15
limit: 2000
baseline: 10.0
human_baseline: 100.0
description: A classification dataset
link: https://huggingface.co/datasets/eduagarcia/portuguese_benchmark
rrip:
benchmark: rrip
col_name: RRIP
task_list:
- rrip_generate
metric: f1_macro
few_shot: 15
limit: null
baseline: 12.5
human_baseline: 100.0
description: A classification dataset
link: https://huggingface.co/datasets/eduagarcia/portuguese_benchmark
|