lewtun HF staff commited on
Commit
79a5877
1 Parent(s): 20aaf01

Add description

Browse files
Files changed (1) hide show
  1. app.py +5 -4
app.py CHANGED
@@ -83,10 +83,11 @@ st.title("RAFT Leaderboard")
83
  st.markdown("""
84
  RAFT: Real-world Annotated Few-shot Tasks
85
 
86
- Large pre-trained language models have shown promise for few-shot learning, completing text-based tasks given only a few task-specific examples. Will models soon solve classification tasks that have so far been reserved for human research assistants? RAFT is a few-shot classification benchmark that tests language models
87
- across multiple domains (lit review, tweets, customer interaction, etc.)
88
- on economically valuable classification tasks (someone inherently cares about the task)
89
- in a setting that mirrors deployment (50 examples per task, info retrieval allowed, hidden test set)
 
90
 
91
  To submit to RAFT, follow the instruction posted on [this page](https://github.com/oughtinc/raft_submission).
92
  """)
83
  st.markdown("""
84
  RAFT: Real-world Annotated Few-shot Tasks
85
 
86
+ Large pre-trained language models have shown promise for few-shot learning, completing text-based tasks given only a few task-specific examples. Will models soon solve classification tasks that have so far been reserved for human research assistants? RAFT is a few-shot classification benchmark that tests language models:
87
+
88
+ - across multiple domains (lit review, tweets, customer interaction, etc.)
89
+ - on economically valuable classification tasks (someone inherently cares about the task)
90
+ - in a setting that mirrors deployment (50 examples per task, info retrieval allowed, hidden test set)
91
 
92
  To submit to RAFT, follow the instruction posted on [this page](https://github.com/oughtinc/raft_submission).
93
  """)