Update README.md
Browse files
README.md
CHANGED
@@ -44,6 +44,7 @@ datasets:
|
|
44 |
The LongForm dataset is created by leveraging English corpus examples with reverse instructions. We select a diverse set of human-written documents from existing corpora such as C4 and Wikipedia and generate instructions for the given documents via LLMs. Then, we extend these examples with structured corpora examples such as Stack Exchange and WikiHow and task examples such as question answering, email writing, grammar error correction, story/poem generation, and text summarization.
|
45 |
|
46 |
Github Repo: https://github.com/akoksal/LongForm
|
|
|
47 |
|
48 |
### For OPT and LLaMA models: Use [EOI] to indicate the end of instruction.
|
49 |
|
|
|
44 |
The LongForm dataset is created by leveraging English corpus examples with reverse instructions. We select a diverse set of human-written documents from existing corpora such as C4 and Wikipedia and generate instructions for the given documents via LLMs. Then, we extend these examples with structured corpora examples such as Stack Exchange and WikiHow and task examples such as question answering, email writing, grammar error correction, story/poem generation, and text summarization.
|
45 |
|
46 |
Github Repo: https://github.com/akoksal/LongForm
|
47 |
+
![The LongForm dataset](https://github.com/akoksal/LongForm/blob/main/figures/intro_example.jpg?raw=true)
|
48 |
|
49 |
### For OPT and LLaMA models: Use [EOI] to indicate the end of instruction.
|
50 |
|