Update README.md
Browse files
README.md
CHANGED
@@ -25,9 +25,9 @@ The model was evaluated on a held-out sample from the STAR-QA dataset (see below
|
|
25 |
|
26 |
## Training Data
|
27 |
|
28 |
-
The model was fine-tuned
|
29 |
|
30 |
-
The resulting question and its associated ground-truth answer (collectively a "pair") constitute a single training example for the fine-tuning step.
|
31 |
|
32 |
## Training
|
33 |
The model was fine-tuned with the parameters:
|
|
|
25 |
|
26 |
## Training Data
|
27 |
|
28 |
+
The model was fine-tuned on a corpus of audit, risk-management, compliance and associated regulatory documents sourced from the public internet. Documents were cleaned and chunked into 2-sentence blocks. Each block was then sent to a state-of-the-art LLM with the following prompt: "Write a question about {document_topic} for which this is the answer: {block}"
|
29 |
|
30 |
+
The resulting question and its associated ground-truth answer (collectively a "pair") constitute a single training example for the fine-tuning step. The final model was fine-tuned on ~18K such pairs.
|
31 |
|
32 |
## Training
|
33 |
The model was fine-tuned with the parameters:
|