---
license: apache-2.0
datasets:
- shibing624/sharegpt_gpt4
language:
- en
- zh
- fr
- es
---
# Model Card
This model is Pythia-70m-deduped fine-tuned on a cleaned version of the ShareGPT data.
<br>The cleaned dataset was obtained by removing duplicates and paraphrases from the original corpus. The final training set contains 3494 instances.
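
The card does not say how duplicates and paraphrases were detected, so the following is only a minimal sketch of one possible approach, not the procedure actually used here. It assumes that near-duplicates can be approximated by token-set Jaccard overlap; the function names, the `threshold` value of 0.8, and the sample texts are all hypothetical. True paraphrase detection would likely need a semantic similarity measure, which this sketch does not attempt.

```python
import re


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return " ".join(text.split())


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity between two token sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def dedupe(texts, threshold=0.8):
    """Keep the first occurrence of each text; drop exact and near-duplicates.

    `threshold` is an illustrative value, not one taken from the model card.
    """
    kept, kept_tokens = [], []
    for text in texts:
        tokens = set(normalize(text).split())
        # Compare against every text kept so far (quadratic, fine for small corpora).
        if any(jaccard(tokens, seen) >= threshold for seen in kept_tokens):
            continue
        kept.append(text)
        kept_tokens.append(tokens)
    return kept


# Hypothetical sample prompts, not taken from the dataset.
samples = [
    "How do I reverse a list in Python?",
    "How do I reverse a list in Python?",  # exact duplicate
    "how do i reverse a list in python",   # differs only in case and punctuation
    "What is the capital of France?",
]
cleaned = dedupe(samples)
```

Whitespace tokenization is a simplification: it works for the English, French, and Spanish text, but Chinese text would need a proper word segmenter before the overlap measure is meaningful.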