metadata
license: apache-2.0
datasets:
  - shibing624/sharegpt_gpt4
language:
  - en
  - zh
  - fr
  - es

Model Card

Pythia-70m-deduped fine-tuned on a cleaned version of the ShareGPT data.
The cleaned dataset is obtained by removing duplicates and paraphrases from the original corpus. The final training set contains 3494 instances.
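
The card does not say how duplicates and paraphrases were detected. As a rough illustration only, the sketch below drops exact duplicates after text normalization and then drops near-duplicates whose token overlap (Jaccard similarity) exceeds a threshold. The function names, the `0.8` threshold, and the use of Jaccard overlap as a stand-in for paraphrase detection are all assumptions, not the method used to build this dataset.

```python
import re


def normalize(text: str) -> str:
    """Lowercase and keep only word characters so trivial variants compare equal."""
    return " ".join(re.findall(r"\w+", text.lower()))


def jaccard(a: set, b: set) -> float:
    """Token-set overlap in [0, 1]; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def dedupe(texts, threshold=0.8):
    """Keep the first occurrence of each text; drop exact and near duplicates.

    `threshold` is a hypothetical cutoff chosen for illustration.
    """
    kept, kept_tokens, seen = [], [], set()
    for text in texts:
        norm = normalize(text)
        if norm in seen:  # exact duplicate after normalization
            continue
        tokens = set(norm.split())
        # Near-duplicate: high token overlap with something already kept.
        if any(jaccard(tokens, other) >= threshold for other in kept_tokens):
            continue
        seen.add(norm)
        kept.append(text)
        kept_tokens.append(tokens)
    return kept


samples = [
    "How do I sort a list in Python?",
    "how do I sort a list in python",           # exact duplicate once normalized
    "How do I sort a list in Python quickly?",  # near duplicate (overlap 8/9)
    "What is the capital of France?",
]
cleaned = dedupe(samples)
```

Token overlap is a crude proxy: it misses paraphrases that use different words, and whitespace-based tokenization does not suit Chinese text. An embedding-based similarity measure would likely be needed for a corpus like this one. The pairwise comparison is also quadratic in the number of kept texts, which is acceptable only for small corpora.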