andreaskoepf committed
Commit 39f6fc8 (parent: 934bda9)

Update README.md

Files changed (1): README.md (+1 -0)
README.md CHANGED
@@ -6,6 +6,7 @@ license: apache-2.0
 
 This is an intermediate model used as a base model for further pythia 12b SFT-8 experiments.
 It was trained on a wider set of instruction-tuning datasets for >12.5k steps with batch size 128 and a context size of 2048.
+The gpt4all dataset had "as a language model" *contamination* (>1.8k entries). We added filtering later, but this model (pre-v8) was trained on the raw, unfiltered gpt4all dataset.
 
 
 - wandb: https://wandb.ai/open-assistant/supervised-finetuning/runs/sytsyhrp
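For reference, a minimal sketch of the kind of phrase-based contamination filtering mentioned in the added line. The function names, dataset schema (`prompt`/`response` dicts), and phrase list are assumptions for illustration only, not the actual Open-Assistant filtering code:

```python
# Illustrative sketch only; the real Open-Assistant filter and dataset schema may differ.

def is_contaminated(text: str, phrases=("as a language model",)) -> bool:
    """Return True if the text contains any of the disclaimer phrases."""
    lowered = text.lower()
    return any(p in lowered for p in phrases)


def filter_dataset(entries):
    """Keep only entries whose 'response' field is free of disclaimer phrases."""
    return [e for e in entries if not is_contaminated(e["response"])]


if __name__ == "__main__":
    sample = [
        {"prompt": "Tell me a joke.", "response": "As a language model, I cannot..."},
        {"prompt": "What is 2+2?", "response": "4"},
    ]
    print(filter_dataset(sample))  # only the second entry survives
```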