Edit model card

DailyChat-350M

A finetuned version of Codegen-350M-nl on the 'daily_dialog' dataset. The idea of this model is to create one that is capable of holding a decent conversation.

Training Procedure

This was trained on Kaggle's servers using 1x NVIDIA P100. This model was trained for 1 epoch with learning rate 1e-2.

Biases & Limitations

This likely contains the same biases and limitations as the original model that it is based on, and additionally heavy biases from the dataset. It can generate offensive input when prompted, so user discretion is advised.

Intended Use

Dialog generation, chat agents.

Downloads last month
95
Safetensors
Model size
441M params
Tensor type
F32
·
BOOL
·

Dataset used to train DarwinAnim8or/DailyChat-350M