
DailyChat-350M

A fine-tuned version of Codegen-350M-nl, trained on the 'daily_dialog' dataset. The goal is a model that is capable of holding a decent conversation.

Training Procedure

This model was trained on Kaggle's servers using a single NVIDIA P100 GPU, for 1 epoch with a learning rate of 1e-2.
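As a rough illustration, the two stated hyperparameters could be expressed as Hugging Face `TrainingArguments`. This is only a sketch: the card does not say which training loop was used, and the output directory and batch size below are hypothetical placeholders. The base checkpoint id `Salesforce/codegen-350M-nl` is assumed to be the Hub repo for Codegen-350M-nl.

```python
BASE_MODEL = "Salesforce/codegen-350M-nl"  # assumed Hub id of the base model


def training_kwargs(output_dir="dailychat-350m"):
    """Collect the hyperparameters for a single fine-tuning run."""
    return {
        "output_dir": output_dir,            # hypothetical path
        "num_train_epochs": 1,               # stated in the model card
        "learning_rate": 1e-2,               # stated in the model card
        "per_device_train_batch_size": 4,    # assumption, not stated in the card
    }


def make_training_args():
    # Imported lazily so training_kwargs() works without transformers installed.
    from transformers import TrainingArguments

    return TrainingArguments(**training_kwargs())


if __name__ == "__main__":
    print(training_kwargs())
```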

Biases & Limitations

This model likely has the same biases and limitations as the base model, plus heavy biases from the dataset. It can generate offensive output when prompted, so user discretion is advised.

Intended Use

Dialog generation, chat agents.
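A minimal sketch of dialog generation with the `transformers` library is shown here. The prompt format (one turn per line) and the sampling settings are assumptions; the card does not document how dialogs were formatted during fine-tuning, so results may improve with a different format. `build_prompt` and `generate_reply` are hypothetical helper names.

```python
MODEL_ID = "DarwinAnim8or/DailyChat-350M"  # repo id as shown on the model page


def build_prompt(turns):
    """Join prior dialog turns into one prompt string.

    Assumption: turns are separated by newlines. The actual fine-tuning
    format is not documented in the model card.
    """
    return "\n".join(t.strip() for t in turns) + "\n"


def generate_reply(turns, max_new_tokens=40):
    # Imported lazily so build_prompt() works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(build_prompt(turns), return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,       # sampling settings are illustrative only
        top_p=0.9,
        temperature=0.8,
        pad_token_id=tokenizer.eos_token_id,
    )
    # Drop the prompt tokens and decode only the continuation.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    text = tokenizer.decode(new_tokens, skip_special_tokens=True)
    # Keep only the first generated line as the reply.
    return text.strip().split("\n")[0]


if __name__ == "__main__":
    print(generate_reply(["Hi, how was your weekend?"]))
```

Because the model can produce offensive text, a chat agent built on it would likely need its own output filtering.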

Model size: 441M params (Safetensors). Tensor types: F32, BOOL.

Dataset used to train DarwinAnim8or/DailyChat-350M