DailyChat-350M
A finetuned version of Codegen-350M-nl on the 'daily_dialog' dataset. The idea of this model is to create one that is capable of holding a decent conversation.
Training Procedure
This was trained on Kaggle's servers using 1x NVIDIA P100. This model was trained for 1 epoch with learning rate 1e-2.
Biases & Limitations
This likely contains the same biases and limitations as the original model that it is based on, and additionally heavy biases from the dataset. It can generate offensive input when prompted, so user discretion is advised.
Intended Use
Dialog generation, chat agents.
- Downloads last month
- 20
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.