DarwinAnim8or
commited on
Commit
•
c5bf550
1
Parent(s):
589d48a
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,34 @@
|
|
1 |
---
|
2 |
license: bsd-3-clause
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: bsd-3-clause
|
3 |
+
datasets:
|
4 |
+
- daily_dialog
|
5 |
+
language:
|
6 |
+
- en
|
7 |
+
tags:
|
8 |
+
- chat
|
9 |
+
- dialog
|
10 |
+
- natural language
|
11 |
+
- codegen
|
12 |
+
co2_eq_emissions:
|
13 |
+
emissions: 40
|
14 |
+
source: https://mlco2.github.io/impact/#compute
|
15 |
+
training_type: fine-tuning
|
16 |
+
geographical_location: Oregon, USA
|
17 |
+
hardware_used: 1x NVIDIA P100, Kaggle
|
18 |
+
library_name: transformers
|
19 |
---
|
20 |
+
|
21 |
+
# DailyChat-350M
|
22 |
+
A finetuned version of [Codegen-350M-nl](https://huggingface.co/Salesforce/Codegen-350M-nl) on the 'daily_dialog' dataset.
|
23 |
+
The idea of this model is to create one that is capable of holding a decent conversation.
|
24 |
+
|
25 |
+
# Training Procedure
|
26 |
+
This was trained on Kaggle's servers using 1x NVIDIA P100.
|
27 |
+
This model was trained for 1 epoch with learning rate 1e-2.
|
28 |
+
|
29 |
+
# Biases & Limitations
|
30 |
+
This likely contains the same biases and limitations as the original model that it is based on, and additionally heavy biases from the dataset.
|
31 |
+
It *can* generate offensive input when prompted, so user discretion is advised.
|
32 |
+
|
33 |
+
# Intended Use
|
34 |
+
Dialog generation, chat agents.
|