adjustments
README.md
CHANGED
@@ -42,7 +42,7 @@ This model is a fine-tuned version of [egonrp/gpt2-wikiwriter-medium-portuguese]
 
 ** It's a chatbot experiment. ;)
 
-The model was trained in 12 hours on a RTX 3060 12GB.
+The model was trained in 12 hours on a RTX 3060 12GB with training argument "--fp16".
 
 
 ## Model description
@@ -59,6 +59,17 @@ More information needed
 
 ## Training procedure
 
+```
+python3 run_clm.py \
+--model_name_or_path egonrp/gpt2-wikiwriter-medium-portuguese \
+--train_file /home/egon/dev/gptsquad_data/converted_squad_merged_out_v4c.txt \
+--do_train \
+--num_train_epochs 3 \
+--per_device_train_batch_size 1 \
+--output_dir /home/egon/dev/gptsquad_model/results_v4c_medium_no_eval \
+--fp16
+```
+
 ### Training hyperparameters
 
 The following hyperparameters were used during training: