egonrp commited on
Commit
93ab61b
1 Parent(s): 58c7b0a
Files changed (1) hide show
  1. README.md +12 -1
README.md CHANGED
@@ -42,7 +42,7 @@ This model is a fine-tuned version of [egonrp/gpt2-wikiwriter-medium-portuguese]
42
 
43
  ** It's a chatbot experiment. ;)
44
 
45
- The model was trained in 12 hours on a RTX 3060 12GB.
46
 
47
 
48
  ## Model description
@@ -59,6 +59,17 @@ More information needed
59
 
60
  ## Training procedure
61
 
 
 
 
 
 
 
 
 
 
 
 
62
  ### Training hyperparameters
63
 
64
  The following hyperparameters were used during training:
 
42
 
43
  ** It's a chatbot experiment. ;)
44
 
45
+ The model was trained in 12 hours on a RTX 3060 12GB with training argument "--fp16".
46
 
47
 
48
  ## Model description
 
59
 
60
  ## Training procedure
61
 
62
+ ```
63
+ python3 run_clm.py \
64
+ --model_name_or_path egonrp/gpt2-wikiwriter-medium-portuguese \
65
+ --train_file /home/egon/dev/gptsquad_data/converted_squad_merged_out_v4c.txt \
66
+ --do_train \
67
+ --num_train_epochs 3 \
68
+ --per_device_train_batch_size 1 \
69
+ --output_dir /home/egon/dev/gptsquad_model/results_v4c_medium_no_eval \
70
+ --fp16
71
+ ```
72
+
73
  ### Training hyperparameters
74
 
75
  The following hyperparameters were used during training: