uer commited on
Commit
4607e23
1 Parent(s): 82f9aa4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -3
README.md CHANGED
@@ -62,12 +62,20 @@ python3 pretrain.py --dataset_path couplet.pt \
62
  --output_model_path models/couplet_gpt_base_model.bin \
63
  --config_path models/bert_base_config.json --learning_rate 5e-4 \
64
  --tie_weight --world_size 8 --gpu_ranks 0 1 2 3 4 5 6 7 \
65
- --batch_size 64 --report_steps 1000 \
66
- --save_checkpoint_steps 5000 --total_steps 25000 \
67
- --embedding gpt --encoder gpt2 --target lm
 
68
 
69
  ```
70
 
 
 
 
 
 
 
 
71
  ### BibTeX entry and citation info
72
 
73
  ```
 
62
  --output_model_path models/couplet_gpt_base_model.bin \
63
  --config_path models/bert_base_config.json --learning_rate 5e-4 \
64
  --tie_weight --world_size 8 --gpu_ranks 0 1 2 3 4 5 6 7 \
65
+ --embedding word_pos --remove_embedding_layernorm \
66
+ --encoder transformer --mask causal --target lm \
67
+ --layernorm_positioning pre --batch_size 64 --report_steps 1000 \
68
+ --save_checkpoint_steps 5000 --total_steps 25000
69
 
70
  ```
71
 
72
+ Finally, we convert the pre-trained model into Huggingface's format:
73
+ ```
74
+ python3 scripts/convert_gpt2_from_uer_to_huggingface.py --input_model_path couplet_gpt_base_model.bin \
75
+ --output_model_path pytorch_model.bin \
76
+ --layers 12
77
+ ```
78
+
79
  ### BibTeX entry and citation info
80
 
81
  ```