ncoop57 commited on
Commit
e6d8027
1 Parent(s): 81efd44

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -1
README.md CHANGED
@@ -10,7 +10,37 @@ GPT-CC-125M-Code-Search is a [GPT-Neo-125M model](https://huggingface.co/Eleuthe
10
 
11
  ## Training procedure
12
 
13
- The training script used to train this model can be found [here](https://github.com/ncoop57/gpt-code-clippy/blob/camera-ready/training/run_clm_apps.py).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
 
15
  ## Intended Use and Limitations
16
 
10
 
11
  ## Training procedure
12
 
13
+ The training script used to train this model can be found [here](https://github.com/ncoop57/gpt-code-clippy/blob/camera-ready/training/run_clm_flax.py).
14
+
15
+ ```bash
16
+ ./run_clm_flax.py \
17
+ --output_dir $HOME/gpt-neo-125M-code-search-py \
18
+ --model_name_or_path="EleutherAI/gpt-neo-125M" \
19
+ --dataset_name code_search_net \
20
+ --dataset_config_name="python" \
21
+ --do_train --do_eval \
22
+ --block_size="512" \
23
+ --per_device_train_batch_size="32" \
24
+ --per_device_eval_batch_size="64" \
25
+ --preprocessing_num_workers="8" \
26
+ --learning_rate="1.2e-4" \
27
+ --num_train_epochs 20 \
28
+ --warmup_steps 3000 \
29
+ --adam_beta1="0.9" \
30
+ --adam_beta2="0.95" \
31
+ --weight_decay="0.1" \
32
+ --overwrite_output_dir \
33
+ --logging_steps="25" \
34
+ --eval_steps="500" \
35
+ --push_to_hub="False" \
36
+ --report_to="all" \
37
+ --dtype="bfloat16" \
38
+ --skip_memory_metrics="True" \
39
+ --save_steps="500" \
40
+ --save_total_limit 10 \
41
+ --report_to="wandb" \
42
+ --run_name="gpt-neo-125M-code-search-py"
43
+ ```
44
 
45
  ## Intended Use and Limitations
46