MichaelFFan committed on
Commit
90276f3
1 Parent(s): f94939a

Update README.md

  1. README.md +11 -4
README.md CHANGED
@@ -19,11 +19,18 @@ This model is a fine-tuned version of `meta-llama/Llama-3.2-1B`, adapted using D
 - **License:** MIT
 - **Finetuned from model:** meta-llama/Llama-3.2-1B
 
-### Model Sources
+## Training Hyperparameters
+
+- **Training regime:** Mixed precision (fp16)
+- **Learning rate:** 2e-4
+- **Batch size:** 8
+- **Number of epochs:** 3
+- **Optimizer:** AdamW with 8-bit precision
+- **Max sequence length:** 512 tokens
+- **Warmup steps:** 100
+- **Weight decay:** 0.01
+
 
-- **Repository:** [Link to the Hugging Face model repo]
-- **Paper [optional]:** [If applicable, link to related research or documentation]
-- **Demo [optional]:** [If a demo exists, provide a link]
 
 ## Uses
 
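For readers who want to reproduce the hyperparameters added in this commit, the sketch below collects them in one place and maps them onto keyword names used by `transformers.TrainingArguments`. The commit does not show the training script, so this mapping is an assumption: the `Hyperparameters` class and `to_training_kwargs` helper are hypothetical names, the batch size of 8 is treated as per-device, and `"adamw_bnb_8bit"` is assumed to be the optimizer meant by "AdamW with 8-bit precision". The maximum sequence length is not a `TrainingArguments` field, so it is kept separately as a tokenizer truncation setting.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hyperparameters:
    """Values copied from the README's 'Training Hyperparameters' list."""
    learning_rate: float = 2e-4
    batch_size: int = 8            # assumed to be per-device; the README does not say
    num_epochs: int = 3
    max_seq_length: int = 512      # in tokens
    warmup_steps: int = 100
    weight_decay: float = 0.01
    fp16: bool = True              # mixed precision (fp16)
    optimizer: str = "adamw_bnb_8bit"  # assumption: 8-bit AdamW via bitsandbytes


def to_training_kwargs(hp: Hyperparameters) -> dict:
    """Map the values onto keyword names accepted by transformers.TrainingArguments."""
    return {
        "learning_rate": hp.learning_rate,
        "per_device_train_batch_size": hp.batch_size,
        "num_train_epochs": hp.num_epochs,
        "warmup_steps": hp.warmup_steps,
        "weight_decay": hp.weight_decay,
        "fp16": hp.fp16,
        "optim": hp.optimizer,
    }


hp = Hyperparameters()
training_kwargs = to_training_kwargs(hp)

# Sequence length is applied at tokenization time rather than in TrainingArguments.
tokenizer_kwargs = {"truncation": True, "max_length": hp.max_seq_length}
```

With `transformers` installed, the dictionary could be passed as `TrainingArguments(output_dir="out", **training_kwargs)`, where `"out"` is a placeholder directory; the 8-bit optimizer additionally requires the `bitsandbytes` package.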