miguelcarv/phi-1_5-slimorca

miguelcarv committed on Jan 17

Commit d251b72 • 1 Parent(s): 1d37eee

Update README.md

Files changed (1)

README.md +4 -3

@@ -44,8 +44,9 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ## Training Details
  - Trained for one epoch on SlimOrca-Dedup
- - Learning rate: 1e-5
+ - Learning rate: 2e-5
+ - Cosine learning rate decay to 0
  - Optimizer: AdamW
- - Effective batch size: 64
- - Gradient accumulation steps (mini batch size): 16 (4)
+ - Effective batch size: 256
+ - Gradient accumulation steps (mini batch size): 64 (4)
  - Trained with FP32
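
The batch-size figures in this change are consistent with effective batch size = mini batch size × gradient accumulation steps (4 × 16 = 64 before, 4 × 64 = 256 after), which would hold for a single device. The sketch below checks that arithmetic and illustrates one common form of cosine decay to 0. It is not taken from the training code: the function names, the `num_devices` and `total_steps` parameters, and the absence of warmup are all assumptions made for illustration.

```python
import math


def effective_batch_size(mini_batch_size: int, grad_accum_steps: int,
                         num_devices: int = 1) -> int:
    """Samples contributing to one optimizer step.

    num_devices=1 is an assumption; it is the value that makes the
    README's numbers line up (4 * 16 = 64 and 4 * 64 = 256).
    """
    return mini_batch_size * grad_accum_steps * num_devices


def cosine_lr(step: int, total_steps: int, peak_lr: float = 2e-5) -> float:
    """Cosine decay from peak_lr at step 0 down to 0 at total_steps.

    No warmup phase is modeled here (an assumption; the README does not
    mention one). total_steps is a hypothetical parameter.
    """
    progress = min(max(step / total_steps, 0.0), 1.0)
    return 0.5 * peak_lr * (1.0 + math.cos(math.pi * progress))


# Values from the old and new versions of the README.
old_effective = effective_batch_size(mini_batch_size=4, grad_accum_steps=16)
new_effective = effective_batch_size(mini_batch_size=4, grad_accum_steps=64)

if __name__ == "__main__":
    print(old_effective, new_effective)
    # Sample the schedule at the start, midpoint, and end of a
    # hypothetical 1000-step run.
    for step in (0, 500, 1000):
        print(step, cosine_lr(step, total_steps=1000))
```

Under these assumptions the learning rate starts at the new peak of 2e-5, passes through roughly half of that at the midpoint, and reaches 0 at the final step.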