LoRA rank-16 adapter fine-tuned from **`kfdong/STP_model_Lean_0320`** to assist …
| **Context length** | 1792 tokens |
| **Hardware** | 1 × H100 80 GB |
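
To try the adapter, a minimal loading sketch with 🤗 `transformers` and `peft` might look like the following. This is an assumption, not an official snippet from this card: `ADAPTER_ID` is a placeholder (the card does not state the adapter's repo id), and `attn_implementation="flash_attention_2"` is optional and requires `flash-attn`, matching the Flash-Attention v2 setting listed under Training Hyperparameters below.

```python
# Minimal loading sketch (assumed, not from this card).
# ADAPTER_ID is a placeholder -- substitute this adapter's actual repo id.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE_ID = "kfdong/STP_model_Lean_0320"   # base model named in this card
ADAPTER_ID = "<this-adapter-repo-id>"    # placeholder, not a real repo id

tokenizer = AutoTokenizer.from_pretrained(BASE_ID)
base = AutoModelForCausalLM.from_pretrained(
    BASE_ID,
    torch_dtype=torch.bfloat16,               # card trains in bf16
    attn_implementation="flash_attention_2",  # optional; requires flash-attn
    device_map="auto",
)
model = PeftModel.from_pretrained(base, ADAPTER_ID)  # attach rank-16 LoRA weights
model.eval()
```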

---

#### Training Hyperparameters

| Setting | Value |
|-------------------------------|---------------------------------------------|
| Precision / regime            | **bf16 mixed precision** |
| Epochs                        | **1** |
| Max sequence length           | 1792 tokens (right-padding) |
| Per-device train batch size   | 6 |
| Per-device eval batch size    | 2 |
| Gradient accumulation steps   | 1 (effective batch = 6) |
| Optimizer                     | AdamW |
| Learning rate schedule        | **2 × 10⁻⁴**, cosine, warm-up **3 %** |
| Weight decay                  | 0.01 |
| LoRA rank / α / dropout       | r = 16, α = 32 (2 × r), dropout = 0.05 |
| Gradient checkpointing        | Enabled (memory-efficient) |
| Flash-Attention v2            | Enabled |
| Logging                       | every 50 steps |
| Evaluation strategy           | once per epoch |
| Save strategy                 | once per epoch |
| Seed                          | 42 |
| Hardware                      | 1 × H100 80 GB |
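
For reference, here is a sketch of how the settings above map onto 🤗 `peft` / `transformers` config objects. This is an assumed reconstruction, not the author's actual training script: `output_dir` is a placeholder, `target_modules` is omitted because the card does not list it, and the 1792-token right-padded sequence length and Flash-Attention v2 are applied at tokenizer / model load rather than here.

```python
# Assumed reconstruction of the table above (not the author's script).
from peft import LoraConfig
from transformers import TrainingArguments

lora_config = LoraConfig(
    r=16,               # LoRA rank
    lora_alpha=32,      # alpha = 2 x r
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

training_args = TrainingArguments(
    output_dir="stp-lean-lora",        # placeholder path
    num_train_epochs=1,
    per_device_train_batch_size=6,
    per_device_eval_batch_size=2,
    gradient_accumulation_steps=1,     # effective batch = 6
    learning_rate=2e-4,
    lr_scheduler_type="cosine",
    warmup_ratio=0.03,                 # 3 % warm-up
    weight_decay=0.01,
    optim="adamw_torch",               # AdamW
    bf16=True,                         # bf16 mixed precision
    gradient_checkpointing=True,
    logging_steps=50,
    eval_strategy="epoch",             # `evaluation_strategy` on older transformers
    save_strategy="epoch",
    seed=42,
)
```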

---

### Results (1 epoch, STP Lean corpus)

| Metric | Value |