End of training

Files changed (5)

README.md CHANGED

@@ -14,10 +14,10 @@ should probably proofread and complete it, then remove this comment. -->
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/pszemraj/eduscore-regression/runs/8e2uvp5t)
 # distilroberta-base-fineweb-edu-llama3-annotations-2048-vN
-This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on an unknown dataset.
+This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on the HuggingFaceFW/fineweb-edu-llama3-annotations dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2197
-- Mse: 0.2197
+- Loss: 0.2194
+- Mse: 0.2194
 ## Model description

all_results.json ADDED

+{
+    "epoch": 0.9999279383151978,
+    "eval_loss": 0.2193743884563446,
+    "eval_mse": 0.2193743917504471,
+    "eval_runtime": 3.5292,
+    "eval_samples": 1000,
+    "eval_samples_per_second": 283.348,
+    "eval_steps_per_second": 17.851,
+    "total_flos": 5.881871499303322e+16,
+    "train_loss": 0.2843814373497456,
+    "train_runtime": 1907.7343,
+    "train_samples": 444052,
+    "train_samples_per_second": 232.764,
+    "train_steps_per_second": 1.818
+}

eval_results.json ADDED

+{
+    "epoch": 0.9999279383151978,
+    "eval_loss": 0.2193743884563446,
+    "eval_mse": 0.2193743917504471,
+    "eval_runtime": 3.5292,
+    "eval_samples": 1000,
+    "eval_samples_per_second": 283.348,
+    "eval_steps_per_second": 17.851
+}
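
The evaluation numbers above can be sanity-checked with a few lines of Python. This is a minimal sketch, not part of the commit: it copies the values from eval_results.json, derives the RMSE from `eval_mse`, and recomputes throughput from `eval_samples` and `eval_runtime`. The variable names `rmse` and `throughput` are illustrative.

```python
import json
import math

# Values copied verbatim from eval_results.json in this commit.
eval_results = json.loads("""
{
    "epoch": 0.9999279383151978,
    "eval_loss": 0.2193743884563446,
    "eval_mse": 0.2193743917504471,
    "eval_runtime": 3.5292,
    "eval_samples": 1000,
    "eval_samples_per_second": 283.348,
    "eval_steps_per_second": 17.851
}
""")

# RMSE is the square root of the MSE, expressed in the units of the target score.
rmse = math.sqrt(eval_results["eval_mse"])

# Throughput recomputed from the sample count and the (rounded) runtime.
throughput = eval_results["eval_samples"] / eval_results["eval_runtime"]

print(f"RMSE: {rmse:.4f}")
print(f"eval samples/s (recomputed): {throughput:.2f}")
print(f"eval samples/s (reported):   {eval_results['eval_samples_per_second']}")
```

The recomputed throughput may differ slightly from the reported value because `eval_runtime` is rounded in the file. `eval_loss` and `eval_mse` agree to about eight decimal places, which is consistent with the loss being a mean squared error, although the commit does not state the loss function explicitly.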

train_results.json ADDED

+{
+    "epoch": 0.9999279383151978,
+    "total_flos": 5.881871499303322e+16,
+    "train_loss": 0.2843814373497456,
+    "train_runtime": 1907.7343,
+    "train_samples": 444052,
+    "train_samples_per_second": 232.764,
+    "train_steps_per_second": 1.818
+}
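
The training throughput figures also allow a rough estimate of quantities the commit does not state directly. The sketch below assumes that `train_steps_per_second` counts optimizer steps, so that the ratio of samples per second to steps per second approximates the effective batch size. Both derived values are estimates, not numbers taken from the training configuration.

```python
# Values copied verbatim from train_results.json in this commit.
train_results = {
    "epoch": 0.9999279383151978,
    "total_flos": 5.881871499303322e+16,
    "train_loss": 0.2843814373497456,
    "train_runtime": 1907.7343,
    "train_samples": 444052,
    "train_samples_per_second": 232.764,
    "train_steps_per_second": 1.818,
}

# Assumption: one "step" is one optimizer update, so this ratio is roughly
# the number of samples consumed per update (the effective batch size).
est_batch_size = (
    train_results["train_samples_per_second"]
    / train_results["train_steps_per_second"]
)

# Estimated number of steps over the whole run.
est_steps = train_results["train_steps_per_second"] * train_results["train_runtime"]

print(f"estimated effective batch size: {est_batch_size:.1f}")
print(f"estimated total steps: {est_steps:.0f}")
print(f"runtime in minutes: {train_results['train_runtime'] / 60:.1f}")
```

The exact step count and batch size should be read from trainer_state.json or the training arguments; the values here are only as precise as the rounded throughput fields allow.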

trainer_state.json ADDED

The diff for this file is too large to render. See raw diff