Update MODEL_SUMMARY.txt - Run 20251013_004438
weights/David-decoupled-deep_efficiency/20251013_004438/MODEL_SUMMARY.txt
ADDED
@@ -0,0 +1,62 @@
+
+╔══════════════════════════════════════════════════════════════╗
+║                     DAVID MODEL SUMMARY                      ║
+╠══════════════════════════════════════════════════════════════╣
+║                                                              ║
+║                 VALIDATION ACCURACY: 58.40%                  ║
+║                                                              ║
+╚══════════════════════════════════════════════════════════════╝
+
+MODEL: David-decoupled-deep_efficiency
+RUN ID: 20251013_004438
+BEST EPOCH: 1/10
+
+───────────────────────────────────────────────────────────────
+
+PERFORMANCE BREAKDOWN
+
+Final Training Accuracy: 51.66%
+Best Validation Accuracy: 58.40%
+
+Per-Scale Accuracies:
+  • Scale 128: 58.40%
+  • Scale 256: 67.03%
+  • Scale 384: 69.55%
+  • Scale 448: 70.34%
+  • Scale 512: 70.84%
+  • Scale 576: 71.29%
+  • Scale 640: 71.60%
+  • Scale 768: 72.03%
+  • Scale 896: 72.25%
+
+───────────────────────────────────────────────────────────────
+
+ARCHITECTURE
+
+Preset:       gated_expert_team
+Sharing Mode: decoupled
+Fusion Mode:  deep_efficiency
+Scales:       9 scales - [128, 256, 384, 448, 512, 576, 640, 768, 896]
+Feature Dim:  512
+Parameters:   22,133,801
+
+───────────────────────────────────────────────────────────────
+
+TRAINING CURVE
+
+Epoch | Train Acc | Val Acc  | Learning Rate
+------|-----------|----------|--------------
+  1   |  51.66%   |  58.40%  | 9.76e-03
+
+───────────────────────────────────────────────────────────────
+
+FILES
+
+Best Model:   best_model_acc58.40.safetensors
+Config:       david_config.json
+Training Cfg: train_config.json
+History:      training_history.json
+
+───────────────────────────────────────────────────────────────
+
+Generated: 2025-10-13 00:49:34
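
The Per-Scale Accuracies in the summary can be compared programmatically. The following is a minimal sketch, assuming the lines keep the `Scale N: X%` shape shown in this file. The helper name `parse_scale_accuracies` and the inline `SUMMARY_LINES` constant are illustrative and not part of the David codebase. It pulls out the nine values and reports the highest-scoring scale and the spread between the smallest and largest scale. In this run the reported Best Validation Accuracy (58.40%) has the same value as the Scale 128 entry.

```python
import re

# Per-scale lines copied from the summary; in practice they would be read
# from MODEL_SUMMARY.txt (assumption: the "Scale N: X%" format is stable).
SUMMARY_LINES = """\
Scale 128: 58.40%
Scale 256: 67.03%
Scale 384: 69.55%
Scale 448: 70.34%
Scale 512: 70.84%
Scale 576: 71.29%
Scale 640: 71.60%
Scale 768: 72.03%
Scale 896: 72.25%
"""

def parse_scale_accuracies(text):
    """Return {scale: accuracy_percent} for every 'Scale N: X%' match in text."""
    pattern = re.compile(r"Scale\s+(\d+):\s+([\d.]+)%")
    return {int(scale): float(acc) for scale, acc in pattern.findall(text)}

accuracies = parse_scale_accuracies(SUMMARY_LINES)

# Scale with the highest reported accuracy.
best_scale = max(accuracies, key=accuracies.get)

# Gap in percentage points between the largest and smallest scale.
gain = accuracies[max(accuracies)] - accuracies[min(accuracies)]

print(f"best scale: {best_scale} ({accuracies[best_scale]:.2f}%)")
print(f"gain from {min(accuracies)} to {max(accuracies)}: {gain:.2f} points")
```

Reading the values through a regular expression rather than fixed column offsets keeps the sketch tolerant of bullet characters or indentation in front of each line.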