Update README.md
Browse files
README.md
CHANGED
@@ -51,6 +51,8 @@ widget:
|
|
51 |
pipeline_tag: text-generation
|
52 |
datasets:
|
53 |
- BEE-spoke-data/UltraTextbooks-2.1-fw_mix
|
|
|
|
|
54 |
---
|
55 |
|
56 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -61,7 +63,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
61 |
|
62 |
## Model description
|
63 |
|
64 |
-
This
|
65 |
It achieves the following results on the evaluation set:
|
66 |
- Loss: 2.0787
|
67 |
- Accuracy: 0.5746
|
@@ -73,8 +75,8 @@ It achieves the following results on the evaluation set:
|
|
73 |
Quick eval for: pszemraj/mega-ar-350m-L3t-v0.08-ultraTBfw
|
74 |
|
75 |
|
76 |
-
bootstrapping for stddev: perplexity
|
77 |
hf (pretrained=pszemraj/mega-ar-350m-L3t-v0.08-ultraTBfw,trust_remote_code=True,dtype=float), gen_kwargs: (None), limit: 0.99999, num_fewshot: None, batch_size: 8
|
|
|
78 |
| Tasks |Version|Filter|n-shot| Metric | Value | |Stderr|
|
79 |
|--------------|------:|------|-----:|----------|------:|---|-----:|
|
80 |
|arc_easy | 1|none | 0|acc | 0.4246|± |0.0139|
|
|
|
51 |
pipeline_tag: text-generation
|
52 |
datasets:
|
53 |
- BEE-spoke-data/UltraTextbooks-2.1-fw_mix
|
54 |
+
language:
|
55 |
+
- en
|
56 |
---
|
57 |
|
58 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
63 |
|
64 |
## Model description
|
65 |
|
66 |
+
This is a pretraining experiment most recently trained on the BEE-spoke-data/UltraTextbooks-2.1-fw_mix dataset.
|
67 |
It achieves the following results on the evaluation set:
|
68 |
- Loss: 2.0787
|
69 |
- Accuracy: 0.5746
|
|
|
75 |
Quick eval for: pszemraj/mega-ar-350m-L3t-v0.08-ultraTBfw
|
76 |
|
77 |
|
|
|
78 |
hf (pretrained=pszemraj/mega-ar-350m-L3t-v0.08-ultraTBfw,trust_remote_code=True,dtype=float), gen_kwargs: (None), limit: 0.99999, num_fewshot: None, batch_size: 8
|
79 |
+
|
80 |
| Tasks |Version|Filter|n-shot| Metric | Value | |Stderr|
|
81 |
|--------------|------:|------|-----:|----------|------:|---|-----:|
|
82 |
|arc_easy | 1|none | 0|acc | 0.4246|± |0.0139|
|