BEE-spoke-data/mega-ar-350m-L3t-v0.08-ultraTBfw

Text Generation


pszemraj committed on May 8

Commit 123376b (1 parent: 6d0e532)

Update README.md

Files changed (1)

README.md +20 -7

@@ -56,7 +56,10 @@ datasets:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# mega-ar-350m-L3t-v0.07-cosmo_webmath_py-UltraTextbooks-2.1-fw_mix-vN
 This model is a fine-tuned version of [pszemraj/mega-ar-350m-L3t-v0.07-cosmo_webmath_py](https://hf.co/pszemraj/mega-ar-350m-L3t-v0.07-cosmo_webmath_py) on the BEE-spoke-data/UltraTextbooks-2.1-fw_mix dataset.
 It achieves the following results on the evaluation set:
@@ -64,17 +67,27 @@ It achieves the following results on the evaluation set:
 - Accuracy: 0.5746
 - Num Input Tokens Seen: 3492282368
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# mega-ar-350m-L3t-v0.08-ultraTBfw
+## Model description
 This model is a fine-tuned version of [pszemraj/mega-ar-350m-L3t-v0.07-cosmo_webmath_py](https://hf.co/pszemraj/mega-ar-350m-L3t-v0.07-cosmo_webmath_py) on the BEE-spoke-data/UltraTextbooks-2.1-fw_mix dataset.
 It achieves the following results on the evaluation set:
 - Accuracy: 0.5746
 - Num Input Tokens Seen: 3492282368
+## Quick eval
+Quick eval for:	pszemraj/mega-ar-350m-L3t-v0.08-ultraTBfw
+bootstrapping for stddev: perplexity
+hf (pretrained=pszemraj/mega-ar-350m-L3t-v0.08-ultraTBfw,trust_remote_code=True,dtype=float), gen_kwargs: (None), limit: 0.99999, num_fewshot: None, batch_size: 8
+|    Tasks     |Version|Filter|n-shot|  Metric  | Value |   |Stderr|
+|--------------|------:|------|-----:|----------|------:|---|-----:|
+|arc_easy      |      1|none  |     0|acc       | 0.4246|±  |0.0139|
+|              |       |none  |     0|acc_norm  | 0.4002|±  |0.0138|
+|boolq         |      2|none  |     0|acc       | 0.5762|±  |0.0139|
+|lambada_openai|      1|none  |     0|perplexity|76.7162|±  |6.3531|
+|              |       |none  |     0|acc       | 0.2605|±  |0.0123|
+|openbookqa    |      1|none  |     0|acc       | 0.1840|±  |0.0173|
+|              |       |none  |     0|acc_norm  | 0.2720|±  |0.0199|
+|piqa          |      1|none  |     0|acc       | 0.6377|±  |0.0135|
+|              |       |none  |     0|acc_norm  | 0.6172|±  |0.0137|
+|winogrande    |      1|none  |     0|acc       | 0.5020|±  |0.0141|
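
The `Stderr` column in the table can be turned into approximate 95% confidence intervals, which helps when judging how far a score sits from a reference value. The following is a minimal sketch, assuming a normal approximation (value ± 1.96 × stderr); the helper name `ci95` and the choice of rows are illustrative and not part of the commit.

```python
# Illustrative helper (not part of the model card): normal-approximation
# 95% confidence interval from a reported value and its standard error.
Z_95 = 1.96


def ci95(value: float, stderr: float) -> tuple[float, float]:
    """Return (low, high) bounds of value +/- 1.96 * stderr."""
    half_width = Z_95 * stderr
    return value - half_width, value + half_width


# (task, metric, value, stderr) copied from the "acc" rows of the Quick eval table.
rows = [
    ("arc_easy", "acc", 0.4246, 0.0139),
    ("boolq", "acc", 0.5762, 0.0139),
    ("lambada_openai", "acc", 0.2605, 0.0123),
    ("openbookqa", "acc", 0.1840, 0.0173),
    ("piqa", "acc", 0.6377, 0.0135),
    ("winogrande", "acc", 0.5020, 0.0141),
]

for task, metric, value, stderr in rows:
    low, high = ci95(value, stderr)
    print(f"{task:<15} {metric:<4} {value:.4f}  95% CI [{low:.4f}, {high:.4f}]")
```

Under this approximation the winogrande interval is roughly [0.474, 0.530], which spans the 0.5 chance level of a two-choice task.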
 ## Training procedure