Update README.md
README.md CHANGED

@@ -1,19 +1,17 @@
 ---
 license: apache-2.0
-base_model: pszemraj/mega-ar-350m-v0.12-napierone_epub
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
-model-index:
-- name: mega-ar-350m-v0.
-  results: []
+language:
+- en
 ---

-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->

-# mega-ar-350m-v0.
+# mega-ar-350m-v0.13
+
+## Model description

 This model is a fine-tuned version of [pszemraj/mega-ar-350m-v0.12-napierone_epub](https://huggingface.co/pszemraj/mega-ar-350m-v0.12-napierone_epub) on the BEE-spoke-data/UltraTextbooks-2.1-fw_mix dataset.
 It achieves the following results on the evaluation set:
@@ -21,17 +19,27 @@ It achieves the following results on the evaluation set:
 - Accuracy: 0.5885
 - Num Input Tokens Seen: 3468165120

-## Model description

-More information needed
+## Quick eval
+
+Quick eval for: pszemraj/mega-ar-350m-v0.13

-## Intended uses & limitations

-More information needed
+hf (pretrained=pszemraj/mega-ar-350m-v0.13,trust_remote_code=True,dtype=float), gen_kwargs: (None), limit: None, num_fewshot: None, batch_size: 8

-## Training and evaluation data
+|    Tasks     |Version|Filter|n-shot|  Metric  | Value |   |Stderr|
+|--------------|------:|------|-----:|----------|------:|---|-----:|
+|arc_easy      |      1|none  |     0|acc       | 0.4491|±  |0.0102|
+|              |       |none  |     0|acc_norm  | 0.4061|±  |0.0101|
+|boolq         |      2|none  |     0|acc       | 0.5367|±  |0.0087|
+|lambada_openai|      1|none  |     0|perplexity|55.3308|±  |2.3100|
+|              |       |none  |     0|acc       | 0.3113|±  |0.0065|
+|openbookqa    |      1|none  |     0|acc       | 0.1760|±  |0.0170|
+|              |       |none  |     0|acc_norm  | 0.2680|±  |0.0198|
+|piqa          |      1|none  |     0|acc       | 0.6366|±  |0.0112|
+|              |       |none  |     0|acc_norm  | 0.6213|±  |0.0113|
+|winogrande    |      1|none  |     0|acc       | 0.5036|±  |0.0141|

-More information needed

 ## Training procedure
@@ -85,4 +93,4 @@ The following hyperparameters were used during training:
 - Transformers 4.40.2
 - Pytorch 2.2.0+cu121
 - Datasets 2.19.1
-- Tokenizers 0.19.1
+- Tokenizers 0.19.1
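The updated card describes pszemraj/mega-ar-350m-v0.13, a ~350M-parameter causal language model fine-tuned from pszemraj/mega-ar-350m-v0.12-napierone_epub on BEE-spoke-data/UltraTextbooks-2.1-fw_mix. A minimal usage sketch, assuming the checkpoint loads through the standard `transformers` causal-LM classes with `trust_remote_code=True` (mirroring the eval banner in the diff); the prompt and decoding settings below are illustrative and not taken from the card:

```python
# Minimal sketch: load the checkpoint and generate a short continuation.
# Assumes standard AutoModelForCausalLM support; trust_remote_code mirrors the
# lm-eval banner in the card. Prompt and decoding settings are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "pszemraj/mega-ar-350m-v0.13"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

prompt = "The Industrial Revolution began"
inputs = tokenizer(prompt, return_tensors="pt")

outputs = model.generate(**inputs, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```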
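The `hf (pretrained=pszemraj/mega-ar-350m-v0.13,trust_remote_code=True,dtype=float)` banner and the results table above follow the output format of EleutherAI's lm-evaluation-harness. A hedged sketch of rerunning that zero-shot evaluation through the harness's Python API, with the task list read off the table; argument names and behavior vary across harness versions, so treat this as an approximation of the original run rather than a record of it:

```python
# Sketch: re-run the quick eval with lm-evaluation-harness (lm_eval).
# The model_args string mirrors the banner in the card; the task list is taken
# from the results table. API details may differ between harness versions.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=pszemraj/mega-ar-350m-v0.13,trust_remote_code=True,dtype=float",
    tasks=["arc_easy", "boolq", "lambada_openai", "openbookqa", "piqa", "winogrande"],
    batch_size=8,
)

# Per-task metrics (acc, acc_norm, perplexity, ...) keyed by task name.
for task, metrics in results["results"].items():
    print(task, metrics)
```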
|