pszemraj commited on
Commit
06065c2
1 Parent(s): 123376b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -51,6 +51,8 @@ widget:
51
  pipeline_tag: text-generation
52
  datasets:
53
  - BEE-spoke-data/UltraTextbooks-2.1-fw_mix
 
 
54
  ---
55
 
56
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -61,7 +63,7 @@ should probably proofread and complete it, then remove this comment. -->
61
 
62
  ## Model description
63
 
64
- This model is a fine-tuned version of [pszemraj/mega-ar-350m-L3t-v0.07-cosmo_webmath_py](https://hf.co/pszemraj/mega-ar-350m-L3t-v0.07-cosmo_webmath_py) on the BEE-spoke-data/UltraTextbooks-2.1-fw_mix dataset.
65
  It achieves the following results on the evaluation set:
66
  - Loss: 2.0787
67
  - Accuracy: 0.5746
@@ -73,8 +75,8 @@ It achieves the following results on the evaluation set:
73
  Quick eval for: pszemraj/mega-ar-350m-L3t-v0.08-ultraTBfw
74
 
75
 
76
- bootstrapping for stddev: perplexity
77
  hf (pretrained=pszemraj/mega-ar-350m-L3t-v0.08-ultraTBfw,trust_remote_code=True,dtype=float), gen_kwargs: (None), limit: 0.99999, num_fewshot: None, batch_size: 8
 
78
  | Tasks |Version|Filter|n-shot| Metric | Value | |Stderr|
79
  |--------------|------:|------|-----:|----------|------:|---|-----:|
80
  |arc_easy | 1|none | 0|acc | 0.4246|± |0.0139|
 
51
  pipeline_tag: text-generation
52
  datasets:
53
  - BEE-spoke-data/UltraTextbooks-2.1-fw_mix
54
+ language:
55
+ - en
56
  ---
57
 
58
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
63
 
64
  ## Model description
65
 
66
+ This is a pretraining experiment most recently trained on the BEE-spoke-data/UltraTextbooks-2.1-fw_mix dataset.
67
  It achieves the following results on the evaluation set:
68
  - Loss: 2.0787
69
  - Accuracy: 0.5746
 
75
  Quick eval for: pszemraj/mega-ar-350m-L3t-v0.08-ultraTBfw
76
 
77
 
 
78
  hf (pretrained=pszemraj/mega-ar-350m-L3t-v0.08-ultraTBfw,trust_remote_code=True,dtype=float), gen_kwargs: (None), limit: 0.99999, num_fewshot: None, batch_size: 8
79
+
80
  | Tasks |Version|Filter|n-shot| Metric | Value | |Stderr|
81
  |--------------|------:|------|-----:|----------|------:|---|-----:|
82
  |arc_easy | 1|none | 0|acc | 0.4246|± |0.0139|