Update README.md
README.md CHANGED
```diff
@@ -1,6 +1,5 @@
 ---
-license:
-library_name: peft
+license: apache-2.0
 tags:
 - llama-factory
 - lora
@@ -11,6 +10,10 @@ model-index:
 ---
 # UNAversal-2x7B-v1
 
+Merely Phase 1 UNA, MLPs only, and it's kind of a beta. The goal was to produce a small but powerful MoE.
+
+This is a two-expert MoE model, 7B per expert, based on the intel-neural series v3.
+
 | Tasks        |Version|Filter|n-shot| Metric   |Value |   |Stderr|
 |--------------|-------|------|-----:|----------|-----:|---|-----:|
 |arc_challenge |Yaml   |none  |    25|acc       |0.7133|±  |0.0132|
@@ -25,6 +28,4 @@ model-index:
 |piqa          |Yaml   |none  |     0|acc       |0.8411|±  |0.0085|
 |              |       |none  |     0|acc_norm  |0.8526|±  |0.0083|
 |sciq          |Yaml   |none  |     0|acc       |0.9600|±  |0.0062|
-|              |       |none  |     0|acc_norm  |0.9370|±  |0.0077|
-
-
+|              |       |none  |     0|acc_norm  |0.9370|±  |0.0077|
```
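Since the card describes a merged two-expert 7B MoE, here is a minimal loading sketch using Hugging Face `transformers`. The repo id is a hypothetical placeholder inferred from the card's title, not confirmed by the diff; substitute the real `org/name` from the model page.

```python
# Minimal loading sketch, assuming the card's model is published as a
# standard merged causal LM. The repo id is hypothetical, inferred from
# the model name on the card; substitute the real one.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "UNAversal-2x7B-v1"  # hypothetical repo id; replace with the real org/name

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision to fit the 2x7B MoE
    device_map="auto",          # let accelerate place layers across available GPUs
)

prompt = "Briefly explain what a mixture-of-experts model is."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The removal of `library_name: peft` in the diff suggests the uploaded weights are merged rather than a standalone LoRA adapter, which is why plain `AutoModelForCausalLM` loading is shown here.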
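The results table matches the output format of EleutherAI's lm-evaluation-harness (`Tasks`/`Version`/`Filter`/`n-shot` columns), so one row can in principle be re-run with it. A sketch for the 25-shot arc_challenge row, under the same hypothetical repo id as above:

```python
# Sketch of re-running one row of the table with EleutherAI's
# lm-evaluation-harness (pip install lm-eval). The pretrained id is the
# same hypothetical placeholder as in the loading example above.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=UNAversal-2x7B-v1,dtype=float16",
    tasks=["arc_challenge"],  # the table reports this task 25-shot
    num_fewshot=25,
)
# results["results"] holds acc / acc_norm and their stderrs per task
print(results["results"]["arc_challenge"])
```

The zero-shot rows (piqa, sciq, and the others elided by the hunk) would use `num_fewshot=0` with the corresponding task names.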