mighty-llama-1b / README.md

Single Pass

hf-causal-experimental (pretrained=openaccess-ai-collective/mighty-llama-1b,use_accelerate=True,dtype=bfloat16,trust_remote_code=True), limit: None, provide_description: False, num_fewshot: 0, batch_size: 32
|    Task     |Version| Metric |Value |   |Stderr|
|-------------|------:|--------|-----:|---|-----:|
|arc_challenge|      0|acc     |0.2355|±  |0.0124|
|             |       |acc_norm|0.2671|±  |0.0129|
|arc_easy     |      0|acc     |0.4444|±  |0.0102|
|             |       |acc_norm|0.4276|±  |0.0102|
|boolq        |      1|acc     |0.5358|±  |0.0087|
|hellaswag    |      0|acc     |0.3784|±  |0.0048|
|             |       |acc_norm|0.5034|±  |0.0050|
|openbookqa   |      0|acc     |0.1580|±  |0.0163|
|             |       |acc_norm|0.2840|±  |0.0202|
|piqa         |      0|acc     |0.6518|±  |0.0111|
|             |       |acc_norm|0.6464|±  |0.0112|
|winogrande   |      0|acc     |0.5422|±  |0.0140|
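
One way to read the Value and Stderr columns is as an approximate confidence interval. The sketch below is illustrative only: it copies the `acc` rows from the table above and assumes a normal approximation (value plus or minus 1.96 standard errors). The helper name `confidence_interval` is hypothetical and not part of the evaluation harness.

```python
# acc values and standard errors copied from the Single Pass table above.
single_pass_acc = {
    "arc_challenge": (0.2355, 0.0124),
    "arc_easy": (0.4444, 0.0102),
    "boolq": (0.5358, 0.0087),
    "hellaswag": (0.3784, 0.0048),
    "openbookqa": (0.1580, 0.0163),
    "piqa": (0.6518, 0.0111),
    "winogrande": (0.5422, 0.0140),
}


def confidence_interval(value, stderr, z=1.96):
    """Hypothetical helper: normal-approximation interval around a score."""
    return value - z * stderr, value + z * stderr


for task, (value, stderr) in single_pass_acc.items():
    low, high = confidence_interval(value, stderr)
    print(f"{task:<14} acc={value:.4f}  95% CI=[{low:.4f}, {high:.4f}]")
```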

16x Passes

hf-causal-experimental (pretrained=openaccess-ai-collective/mighty-llama-1b,use_accelerate=True,dtype=bfloat16,trust_remote_code=True), limit: None, provide_description: False, num_fewshot: 0, batch_size: 32
|    Task     |Version| Metric |Value |   |Stderr|
|-------------|------:|--------|-----:|---|-----:|
|arc_challenge|      0|acc     |0.2287|±  |0.0123|
|             |       |acc_norm|0.2654|±  |0.0129|
|arc_easy     |      0|acc     |0.4785|±  |0.0103|
|             |       |acc_norm|0.4390|±  |0.0102|
|boolq        |      1|acc     |0.6076|±  |0.0085|
|hellaswag    |      0|acc     |0.3789|±  |0.0048|
|             |       |acc_norm|0.4877|±  |0.0050|
|openbookqa   |      0|acc     |0.1760|±  |0.0170|
|             |       |acc_norm|0.2800|±  |0.0201|
|piqa         |      0|acc     |0.6572|±  |0.0111|
|             |       |acc_norm|0.6589|±  |0.0111|
|winogrande   |      0|acc     |0.5541|±  |0.0140|
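
To compare the two runs, a rough sketch is to divide each change in `acc` by the combined standard error. This assumes the two standard errors can be treated as independent, which is only an approximation because both runs score the same test items. The function name `compare` and the 1.96 threshold are illustrative choices, not something the evaluation harness reports.

```python
import math

# (single-pass acc, stderr, 16x-passes acc, stderr), copied from the two tables.
results = {
    "arc_challenge": (0.2355, 0.0124, 0.2287, 0.0123),
    "arc_easy": (0.4444, 0.0102, 0.4785, 0.0103),
    "boolq": (0.5358, 0.0087, 0.6076, 0.0085),
    "hellaswag": (0.3784, 0.0048, 0.3789, 0.0048),
    "openbookqa": (0.1580, 0.0163, 0.1760, 0.0170),
    "piqa": (0.6518, 0.0111, 0.6572, 0.0111),
    "winogrande": (0.5422, 0.0140, 0.5541, 0.0140),
}


def compare(single, single_se, multi, multi_se):
    """Hypothetical helper: change in acc and its ratio to the combined stderr."""
    delta = multi - single
    combined_se = math.sqrt(single_se**2 + multi_se**2)
    return delta, delta / combined_se


# Tasks whose change exceeds 1.96 combined standard errors in either direction.
notable = set()
for task, row in results.items():
    delta, z = compare(*row)
    if abs(z) > 1.96:
        notable.add(task)
    print(f"{task:<14} delta={delta:+.4f}  z={z:+.2f}")
```

Under these assumptions, only `arc_easy` and `boolq` change by more than 1.96 combined standard errors; the other tasks move by less than one.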