mighty-llama-1b / README.md
winglian's picture
Update README.md
cc30631
|
raw
history blame
No virus
1.9 kB
### Single Pass
```
hf-causal-experimental (pretrained=openaccess-ai-collective/mighty-llama-1b,use_accelerate=True,dtype=bfloat16,trust_remote_code=True), limit: None, provide_description: False, num_fewshot: 0, batch_size: 32
| Task |Version| Metric |Value | |Stderr|
|-------------|------:|--------|-----:|---|-----:|
|arc_challenge| 0|acc |0.2355|_ |0.0124|
| | |acc_norm|0.2671|_ |0.0129|
|arc_easy | 0|acc |0.4444|_ |0.0102|
| | |acc_norm|0.4276|_ |0.0102|
|boolq | 1|acc |0.5358|_ |0.0087|
|hellaswag | 0|acc |0.3784|_ |0.0048|
| | |acc_norm|0.5034|_ |0.0050|
|openbookqa | 0|acc |0.1580|_ |0.0163|
| | |acc_norm|0.2840|_ |0.0202|
|piqa | 0|acc |0.6518|_ |0.0111|
| | |acc_norm|0.6464|_ |0.0112|
|winogrande | 0|acc |0.5422|_ |0.0140|
```
### 16x Passees
```
hf-causal-experimental (pretrained=openaccess-ai-collective/mighty-llama-1b,use_accelerate=True,dtype=bfloat16,trust_remote_code=True), limit: None, provide_description: False, num_fewshot: 0, batch_size: 32
| Task |Version| Metric |Value | |Stderr|
|-------------|------:|--------|-----:|---|-----:|
|arc_challenge| 0|acc |0.2287|_ |0.0123|
| | |acc_norm|0.2654|_ |0.0129|
|arc_easy | 0|acc |0.4785|_ |0.0103|
| | |acc_norm|0.4390|_ |0.0102|
|boolq | 1|acc |0.6076|_ |0.0085|
|hellaswag | 0|acc |0.3789|_ |0.0048|
| | |acc_norm|0.4877|_ |0.0050|
|openbookqa | 0|acc |0.1760|_ |0.0170|
| | |acc_norm|0.2800|_ |0.0201|
|piqa | 0|acc |0.6572|_ |0.0111|
| | |acc_norm|0.6589|_ |0.0111|
|winogrande | 0|acc |0.5541|_ |0.0140|
```