File size: 1,642 Bytes
654a627
 
 
 
 
e2295f8
 
3c1a3c3
 
 
 
 
 
 
 
 
 
e2295f8
1a3de10
 
 
 
 
 
 
e2295f8
 
 
 
 
654a627
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
---
library_name: transformers
tags: []
---

# Mula 4x160m

| Step   | Evaluation Loss | Perplexity |
|--------|-----------------|------------|
| 50000  | 3.03            | 20.73      |
| 100000 | 2.84            | 17.14      |
| 150000 | 2.73            | 15.35      |
| 200000 | 2.64            | 14.05      |
| 250000 | 2.56            | 12.95      | 
| 300000 | 2.49            | 12.14      |
| 350000 | 2.46            | 11.75      | 
| 400000 | 2.46            | 11.72      |

|                  | **ARC**   | **HellaSwag** | **MMLU**  | **TruthfulQA** | **Average** |
|------------------|-----------|---------------|-----------|----------------|-------------|
| **TTL-460m**     | 29.40     | 33.00         | 28.55     | 41.10          | 33.01       |
| **TTL-160m**     | 26.15     | 29.29         | 28.11     | 41.12          | 31.16       |
| **Mula-4x160m**  | 27.09     | 31.41         | 28.15     | 39.81          | 31.61       |


|                   | **ASSIN2 RTE** | **ASSIN2 STS** | **BLUEX** | **ENEM** | **FAQUAD NLI** | **HateBR** | **OAB Exams** | **Average** |
|-------------------|----------------|----------------|-----------|----------|----------------|------------|---------------|-------------|
| **TTL-460m**      | 53.93          | 12.66          | 22.81     | 19.87    | 49.01          | 33.59      | 27.06         | 31.27       |
| **TTL-160m**      | 53.36          | 2.58           | 21.84     | 18.75    | 43.97          | 36.88      | 22.60         | 28.56       |
| **Mula-4x160m**   | 33.55          | 8.88           | 20.58     | 20.08    | 43.97          | 33.65      | 22.92         | 26.23       |