license: apache-2.0
| Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss |
| --- | --- | --- | --- | --- | --- | --- |
| GerbilLab/Gerbil-B-3.3m | 3.3m | B-Class | 42 | 126M | 65.5k | 6.0822 |