File size: 1,253 Bytes
6ec428f
 
 
 
 
 
3701452
6ec428f
 
f5a6577
6ec428f
3701452
 
 
 
6ec428f
3701452
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
f5a6577
 
 
 
 
 
 
3701452
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
---
language: 
  - es
tags:
- text-generation
- bloom
license: "cc-by-sa-4.0"
---

# Bloom 1b1 for Spanish text generation

This model is a fine-tuned version of [bigscience/bloom-1b1](https://huggingface.co/bigscience/bloom-1b1) on Spanish datasets.
It achieves the following results on the evaluation set:
- Loss: 2.340
  
Model under development. Use with caution.


### Dataset Summary

 Model trained with [Large Spanish Corpus](https://huggingface.co/datasets/large_spanish_corpus) and a Spanish books corpus crawled from web and torrents.

### Preprocessing

Preprocessing performed by [spanish_nlp](https://github.com/jorgeortizfuentes/spanish_nlp).

### Licensing Information

The dataset is available under the [Creative Commons Attribution-ShareAlike License (CC BY-SA 4.0)](https://creativecommons.org/licenses/by-sa/4.0/).

Some books may be subject to copyright. Use for academic purposes only.

### Citation Information

```
@misc {jorge_ortiz_fuentes_2023,
	author       = { {Jorge Ortiz Fuentes} },
	title        = { Bloom 1b1 for Spanish text generation },
	year         = 2023,
	url          = { https://huggingface.co/jorgeortizfuentes/bloom-1b1-spanish },
	doi          = { 10.57967/hf/0247 },
	publisher    = { Hugging Face }
}
```