malteos commited on
Commit
c0c608e
1 Parent(s): 8a80532

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: bigscience-bloom-rail-1.0
3
+ datasets:
4
+ - oscar
5
+ language:
6
+ - de
7
+ library_name: transformers
8
+ pipeline_tag: text-generation
9
+ ---
10
+
11
+ # BLOOM-CLP German (6.4B parameters)
12
+
13
+ This is a monolingual German language model trained using the [CLP-Transfer](https://arxiv.org/abs/2301.09626) method based on [BLOOM-7b1](https://huggingface.co/bigscience/bloom-7b1).
14
+
15
+ You can try out the model at [European Language Grid](https://live.european-language-grid.eu/catalogue/tool-service/20825/try%20out/).
16
+
17
+ ## Training dataset
18
+
19
+ - ca. 50B German tokens
20
+ - Web-crawled content from the German subset [OSCAR v22.01](https://oscar-corpus.com/post/oscar-v22-01/) (excluding content tagged as header, footer, noisy, or adult)
21
+ - Web-crawled content from the [GC4 Corpus](https://german-nlp-group.github.io/projects/gc4-corpus.html) (including only the head and middle parts)
22
+ - German court decisions from [Open Legal Data](http://openlegaldata.io/)
23
+
24
+ ## Code
25
+
26
+ - [BigScience's Megatron-Deepspeed fork](https://github.com/bigscience-workshop/Megatron-DeepSpeed)
27
+
28
+ ## Hardware
29
+
30
+ - 32xA100-40GB GPUs
31
+
32
+ ## Evaluation
33
+
34
+ TBA (see paper)