slippylolo commited on
Commit
1a8b638
1 Parent(s): d3685ba

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +51 -0
README.md ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - tiiuae/falcon-refinedweb
4
+ language:
5
+ - en
6
+ ---
7
+
8
+ # Falcon-RW-1B
9
+
10
+ **Falcon-RW-1B is a 1B parameters causal decoder-only model built by [TII](https://www.tii.ae) and trained on 350B tokens of [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb). It is made available under the [TII Falcon LLM License](https://huggingface.co/tiiuae/falcon-rw-1b/blob/main/LICENSE.txt).**
11
+
12
+ RefinedWeb is a high-quality web dataset built by leveraging stringent filtering and large-scale deduplication. Falcon-RW-1B, trained on RefinedWeb only, matches or outperforms comparable models trained on curated data.
13
+
14
+ This model is intended for use as a research artifact, to study the influence of training on appropriately filtered web data alone.
15
+
16
+
17
+ # Model Card for Falcon-RW-1B
18
+
19
+
20
+ ## Model Details
21
+
22
+ ### Model Description
23
+
24
+ - **Developed by:** [https://www.tii.ae](https://www.tii.ae)
25
+ - **Model type:** Causal decoder-only
26
+ - **Language(s) (NLP):** English
27
+ - **License:** TII Falcon LLM License
28
+
29
+ ### Model Source
30
+
31
+ - **Paper:** coming soon
32
+ - **Demo:** coming soon
33
+
34
+ ## Uses
35
+
36
+ ### Direct Use
37
+
38
+ Research on large language models, and the influence of adequately filtered and deduplicated web data on the properties of large language models (fairness, safety, limitations, capabilities, etc.).
39
+
40
+ ### Out-of-Scope Use
41
+
42
+ Production use without adequate assessment of risks and mitigation; any use cases which may be considered irresponsible or harmful
43
+
44
+ ## Bias, Risks, and Limitations
45
+
46
+ Falcon-RW models are trained on English data only, and will not generalize appropriately to other languages. Furthermore, as they are trained on a large-scale corpora representative of the web, they will carry the stereotypes and biases commonly encountered online
47
+
48
+
49
+ ## Paper
50
+
51
+ More details coming soon in the paper.