loubnabnl HF staff commited on
Commit
8a43dba
1 Parent(s): dece6db
Files changed (1) hide show
  1. datasets/incoder.txt +1 -1
datasets/incoder.txt CHANGED
@@ -1,4 +1,4 @@
1
- [InCoder](https://huggingface.co/facebook/incoder-6B) was trained on trained on **216 GB** of data from Github and Stackoverflow from 28 programming languages. 52 GB is in Python, 107GB in other programming languages and 57GB is content from Stackoverflow that isn't code.
2
 
3
  The Github data used the following filtering:
4
  - Average line length < 100
 
1
+ [InCoder](https://huggingface.co/facebook/incoder-6B) was trained on **216 GB** of data from Github and Stackoverflow from 28 programming languages. 52 GB is in Python, 107GB in other programming languages and 57GB is content from Stackoverflow that isn't code.
2
 
3
  The Github data used the following filtering:
4
  - Average line length < 100