fineweb-1B / README.md
ddh0's picture
Create README.md
443baae verified

Sample of ~1B tokens from fineweb 15T, tokenized with custom Llama 3.2 1B tokenizer. For personal use