A sample of ~1B tokens from the FineWeb 15T-token dataset, tokenized with a custom Llama 3.2 1B tokenizer. For personal use.
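
How the sample was actually drawn is not documented here. As a rough illustration only, one way to cut a corpus down to a token budget is to tokenize documents one at a time and stop once the running count reaches the target. The sketch below assumes that approach; the function name `sample_to_token_budget` and the whitespace-based `toy_tokenize` are hypothetical stand-ins, not the code or tokenizer used for this dataset.

```python
from typing import Callable, Iterable, Iterator, List


def sample_to_token_budget(
    docs: Iterable[str],
    tokenize: Callable[[str], List[int]],
    budget: int,
) -> Iterator[List[int]]:
    """Yield tokenized documents until at least `budget` tokens were emitted.

    Hypothetical helper: the last document is kept whole, so the total can
    overshoot the budget by up to one document's length.
    """
    total = 0
    for text in docs:
        if total >= budget:
            break
        ids = tokenize(text)
        total += len(ids)
        yield ids


def toy_tokenize(text: str) -> List[int]:
    """Stand-in tokenizer (assumption): one fake id per whitespace-split word."""
    return [len(word) for word in text.split()]


if __name__ == "__main__":
    docs = ["a small web page", "another document here", "never reached"]
    sample = list(sample_to_token_budget(docs, toy_tokenize, budget=5))
    print(len(sample), sum(len(ids) for ids in sample))
```

In a real run, `docs` would presumably be a streamed iterator over FineWeb text and `tokenize` the encode method of the custom tokenizer, with `budget` set to about 1,000,000,000. Those details are assumptions and are not confirmed by this description.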