Submitted by Ruben Härle 16 KletterMix: Climbing Toward High-Quality German Pretraining Data Artificial Intelligence & Machine Learning Lab at TU Darmstadt 6