Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published Nov 11, 2024 • 36
Running 100 100 TxT360: Trillion Extracted Text 📖 Create a large, deduplicated dataset for LLM pre-training
hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4 Text Generation • Updated Sep 13, 2024 • 3.54k • 37
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8, 2024 • 158