dreamorg/Meta-Llama-3.1-70B-Instruct_1220_llama3.3_bright_12k_v3_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 18.2k • 160
dreamorg/Meta-Llama-3.1-70B-Instruct_1220_llama3.3_bright_12k_v3_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 18.2k • 160
dreamorg/Meta-Llama-3.1-70B-Instruct_1130_bright_original_query_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 12.2k • 126
dreamorg/Meta-Llama-3.1-70B-Instruct_1130_bright_original_query_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 12.2k • 126
dreamorg/Meta-Llama-3.1-70B-Instruct_1206_bright_12k_v6_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 11.6k • 156
dreamorg/Meta-Llama-3.1-70B-Instruct_1206_bright_12k_v6_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 11.6k • 156
dreamorg/Meta-Llama-3.1-70B-Instruct_1202_bright_120k_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 111k • 116
dreamorg/Meta-Llama-3.1-70B-Instruct_1202_bright_120k_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 111k • 116
dreamorg/Meta-Llama-3.1-70B-Instruct_1129_bright_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 12.2k • 124
dreamorg/Meta-Llama-3.1-70B-Instruct_1129_bright_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 12.2k • 124
dreamorg/Meta-Llama-3.1-70B-Instruct_1217_msmacro_v6_2k_bright_v3_12k_reasoning_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 13.9k • 135
dreamorg/Meta-Llama-3.1-70B-Instruct_1217_msmacro_v6_2k_bright_v3_12k_reasoning_seed_doc_pos_random_neg Viewer • Updated 29 days ago • 13.9k • 135
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore Paper • 2407.12854 • Published Jul 9, 2024 • 31
Language models scale reliably with over-training and on downstream tasks Paper • 2403.08540 • Published Mar 13, 2024 • 15