iamtarun/python_code_instructions_18k_alpaca Viewer • Updated Jul 27, 2023 • 18.6k • 5.93k • 205
Malikeh1375/medical-question-answering-datasets Viewer • Updated Nov 2, 2023 • 1.26M • 481 • 25
llm-wizard/dolly-15k-instruction-alpaca-format Viewer • Updated Apr 13, 2023 • 15k • 680 • 25
Telugu-LLM-Labs/marathi_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 2 • 1
Vanessasml/cybersecurity_32k_instruction_input_output Viewer • Updated Apr 19 • 32.6k • 21 • 10
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0__OCR-C25-L25-E25-R05 Viewer • Updated Nov 29, 2023 • 10.1M • 4
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_3 Viewer • Updated Mar 25 • 40k • 3
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_3_16 Viewer • Updated Mar 26 • 20k • 3
akbargherbal/six_millions_instruction_dataset_for_arabic_llm_ft Viewer • Updated May 20 • 6.37M • 15
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_xlarge__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 1M • 4
Mitsuki-Sakamoto/alpaca_farm-alpaca_instructions_gen_eval_sft Viewer • Updated Mar 7 • 1.2k • 7
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_1 Viewer • Updated Mar 25 • 189k • 2
y1xing/natural_language_prompt_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 276 • 2
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_2_16 Viewer • Updated Mar 26 • 20k • 3
phyloforfun/HLT_MICH_Angiospermae_SLTPvC_v1-0_medium_OCR-C25-L25-E50-R05 Viewer • Updated Mar 15 • 10k • 2 • 1
somosnlp-hackathon-2023/ask2democracy-cfqa-salud-pension Viewer • Updated Apr 11, 2023 • 3.81k • 2 • 3
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 26 • 94.6k • 2
y1xing/orpo_llama3_concatenated_data_with_chris_examples_orpo_instruct_dataset Viewer • Updated Jul 6 • 2.64k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_2 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 24 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_3 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2 Viewer • Updated Mar 7 • 60k • 4
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2_random Viewer • Updated Mar 10 • 60k • 4
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-8_random Viewer • Updated Mar 10 • 60k • 4
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_2 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_3 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 24 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 24 • 511k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 24 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_3 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.5 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_2 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_3 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_3 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 26 • 94.6k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.5 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.25 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.7 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_4 Viewer • Updated Apr 26 • 303k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_4 Viewer • Updated Apr 26 • 303k • 2
vinhtran2611/ArtifactAI_arxiv-physics-instruct-tune-30k_formated Viewer • Updated Jun 7 • 30.2k • 2
vinhtran2611/arxiv-physics-instruct-tune-30k_filtered_formated Viewer • Updated Jun 17 • 324 • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.2_self_70m Viewer • Updated Mar 14 • 37.9k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_1 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.25 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.9 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.9 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.3 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.5 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 2
Telugu-LLM-Labs/assamese_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 2 • 1
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_tiny__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 100 • 4
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.2_self_160m Viewer • Updated Mar 14 • 37.9k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_0.1_self_160m Viewer • Updated Mar 21 • 37.9k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_1 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.3_seed_3 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_1 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_1.0_seed_2 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_2 Viewer • Updated Mar 24 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 24 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 24 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alfa-deberta-re-pref-64-fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 26 • 94.6k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.9 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.9 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3 Viewer • Updated Apr 26 • 303k • 3
gogo8232/experiment_perplexity_instruction_llama3_8b_response Viewer • Updated Jul 5 • 34.9k • 2
oliverwang15/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Jul 11, 2023 • 67.2k • 2 • 9
lucasmccabe-lmi/sql-create-context_alpaca_style Viewer • Updated May 15, 2023 • 78.6k • 2 • 5
japneets/Alpaca_instruction_fine_tune_Punjabi_small Viewer • Updated Apr 16, 2023 • 10k • 2 • 1
filopedraz/swedish-sentiment-instruction-fine-tuning Viewer • Updated Jun 13, 2023 • 164k • 2 • 1
anton96vice/samantha-1.1-uncensored-split-and-prepared Viewer • Updated Mar 7 • 2.04k • 3 • 1
Telugu-LLM-Labs/konkani_alpaca_yahma_cleaned_filtered Viewer • Updated Mar 14 • 28.9k • 6 • 1
Hadnet/olavo-article-17k-llama2-chat-dataset-text Viewer • Updated Sep 25, 2023 • 17.4k • 4 • 1
UMCU/WikiDocPatientInformation_Dutch_translated_with_MariaNMT Viewer • Updated Jan 22 • 5.76k • 4
Cesar7980/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Nov 8, 2023 • 76.8k • 4
rodrfons/fingpt_chatglm2_sentiment_instruction_lora_ft_dataset Viewer • Updated Nov 18, 2023 • 76.8k • 13
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1.0_OCR-C25-L25-E50-R10 Viewer • Updated Nov 29, 2023 • 230 • 4
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 10.1M • 4
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_tiny__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 87 • 4
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_large__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 100k • 4
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_medium__OCR-C25-L25-E50-R05 Viewer • Updated Nov 30, 2023 • 10k • 4
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_large__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 100k • 4
mfmezger/sandboxai_german_to_english_translations_seperated Viewer • Updated Feb 15 • 1.35M • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.5_self_160m Viewer • Updated Mar 14 • 37.9k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_0.3_self_160m Viewer • Updated Mar 21 • 37.9k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-12_filter_gold_thr_1.0_self_160m Viewer • Updated Mar 21 • 18.9k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.1_seed_1 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_1.0_seed_1 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_2 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 24 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_2 Viewer • Updated Mar 24 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.1_seed_3 Viewer • Updated Mar 24 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_3 Viewer • Updated Mar 24 • 189k • 3
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_0.3_seed_3 Viewer • Updated Mar 24 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.3_seed_1 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_1 Viewer • Updated Mar 25 • 40k • 3
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_2_t_1.0 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.75 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0_eval Viewer • Updated Mar 28 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1_t_1.0_eval Viewer • Updated Mar 29 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2_t_1.0_eval Viewer • Updated Mar 29 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 2
thusinh1969/llama-2-7b-LongContext-mixed-64k-30APRIL2024 Viewer • Updated May 1 • 81.8k • 2 • 1
HachiML/oasst1_for_self-rewarding_EFT_Mixtral-8x22B-Instruct Viewer • Updated May 29 • 5.24k • 2
murugeshmarvel/a5d87d8c1326b4f0c531065dbe7f5068a2bab8a56edc9a9d4aab95be427bb171 Viewer • Updated Jun 5 • 95k • 2
generative-technologies/synth-ehr-icd10-llama3-format Viewer • Updated Jun 23 • 379k • 21 • 1
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_small__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 1.01k • 4
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_medium__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 10k • 4
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_full__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 1.42M • 4
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 14 • 37.9k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.1_self_160m Viewer • Updated Mar 14 • 37.9k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_1.0_seed_3 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 23 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 24 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_410m_thr_1.0_seed_1 Viewer • Updated Mar 24 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_1 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_410m_thr_1.0_seed_2 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_2 Viewer • Updated Mar 25 • 40k • 3
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_14m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0 Viewer • Updated Apr 19 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.9 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_0.5 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.5 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo2_100_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.1 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.3 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.1 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.7 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_tp_0.9 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_1.0_eval Viewer • Updated Mar 28 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-RM-Mistral-7B-re-preference-256-nsample-2 Viewer • Updated Apr 15 • 20k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1 Viewer • Updated Apr 26 • 303k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_5 Viewer • Updated Apr 26 • 303k • 2
phyloforfun/HLT_MICH_Angiospermae_SLTPvA_v1-0_medium__OCR-C35-L35-E100-R01 Viewer • Updated Nov 30, 2023 • 10k • 4
phyloforfun/HLT_Kew_WCVP_SLTPvA_v1-0_small__T20-OCR-C25-L25-E50-R10 Viewer • Updated Dec 1, 2023 • 1k • 4
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 14 • 37.9k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_2 Viewer • Updated Mar 21 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.3_seed_1 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_2 Viewer • Updated Mar 22 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo16_2_64_mix_50_kl_0.1_prm_160m_thr_0.1_seed_3 Viewer • Updated Mar 24 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.1_seed_1 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-_fil_self_1.4b_bo2_100_kl_0.1_prm_160m_thr_0.3_seed_3 Viewer • Updated Mar 25 • 189k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_1.0 Viewer • Updated Apr 19 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_3_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.75 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.3 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_tp_0.5 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_1.0_seed_1_t_1.0_eval Viewer • Updated Mar 30 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.5_seed_2_t_1.0_eval Viewer • Updated Mar 30 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.3_seed_3_t_1.0_eval Viewer • Updated Mar 30 • 568k • 2
y1xing/natural_language_prompt_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 435 • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_160m_thr_0.0_seed_1_t_1.0 Viewer • Updated Mar 25 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_1_16 Viewer • Updated Mar 26 • 20k • 3
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_1_t_0.75 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_3_t_0.25 Viewer • Updated Mar 26 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.1 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_tp_0.7 Viewer • Updated Mar 27 • 568k • 2
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0_eval Viewer • Updated Mar 28 • 568k • 2
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2 Viewer • Updated Apr 26 • 303k • 2
y1xing/llama3_concatenated_data_with_chris_examples_orpo_instruct_dataset Viewer • Updated Jul 6 • 2.64k • 2
y1xing/llama_chris_examples_generated_synthetic_data_instruct_dataset Viewer • Updated Jul 13 • 1.85k • 2
y1xing/partially_correct_llama_all_synthetic_data_instruct_dataset Viewer • Updated Jul 14 • 1.53k • 2
y1xing/llama_all_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 14 • 435 • 2
Mitsuki-Sakamoto/alpaca_farm-alpaca_gpt4_preference-re-preference_eval Viewer • Updated Jan 15 • 197k • 2
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-eval-preference Viewer • Updated Feb 5 • 2k • 1
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-are-preference-256 Viewer • Updated Mar 1 • 22k • 1
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-test Viewer • Updated Apr 19 • 40 • 1
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-4 Updated Mar 6 • 1
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-8 Viewer • Updated Mar 6 • 20k • 1
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-256-nsample-16 Viewer • Updated Mar 7 • 20k • 1
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-16_random Viewer • Updated Mar 10 • 60k • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.2_self_70m Viewer • Updated Mar 15 • 37.9k • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 18 • 189k • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 18 • 189k • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.1_self_160m Updated Mar 21 • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.5_self_160m Updated Mar 18 • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.2_self_160m Viewer • Updated Mar 15 • 37.9k • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.0_self_70m Viewer • Updated Mar 18 • 189k • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.0_self_160m Viewer • Updated Mar 18 • 189k • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.5_self_70m Viewer • Updated Mar 19 • 189k • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.1_self_70m Viewer • Updated Mar 19 • 189k • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.0_self_70m Viewer • Updated Mar 19 • 189k • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.1_self_160m Updated Mar 19 • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.5_self_160m Updated Mar 19 • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-2_iso_filter_gold_thr_0.0_self_160m Updated Mar 19 • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_0.3_self_160m Updated Mar 21 • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-preference-64-nsample-16_filter_gold_thr_1.0_self_160m Updated Mar 21 • 1
Mitsuki-Sakamoto/alpaca_farm-deberta-re-pref-64-fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.0_seed_2_t_1.0 Viewer • Updated Apr 19 • 568k • 1
Mitsuki-Sakamoto/fil_self_160m_bo16_2_mix_50_kl_0.1_prm_70m_thr_0.1_seed_3_t_1.0_eval Viewer • Updated Mar 29 • 568k • 1
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_4 Viewer • Updated Apr 25 • 40k • 1
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_5 Viewer • Updated Apr 25 • 40k • 1
Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-test-alpaca-gen Viewer • Updated May 12 • 20 • 1
Karmukilan/Malikeh1375_medical-question-answering-datasets Viewer • Updated Jul 16 • 1k • 6 • 1
y1xing/natural_language_prompt_w_correct_ans_dataset_evaluation_instruct_dataset Viewer • Updated Jul 26 • 276 • 2
y1xing/natural_language_prompt_w_correct_ans_synthetic_dataset_evaluation_instruct_dataset Viewer • Updated Jul 26 • 435 • 2
y1xing/natural_language_prompt_w_correct_ans_dataset_json_evaluation_instruct_dataset Viewer • Updated Jul 29 • 276 • 2
y1xing/natural_language_prompt_w_correct_ans_synthetic_dataset_evaluation_json_instruct_dataset Viewer • Updated Jul 29 • 435 • 2
y1xing/natural_language_prompt_w_correct_ans_dataset_training_instruct_dataset Viewer • Updated Jul 30 • 2.99k • 2
UMCU/MedicalFlashCards_Dutch_translated_with_MariaNMT Viewer • Updated Oct 31, 2023 • 32.9k • 4
Mitsuki-Sakamoto/sft_alpaca_pythia-1.4b-use_response_template-deberta-v3 Viewer • Updated Aug 1 • 20k • 1
Mitsuki-Sakamoto/sft_alpaca_pythia-160m-use_response_template-deberta-v3 Viewer • Updated Aug 1 • 20k • 1
purulalwani/Synthetic-Financial-Datasets-For-Fraud-Detection-Cleaned Viewer • Updated Aug 8 • 6.36M • 7
purulalwani/Synthetic-Financial-Datasets-For-Fraud-Detection-Cleaned-Split Viewer • Updated Aug 8 • 6.36M • 4
louisbrulenaudet/code-pensions-civiles-militaires-retraite Viewer • Updated about 8 hours ago • 257 • 6
louisbrulenaudet/code-disciplinaire-penal-marine-marchande Viewer • Updated about 8 hours ago • 6 • 6
louisbrulenaudet/code-domaine-public-fluvial-navigation-interieure Viewer • Updated about 8 hours ago • 2 • 6
louisbrulenaudet/code-domaine-etat-collectivites-mayotte Viewer • Updated about 8 hours ago • 3 • 6
louisbrulenaudet/code-legion-honneur-medaille-militaire-ordre-national-merite Viewer • Updated about 8 hours ago • 220 • 6
louisbrulenaudet/code-propriete-personnes-publiques Viewer • Updated about 8 hours ago • 1.13k • 6
louisbrulenaudet/code-postes-communications-electroniques Viewer • Updated about 8 hours ago • 729 • 8
louisbrulenaudet/code-instruments-monetaires-medailles Viewer • Updated about 8 hours ago • 6 • 6
Epic3123/election_misinformation_sleeper_agents_dataset_llama27b Viewer • Updated Aug 29 • 733 • 2
DLI-Lab/world_model_for_wa_desc_with_tao_formatted_w_cot Viewer • Updated about 1 month ago • 14.7k • 90
FoxySapiens/teknofest-egitim-hukuk-tarim-surdurulebilirlik-dataset Viewer • Updated 28 days ago • 233k • 166
DLI-Lab/Mind2Web-cleaned-lite-desc-with-tao-formatted Viewer • Updated 20 days ago • 6.13k • 20
DLI-Lab/Mind2Web-cleaned-lite-value-model-w-cot-formatted Viewer • Updated 20 days ago • 6.13k • 32
DLI-Lab/Mind2Web-cleaned-lite-reward-model-w-cot-formatted Viewer • Updated 20 days ago • 6.13k • 4
DLI-Lab/Mind2Web-cleaned-lite-value-model-w-cot-formatted-test Viewer • Updated 17 days ago • 6.13k • 2
DLI-Lab/Mind2Web-cleaned-lite-reward-model-w-cot-formatted-v2 Viewer • Updated 16 days ago • 6.13k • 2
tayyibsupercool/resource_allocation_telecom_spectral_efficiency_instruct Viewer • Updated 8 days ago • 100k • 98 • 1
tayyibsupercool/resource_allocation_telecom_energy_efficiency_instruct Viewer • Updated 8 days ago • 100k • 251 • 1
DLI-Lab/Mind2Web-cleaned-lite-acctree-value-model-w-cot-formatted Viewer • Updated 9 days ago • 6.13k • 2
DLI-Lab/world_model_for_wa_acctree_w_cot-formatted-test Viewer • Updated 6 days ago • 14.7k • 8
JiaweiGuo123/Alpaca-gpt4-English-with-gsm8k-semantic-similarity Viewer • Updated 3 days ago • 52k • 9
aamina/channel_gains_vs_tx_powers_ee_augmented_with_context_10k Viewer • Updated about 22 hours ago • 10k • 7
Self-GRIT/open-hermes-2.5-sft-llama3-inference-query-reformulation-tokens Viewer • Updated about 19 hours ago • 33.3k • 17