yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.05-alpha-0-step-59904 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.025-alpha-0-LATEST Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.025-alpha-0-step-19968 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.025-alpha-0-step-39936 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.025-alpha-0-step-59904 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.025-alpha-0-step-79872 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.01-alpha-0-step-39936 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.01-alpha-0-step-19968 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.01-alpha-0-step-59904 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.01-alpha-0-step-79872 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.0375-alpha-0-LATEST Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.0375-alpha-0-step-19968 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.0375-alpha-0-step-59904 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.0375-alpha-0-step-39936 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.0375-alpha-0-step-79872 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.0175-alpha-0-step-19968 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.0175-alpha-0-step-79872 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.0175-alpha-0-LATEST Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.0175-alpha-0-step-59904 Text Generation • Updated 13 days ago
yaswanthchittepu/pythia-2.8b-tldr-ipo-beta-0.0175-alpha-0-step-39936 Text Generation • Updated 13 days ago
AlignmentResearch/robust_llm_pythia-31m_niki-044_imdb_random-token-1280_30-rounds_seed-3 Text Classification • Updated 13 days ago
AlignmentResearch/robust_llm_pythia-14m_niki-044_imdb_random-token-1280_30-rounds_seed-3 Text Classification • Updated 13 days ago
AlignmentResearch/robust_llm_pythia-31m_niki-044_imdb_random-token-1280_30-rounds_seed-4 Text Classification • Updated 13 days ago
AlignmentResearch/robust_llm_niki-test-per-round-model-saving Text Classification • Updated 13 days ago
AlignmentResearch/robust_llm_niki-test-saving-every-adv-round_adv-tr-round-0 Text Classification • Updated 13 days ago
AlignmentResearch/robust_llm_niki-test-saving-every-adv-round_adv-tr-round-1 Text Classification • Updated 13 days ago
AlignmentResearch/robust_llm_niki-test-saving-every-adv-round_adv-tr-round-2 Text Classification • Updated 13 days ago