arxiv:2402.06457
Arian Hosseini
arianhosseini
AI & ML interests
large language models, reasoning, planning, systematic generalization
Organizations
Papers
1
models
8
arianhosseini/sample_ver
Updated
arianhosseini/ver_base
Text Generation
•
Updated
arianhosseini/zephyr-7b-dpo-qlora
Updated
arianhosseini/sample_gen
Updated
arianhosseini/pythia410m-tldr-dpo-1b-relbl-10k
Updated
arianhosseini/pythia410m-tldr-dpo-1b-relbl
Updated
arianhosseini/pythia410m-tldr-dpo-1b-relbl_10k
Updated
arianhosseini/llama-2-7b-gsm8k-lora
Updated
datasets
14
arianhosseini/hh_sft
Viewer
•
Updated
arianhosseini/hh_with_prompt
Viewer
•
Updated
arianhosseini/ultrafeedback_binarized_relabel1b
Viewer
•
Updated
•
29
arianhosseini/summ_dpo1b1_ngen10_max_2ndmax
Viewer
•
Updated
arianhosseini/summ_dpo1b1_ngen10_minmax
Viewer
•
Updated
arianhosseini/comparisons_20k_regen_labeled_dpo1b1
Viewer
•
Updated
arianhosseini/quail_with_tree_depth
Viewer
•
Updated
arianhosseini/summarize_dpo1b1_ngen10_20k
Viewer
•
Updated
arianhosseini/swag_formatted_to_quail
Viewer
•
Updated
arianhosseini/openai_comparisons_20k_regen_and_relabelled
Viewer
•
Updated