arxiv:2402.05828
Chris Lu
chrlu
AI & ML interests
None yet
Organizations
None yet
models
19
chrlu/zephyr-7b-gemma-bline-kto-unlabeled
Text Generation
•
Updated
chrlu/zephyr-7b-gemma-kto-2
Text Generation
•
Updated
•
21
chrlu/zephyr-7b-gemma-adaptive_confidence_margin_loss_213
Text Generation
•
Updated
chrlu/zephyr-7b-gemma-adaptive_quantile_feedback_loss
Text Generation
•
Updated
•
28
chrlu/zephyr-7b-gemma-dynamic_blended_adaptive_quantile_loss
Text Generation
•
Updated
•
32
chrlu/zephyr-7b-gemma-adaptive_blended_loss_with_temperature_scaling
Text Generation
•
Updated
chrlu/zephyr-7b-gemma-log_ratio_modulated_loss
Text Generation
•
Updated
•
25
chrlu/zephyr-7b-gemma-policy_focused_loss
Text Generation
•
Updated
•
25
chrlu/zephyr-7b-gemma-combined_exp_logistic_loss
Text Generation
•
Updated
•
26
chrlu/zephyr-7b-gemma-adaptive_quantile_loss
Text Generation
•
Updated
•
14
datasets
None public yet