arxiv:2310.12773
Juntao Dai
calico-1226
AI & ML interests
RLHF
Recent Activity
updated
a model
about 1 month ago
calico-1226/alpaca-7b_unfreeze_original_1119
updated
a model
about 1 month ago
calico-1226/sheared-llama-2.7B_unfreeze_original_1118
updated
a model
about 1 month ago
calico-1226/bspo_eps1e-5_s42_1114
Organizations
Papers
2
models
16
calico-1226/alpaca-7b_unfreeze_original_1119
Updated
•
5
calico-1226/sheared-llama-2.7B_unfreeze_original_1118
Updated
•
2
calico-1226/bspo_eps1e-5_s42_1114
Text Generation
•
Updated
•
7
calico-1226/gold-model-0921-ultra
Updated
calico-1226/gold-model-0920-ultra
Updated
•
3
calico-1226/sheared-llama-2.7B_unfreeze_augmentation_0919
Updated
•
3
calico-1226/sheared-llama-2.7B_unfreeze_rescored-data_0919
Updated
•
3
calico-1226/gold-model-0918-ultra
Updated
•
3
calico-1226/scorelm_Sheared_LLaMA_2.7B_unfreeze_0917
Updated
•
2
calico-1226/scorelm_openllama_3b_v2_unfreeze_0916
Updated
•
3