arxiv:2310.12773
Juntao Dai
calico-1226
AI & ML interests
RLHF
Organizations
Papers
2
models
12
calico-1226/gold-model-0921-ultra
Updated
calico-1226/gold-model-0920-ultra
Updated
calico-1226/sheared-llama-2.7B_unfreeze_augmentation_0919
Updated
calico-1226/sheared-llama-2.7B_unfreeze_rescored-data_0919
Updated
calico-1226/gold-model-0918-ultra
Updated
calico-1226/scorelm_Sheared_LLaMA_2.7B_unfreeze_0917
Updated
calico-1226/scorelm_openllama_3b_v2_unfreeze_0916
Updated
•
2
calico-1226/openllama_3b_v2_unfreeze_0916
Updated
calico-1226/gpt2_774m_0910
Updated
•
2
calico-1226/alpaca-7b-sft
Updated