Collection of datasets and models for our paper "Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas"
Nishant Balepur
nbalepur
AI & ML interests
NLP
Recent Activity
updated
a collection
about 2 months ago
Alignment Personalization
updated
a collection
about 2 months ago
Alignment Personalization
updated
a collection
about 2 months ago
Alignment Personalization
Organizations
Collections
2
Papers
1
models
8
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a3f93fe2b7740fe2a94c86/ZiaPqiVqXI2ANIyWQY_hT.png)
nbalepur/Llama-3.1-8B-PT-DPO-Mnemonic
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a3f93fe2b7740fe2a94c86/ZiaPqiVqXI2ANIyWQY_hT.png)
nbalepur/Llama-3.1-8B-PT-DPO-HHH
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a3f93fe2b7740fe2a94c86/ZiaPqiVqXI2ANIyWQY_hT.png)
nbalepur/Llama-3.1-8B-PT-DPO-BeaverTails
Text Generation
•
Updated
•
8
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a3f93fe2b7740fe2a94c86/ZiaPqiVqXI2ANIyWQY_hT.png)
nbalepur/Llama-3.1-8B_copy_persona_False_Mnemonic_dpo_chosen
Text Generation
•
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a3f93fe2b7740fe2a94c86/ZiaPqiVqXI2ANIyWQY_hT.png)
nbalepur/Llama-3.1-8B_copy_persona_False_Safe_RLHF_dpo_chosen
Text Generation
•
Updated
•
6
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a3f93fe2b7740fe2a94c86/ZiaPqiVqXI2ANIyWQY_hT.png)
nbalepur/LLama-2-70b-Mnemonic-Tokenizer
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a3f93fe2b7740fe2a94c86/ZiaPqiVqXI2ANIyWQY_hT.png)
nbalepur/LLama-2-70b-Mnemonic-SFT
Text Generation
•
Updated
•
106
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62a3f93fe2b7740fe2a94c86/ZiaPqiVqXI2ANIyWQY_hT.png)
nbalepur/LLama-2-70b-Mnemonic-DPO
Text Generation
•
Updated
•
51
datasets
85
nbalepur/persona-inference
Viewer
•
Updated
•
1.2k
•
72
nbalepur/persona-tailoring
Viewer
•
Updated
•
5.35k
•
103
nbalepur/personas_vague
Viewer
•
Updated
•
37.8k
•
48
nbalepur/persona_qual_fixed6
Viewer
•
Updated
•
15
•
37
nbalepur/persona_qual_fixed5
Viewer
•
Updated
•
15
•
74
nbalepur/persona_qual_fixed4
Viewer
•
Updated
•
15
•
50
nbalepur/persona_qual_fixed3
Viewer
•
Updated
•
15
•
51
nbalepur/persona_qual_fixed2
Viewer
•
Updated
•
30
•
61
nbalepur/persona_qual_fixed
Viewer
•
Updated
•
30
•
42
nbalepur/persona_qual
Viewer
•
Updated
•
30
•
70