Zhihe Yang's picture

2 4

Zhihe Yang

zhyang2226

·

AI & ML interests

Trustworthy RL & Offline RL

Recent Activity

upvoted a paper 3 days ago

Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key

liked a Space 6 days ago

huggingface/ai-deadlines

liked a dataset 18 days ago

openbmb/RLAIF-V-Dataset

View all activity

Organizations

models 2

zhyang2226/opadpo-lora_llava-v1.5-13b

zhyang2226/opadpo-lora_llava-v1.5-7b

datasets

None public yet