Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
4
Zhihe Yang
zhyang2226
Follow
0 followers
·
1 following
AI & ML interests
Trustworthy RL & Offline RL
Recent Activity
upvoted
a
paper
3 days ago
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
liked
a Space
6 days ago
huggingface/ai-deadlines
liked
a dataset
18 days ago
openbmb/RLAIF-V-Dataset
View all activity
Organizations
models
2
Sort: Recently updated
zhyang2226/opadpo-lora_llava-v1.5-13b
Updated
Jan 16
zhyang2226/opadpo-lora_llava-v1.5-7b
Updated
Jan 16
datasets
None public yet