Banghua Zhu's picture

Banghua Zhu

banghua

·

https://people.eecs.berkeley.edu/~banghua/

AI & ML interests

Foundation models, statistics, information theory

Recent Activity

liked a Space 13 days ago

bigcomputer/SWE-Arena

updated a model 3 months ago

Nexusflow/Athene-V2-Chat

new activity 4 months ago

Nexusflow/Athene-V2-Chat:inference api not working

View all activity

Organizations

banghua's activity

upvoted a collection 5 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 11 days ago • 552

upvoted a paper 9 months ago

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Paper • 2406.11939 • Published Jun 17, 2024 • 7

upvoted a collection 12 months ago

Starling

2 items • Updated Mar 20, 2024 • 7

upvoted 6 papers over 1 year ago

Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons

Paper • 2301.11270 • Published Jan 26, 2023 • 2

Online Learning in Stackelberg Games with an Omniscient Follower

Paper • 2301.11518 • Published Jan 27, 2023 • 1

Jump-Start Reinforcement Learning

Paper • 2204.02372 • Published Apr 5, 2022 • 1

Doubly Robust Self-Training

Paper • 2306.00265 • Published Jun 1, 2023 • 1

On Optimal Caching and Model Multiplexing for Large Model Inference

Paper • 2306.02003 • Published Jun 3, 2023 • 1

Fine-Tuning Language Models with Advantage-Induced Policy Alignment

Paper • 2306.02231 • Published Jun 4, 2023 • 2