5 29 83

Pu Fanyi

pufanyi

https://pufanyi.github.io

AI & ML interests

Recent Activity

upvoted a paper 1 day ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

upvoted a paper 1 day ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

liked a dataset 4 days ago

lmms-lab/multimodal-open-r1-8k-verified

View all activity

Organizations

pufanyi's activity

upvoted 2 papers 1 day ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 6 days ago • 88

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 5

upvoted a paper 10 days ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published 11 days ago • 22

upvoted a paper 19 days ago

Fine-Tuning Language Models with Just Forward Passes

Paper • 2305.17333 • Published May 27, 2023 • 3

upvoted a paper 27 days ago

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 8

upvoted 4 papers about 1 month ago

upvoted a paper about 2 months ago

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

Paper • 2306.01116 • Published Jun 1, 2023 • 33

upvoted an article 2 months ago

Article

Fine-tuning Mistral on Your Dataset

•

Jul 22, 2024

• 19

upvoted a paper 2 months ago

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 16

upvoted a paper 3 months ago

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19, 2024 • 25

upvoted 3 collections 3 months ago

Oryx-1.5

Collection

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution • 4 items • Updated 19 days ago • 5

Oryx

Collection

Oryx: One Multi-Modal LLM for On-Demand Spatial-Temporal Understanding • 6 items • Updated Dec 11, 2024 • 16

LongVA

Collection

Long Context Transfer From Text To Vision: https://lmms-lab.github.io/posts/longva/ • 5 items • Updated Oct 4, 2024 • 13

upvoted a paper 3 months ago

Quantifying the Carbon Emissions of Machine Learning

Paper • 1910.09700 • Published Oct 21, 2019 • 13

upvoted 3 papers 4 months ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 35

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 38