James Burgess's picture

2 12 4

James Burgess

jmhb

·

https://jmhb0.github.io/

jmhb0
jmhb0

AI & ML interests

Diffusion models, 3D vision

Recent Activity

upvoted a collection about 9 hours ago

liked a model 9 days ago

baichuan-inc/Baichuan-M1-14B-Instruct

upvoted a paper 9 days ago

Temporal Preference Optimization for Long-Form Video Understanding

View all activity

Organizations

None yet

jmhb's activity

upvoted a collection about 9 hours ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 4 days ago • 81

upvoted a paper 9 days ago

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published 11 days ago • 21

upvoted a paper 17 days ago

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

Paper • 2412.15213 • Published Dec 19, 2024 • 26

upvoted 2 papers 19 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 21 days ago • 89

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published 21 days ago • 49

upvoted a paper 6 months ago

Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning

Paper • 2408.07931 • Published Aug 15, 2024 • 20

upvoted 3 papers 7 months ago

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Paper • 2407.06189 • Published Jul 8, 2024 • 26

Viewpoint Textual Inversion: Unleashing Novel View Synthesis with Pretrained 2D Diffusion Models

Paper • 2309.07986 • Published Sep 14, 2023 • 3

μ-Bench: A Vision-Language Benchmark for Microscopy Understanding

Paper • 2407.01791 • Published Jul 1, 2024 • 5

upvoted a paper 10 months ago

MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation

Paper • 2404.11565 • Published Apr 17, 2024 • 15

upvoted a paper about 1 year ago

Diffusion Priors for Dynamic View Synthesis from Monocular Videos

Paper • 2401.05583 • Published Jan 10, 2024 • 10

upvoted a paper over 1 year ago

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Paper • 2309.10020 • Published Sep 18, 2023 • 40