Yuseung "Phillip" Lee

phillipinseoul

https://phillipinseoul.github.io/

AI & ML interests

Computer Vision

Recent Activity

liked a Space 7 days ago

Doubiiu/ViewCrafter

upvoted a paper 11 days ago

RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers

liked a model 15 days ago

Qwen/Qwen2.5-VL-72B-Instruct

phillipinseoul's activity

upvoted a paper 11 days ago

RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers

Paper • 2502.15894 • Published 16 days ago • 19

upvoted a paper 15 days ago

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

Paper • 2502.14377 • Published 17 days ago • 12

upvoted 2 papers 17 days ago

Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above

Paper • 2502.14127 • Published 18 days ago • 2

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 18 days ago • 157

upvoted 2 papers 18 days ago

OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning

Paper • 2502.11271 • Published 21 days ago • 16

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published 20 days ago • 50

upvoted 3 papers 20 days ago

Large Language Diffusion Models

Paper • 2502.09992 • Published 23 days ago • 98

ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models

Paper • 2502.09696 • Published 24 days ago • 38

Region-Adaptive Sampling for Diffusion Transformers

Paper • 2502.10389 • Published 23 days ago • 52

upvoted 2 papers 21 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 25 days ago • 143

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

Paper • 2502.08690 • Published 25 days ago • 41

upvoted 3 papers 24 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 24 days ago • 182

LLM Pretraining with Continuous Concepts

Paper • 2502.08524 • Published 25 days ago • 28

Distillation Scaling Laws

Paper • 2502.08606 • Published 25 days ago • 46

upvoted 3 papers 25 days ago

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published about 1 month ago • 95

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Paper • 2502.06782 • Published 27 days ago • 13

History-Guided Video Diffusion

Paper • 2502.06764 • Published 27 days ago • 12

upvoted 3 papers 27 days ago

Generating Symbolic World Models via Test-time Scaling of Large Language Models

Paper • 2502.04728 • Published about 1 month ago • 19

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Paper • 2502.05173 • Published 30 days ago • 64

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 30 days ago • 121