Vikramjeet Singh's picture

Vikramjeet Singh

VikramSingh178

·

https://vikramxd.github.io

AI & ML interests

Computer Vision | Transformers| Diffusion Models | ML Systems

Organizations

VikramSingh178's activity

upvoted a paper 5 days ago

How Far is Video Generation from World Model: A Physical Law Perspective

Paper • 2411.02385 • Published 5 days ago • 27

upvoted a paper 13 days ago

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published 16 days ago • 20

upvoted a paper 24 days ago

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Paper • 2410.10629 • Published 27 days ago • 3

upvoted a collection about 1 month ago

3D Reconstruction

35 items • Updated 14 days ago • 2

upvoted a paper about 1 month ago

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Paper • 2410.02073 • Published Oct 2 • 40

upvoted 2 papers about 2 months ago

Colorful Diffuse Intrinsic Image Decomposition in the Wild

Paper • 2409.13690 • Published Sep 20 • 12

Shot2Story20K: A New Benchmark for Comprehensive Understanding of Multi-shot Videos

Paper • 2312.10300 • Published Dec 16, 2023 • 1

upvoted a paper 2 months ago

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5 • 57

upvoted an article 3 months ago

Article

Optimum-NVIDIA - Unlock blazingly fast LLM inference in just 1 line of code

Dec 5, 2023

• 4

upvoted a paper 3 months ago

IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts

Paper • 2408.03209 • Published Aug 6 • 21

upvoted 10 papers 4 months ago

ViPer: Visual Personalization of Generative Models via Individual Preference Learning

Paper • 2407.17365 • Published Jul 24 • 11

SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency

Paper • 2407.17470 • Published Jul 24 • 14

Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 16

Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition

Paper • 2402.15504 • Published Feb 23 • 21

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Paper • 2310.12190 • Published Oct 18, 2023 • 10

Semi-Parametric Neural Image Synthesis

Paper • 2204.11824 • Published Apr 25, 2022 • 1

Generate Anything Anywhere in Any Scene

Paper • 2306.17154 • Published Jun 29, 2023 • 22

InstructVideo: Instructing Video Diffusion Models with Human Feedback

Paper • 2312.12490 • Published Dec 19, 2023 • 17

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper • 2307.01952 • Published Jul 4, 2023 • 82

GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment

Paper • 2310.11513 • Published Oct 17, 2023 • 1