2 14 32

Jieneng Chen

jienengchen

https://beckschen.github.io/

AI & ML interests

multi-modal LLMs

Recent Activity

liked a model 17 days ago

JeffreyXiang/TRELLIS-image-large

liked a model about 1 month ago

deepseek-ai/Janus-Pro-7B

liked a model about 1 month ago

deepseek-ai/DeepSeek-R1

View all activity

Organizations

jienengchen's activity

liked a model 17 days ago

JeffreyXiang/TRELLIS-image-large

Image-to-3D • Updated Dec 6, 2024 • 817k • 393

liked 3 models about 1 month ago

upvoted a paper about 1 month ago

Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens

Paper • 2501.07730 • Published Jan 13 • 16

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated 5 days ago • 3.29M • • 3.57k

upvoted a paper about 2 months ago

VideoAuteur: Towards Long Narrative Video Generation

Paper • 2501.06173 • Published Jan 10 • 31

liked a model about 2 months ago

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 1.25M • • 3.46k

liked a dataset about 2 months ago

ccvl/3DSRBench

Viewer • Updated 26 days ago • 5.16k • 577 • 5

upvoted a paper 2 months ago

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

Paper • 2412.15213 • Published Dec 19, 2024 • 26

liked a model 2 months ago

THUDM/CogVideoX1.5-5B-I2V

Image-to-Video • Updated Nov 20, 2024 • 21.7k • 95

authored a paper 2 months ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 90

upvoted a paper 2 months ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 90

commented a paper 2 months ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 90 •

authored a paper 3 months ago

3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark

Paper • 2412.07825 • Published Dec 10, 2024 • 11

upvoted a paper 3 months ago

3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark

Paper • 2412.07825 • Published Dec 10, 2024 • 11

liked a model 3 months ago

tencent/HunyuanVideo

Text-to-Video • Updated Jan 21 • 7.49k • • 1.7k

authored a paper 3 months ago

ViTamin: Designing Scalable Vision Models in the Vision-Language Era

Paper • 2404.02132 • Published Apr 2, 2024 • 2

commented a paper 3 months ago

Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 76 •

authored a paper 3 months ago

Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 76