3 18 160

JingyeChen22

https://jingyechen.github.io

JingyeChen

AI & ML interests

OCR, Document Analysis, Text-to-X

Recent Activity

liked a Space about 2 months ago

black-forest-labs/FLUX.1-dev

liked a model about 2 months ago

videophysics/videocon_physics

liked a dataset about 2 months ago

videophysics/videophy_train_public

View all activity

Organizations

JingyeChen22's activity

upvoted a paper 2 months ago

Large Motion Video Autoencoding with Cross-modal Video VAE

Paper • 2412.17805 • Published Dec 23, 2024 • 24

upvoted a collection 2 months ago

RoLoRA

Collection

[EMNLP2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization • 3 items • Updated Sep 26, 2024 • 3

upvoted a paper 2 months ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 90

upvoted 4 papers 5 months ago

3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection

Paper • 2410.01647 • Published Oct 2, 2024 • 28

upvoted a paper 6 months ago

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

Paper • 2408.15239 • Published Aug 27, 2024 • 29

upvoted 3 papers about 1 year ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 128

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Paper • 2312.16862 • Published Dec 28, 2023 • 31

MagicScroll: Nontypical Aspect-Ratio Image Generation for Visual Storytelling via Multi-Layered Semantic-Aware Denoising

Paper • 2312.10899 • Published Dec 18, 2023 • 15

upvoted a paper over 1 year ago

LLM-FP4: 4-Bit Floating-Point Quantized Transformers

Paper • 2310.16836 • Published Oct 25, 2023 • 14

upvoted a collection over 1 year ago

🕹️ AI Games

Collection

An ongoing collection of games you can play on HF Spaces • 14 items • Updated Oct 3, 2024 • 27

upvoted 5 papers over 1 year ago

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Paper • 2310.00426 • Published Sep 30, 2023 • 60

RMT: Retentive Networks Meet Vision Transformers

Paper • 2309.11523 • Published Sep 20, 2023 • 33

Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 50

TokenFlow: Consistent Diffusion Features for Consistent Video Editing

Paper • 2307.10373 • Published Jul 19, 2023 • 57

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 244