1 28 121

Jie

JJ-TMT

AI & ML interests

None yet

Recent Activity

liked a dataset 11 days ago

Yuqi1997/DrivingDojo

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1

View all activity

Organizations

None yet

JJ-TMT's activity

liked a dataset 11 days ago

Yuqi1997/DrivingDojo

Preview • Updated Oct 24, 2024 • 79 • 7

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 340

liked 2 models about 2 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 14 days ago • 3.64M • • 11k

jinaai/ReaderLM-v2

Text Generation • Updated 5 days ago • 29.5k • • 551

authored 3 papers about 2 months ago

CityBench: Evaluating the Capabilities of Large Language Model as World Model

Paper • 2406.13945 • Published Jun 20, 2024

CityGPT: Empowering Urban Spatial Cognition of Large Language Models

Paper • 2406.13948 • Published Jun 20, 2024

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37

upvoted 2 papers about 2 months ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 274

liked a dataset 2 months ago

amphora/QwQ-LongCoT-130K

Viewer • Updated Dec 22, 2024 • 133k • 390 • 143

liked a model 2 months ago

Qwen/QVQ-72B-Preview

Image-Text-to-Text • Updated Jan 12 • 185k • • 562

liked a model 3 months ago

answerdotai/ModernBERT-large

Fill-Mask • Updated Jan 15 • 273k • 360

upvoted a paper 3 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 134

liked a model 3 months ago

g-astruc/AnySat

Updated Jan 7 • 10

upvoted a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 349

liked a dataset 3 months ago

foursquare/fsq-os-places

Viewer • Updated 3 days ago • 105M • 1.37k • 69

upvoted a paper 3 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 114

liked a model 3 months ago

OpenGVLab/InternVL2_5-78B

Image-Text-to-Text • Updated Feb 5 • 4.63k • 180

upvoted a collection 3 months ago

PaliGemma 2 Release

Collection

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 1 day ago • 145

liked a model 3 months ago

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • Updated 3 days ago • 77.9k • 406