1 39 76

gerald hewes

gerald29

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

allenai/olmOCR-7B-0225-preview

upvoted a paper 7 days ago

LLM-based User Profile Management for Recommender System

upvoted a paper 7 days ago

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

View all activity

Organizations

None yet

gerald29's activity

liked a model 3 days ago

allenai/olmOCR-7B-0225-preview

Image-Text-to-Text • Updated 4 days ago • 16k • 298

upvoted 9 papers 7 days ago

LLM-based User Profile Management for Recommender System

Paper • 2502.14541 • Published 9 days ago • 5

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

Paper • 2502.14802 • Published 8 days ago • 11

Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data

Paper • 2502.14044 • Published 9 days ago • 7

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

Paper • 2502.14377 • Published 9 days ago • 11

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published 8 days ago • 13

NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization

Paper • 2502.14638 • Published 8 days ago • 11

S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published 11 days ago • 27

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Paper • 2502.14834 • Published 8 days ago • 23

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 9 days ago • 56

liked a model 8 days ago

HuggingFaceTB/SmolVLM2-2.2B-Instruct

Image-Text-to-Text • Updated 1 day ago • 51k • 83

upvoted a paper 8 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 8 days ago • 118

upvoted a paper 9 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 9 days ago • 150

liked a Space 9 days ago

1.79k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 12 days ago

OpenGVLab/InternVideo2_5_Chat_8B

Video-Text-to-Text • Updated 11 days ago • 14.1k • 43

liked 2 models 13 days ago

pyannote/speaker-diarization-3.1

Automatic Speech Recognition • Updated May 10, 2024 • 11.4M • 713

ostris/Flex.1-alpha

Text-to-Image • Updated Jan 19 • 26.5k • 390

liked a Space 16 days ago

221

Agent Leaderboard

💬

Ranking of LLMs for agentic tasks

liked a model 16 days ago

nomic-ai/nomic-embed-text-v2-moe

liked a model 18 days ago

PramaLLC/BEN2

Image Segmentation • Updated 16 days ago • 5.52k • 156