Suraj

ghishadow

AI & ML interests

None yet

ghishadow's activity

upvoted a collection 1 day ago

olmOCR

Collection

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 2 days ago • 46

liked a dataset 7 days ago

google/smol

Viewer • Updated 8 days ago • 811k • 3.09k • 34

liked a dataset 9 days ago

SakanaAI/AI-CUDA-Engineer-Archive

Viewer • Updated 9 days ago • 30.6k • 10.7k • 128

liked a model 10 days ago

perplexity-ai/r1-1776

Text Generation • Updated 2 days ago • 31.9k • 1.89k

liked a model 11 days ago

microsoft/OmniParser-v2.0

Image-Text-to-Text • Updated 11 days ago • 6.73k • 1.03k

liked a dataset 13 days ago

zed-industries/zeta

Viewer • Updated 1 day ago • 583 • 8.72k • 84

liked a model 13 days ago

zed-industries/zeta

Updated 1 day ago • 1.92k • 206

liked 2 models 17 days ago

agentica-org/DeepScaleR-1.5B-Preview

Text Generation • Updated 6 days ago • 39.2k • 487

hexgrad/Kokoro-82M

Text-to-Speech • Updated 1 day ago • 1.29M • 3.47k

upvoted a paper about 1 month ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

upvoted a collection about 2 months ago

Gemma Scope Release

Collection

A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. • 10 items • Updated Dec 13, 2024 • 17

upvoted 2 papers 2 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 93

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 134

upvoted a collection 2 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 139

liked a model 4 months ago

microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 2.76k • 1.63k

liked a Space 5 months ago

WebGPU Embedding Benchmark

🐠

Measure execution times of BERT models using WebGPU and WASM

liked a model 6 months ago

Mozilla/TriLM-llamafile

Text Generation • Updated Aug 26, 2024 • 492 • 19

upvoted an article 6 months ago

SmolLM - blazingly fast and remarkably powerful

Article • Jul 16, 2024 • 323

liked 2 models 6 months ago

deepseek-ai/deepseek-coder-6.7b-base

Text Generation • Updated Mar 19, 2024 • 36.3k • 100

Mozilla/whisperfile

Updated Oct 2, 2024 • 2.19k • 241