Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
27.5
TFLOPS
8
37
Nunya Biz
SpyC0der77
Follow
Ghostlyone's profile picture
JackismyShephard's profile picture
2 followers
ยท
21 following
AI & ML interests
None yet
Recent Activity
liked
a Space
about 8 hours ago
webml-community/whisper-large-v3-turbo-webgpu
liked
a Space
about 9 hours ago
JournalistsonHF/ai-toolkit
reacted
to
fdaudens
's
post
with ๐
about 9 hours ago
Is this the best tool to extract clean info from PDFs, handwriting and complex documents yet? Open source olmOCR just dropped and the results are impressive. Tested the free demo with various documents, including a handwritten Claes Oldenburg letter. The speed is impressive: 3000 tokens/second on your own GPU - that's 1/32 the cost of GPT-4o ($190/million pages). Game-changer for content extraction and digital archives. To achieve this, Ai2 trained a 7B vision language model on 260K pages from 100K PDFs using "document anchoring" - combining PDF metadata with page images. Best part: it actually understands document structure (columns, tables, equations) instead of just jumbling everything together like most OCR tools. Their human eval results back this up. ๐ Try the demo: https://olmocr.allenai.org Going right into the AI toolkit: https://huggingface.co./spaces/JournalistsonHF/ai-toolkit
View all activity
Organizations
None yet
spaces
21
Sort:ย Recently updated
pinned
Runtime error
ULTIMATE RVC
๐ข
An ap
Build error
YTDLP Docker
๐ข
Runtime error
Image Generation
๐
Running
ChatGPT Ad Maker
๐
Convert images and videos into dot patterns
Runtime error
Sdxl
๐ผ
Runtime error
Model Lora
๐ผ
Expand 21 spaces
models
None public yet
datasets
None public yet