Ame Vi's picture
6 13

Ame Vi

Ameeeee

AI & ML interests

None yet

Recent Activity

Organizations

Hugging Face's profile picture Argilla's profile picture Women on Hugging Face's profile picture Argilla Explorers's profile picture Data Is Better Together's profile picture Social Post Explorers's profile picture HuggingFaceFW-Dev's profile picture Data Is Better Together Contributor's profile picture Bluesky Community's profile picture

Ameeeee's activity

reacted to fdaudens's post with 🔥 1 day ago
view post
Post
2649
Is this the best tool to extract clean info from PDFs, handwriting and complex documents yet?

Open source olmOCR just dropped and the results are impressive.

Tested the free demo with various documents, including a handwritten Claes Oldenburg letter. The speed is impressive: 3000 tokens/second on your own GPU - that's 1/32 the cost of GPT-4o ($190/million pages). Game-changer for content extraction and digital archives.

To achieve this, Ai2 trained a 7B vision language model on 260K pages from 100K PDFs using "document anchoring" - combining PDF metadata with page images.

Best part: it actually understands document structure (columns, tables, equations) instead of just jumbling everything together like most OCR tools. Their human eval results back this up.

👉 Try the demo: https://olmocr.allenai.org

Going right into the AI toolkit: JournalistsonHF/ai-toolkit
  • 3 replies
·
reacted to burtenshaw's post with 👍 1 day ago
view post
Post
2532
I made a real time voice agent with FastRTC, smolagents, and hugging face inference providers. Check it out in this space:

🔗 burtenshaw/coworking_agent
·
upvoted an article 3 days ago
view article
Article

Synthetic data: save money, time and carbon with open source

61
upvoted an article 17 days ago
upvoted an article about 1 month ago
view article
Article

Fine-tune ModernBERT for RAG with Synthetic Data

By sdiazlor and 2 others
36
published an article 2 months ago
view article
Article

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

109
published an article 3 months ago
view article
Article

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

54