25 166 575

Florent Daudens

fdaudens

AI & ML interests

AI & Journalism

Recent Activity

posted an update about 2 hours ago

What if AI becomes as ubiquitous as the internet, but runs locally and transparently on our devices? Fascinating TED talk by @thomwolf on open source AI and its future impact. Imagine this for AI: instead of black box models running in distant data centers, we get transparent AI that runs locally on our phones and laptops, often without needing internet access. If the original team moves on? No problem - resilience is one of the beauties of open source. Anyone (companies, collectives, or individuals) can adapt and fix these models. This is a compelling vision of AI's future that solves many of today's concerns around AI transparency and centralized control. Watch the full talk here: https://www.ted.com/talks/thomas_wolf_what_if_ai_just_works

liked a Space about 14 hours ago

Wan-AI/Wan2.1

updated a Space about 21 hours ago

JournalistsonHF/ai-toolkit

View all activity

Organizations

fdaudens's activity

posted an update about 2 hours ago

Post

124

What if AI becomes as ubiquitous as the internet, but runs locally and transparently on our devices?

Fascinating TED talk by @thomwolf on open source AI and its future impact.

Imagine this for AI: instead of black box models running in distant data centers, we get transparent AI that runs locally on our phones and laptops, often without needing internet access. If the original team moves on? No problem - resilience is one of the beauties of open source. Anyone (companies, collectives, or individuals) can adapt and fix these models.

This is a compelling vision of AI's future that solves many of today's concerns around AI transparency and centralized control.

Watch the full talk here: https://www.ted.com/talks/thomas_wolf_what_if_ai_just_works

liked a Space about 14 hours ago

560

Wan2.1

💻

Wan: Open and Advanced Large-Scale Video Generative Models

updated a Space about 21 hours ago

The Essential AI Toolkit

🧰

A curated collection of AI tools for journalists & creators

liked a Space about 21 hours ago

PhineSpeechTranslator

👀

Break the language barrier

liked a model about 21 hours ago

microsoft/Phi-4-mini-instruct

Text Generation • Updated about 9 hours ago • 9.11k • 180

liked a Space about 21 hours ago

628

Open ASR Leaderboard

🏆

Request evaluation for speech models

upvoted a collection about 24 hours ago

olmOCR

Collection

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 1 day ago • 46

liked a model 1 day ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated about 16 hours ago • 7.35k • 500

liked a Space 1 day ago

Phi4 Multimodal

🦀

Space demoing Phi4 MultiModal

posted an update 1 day ago

Post

2430

Is this the best tool to extract clean info from PDFs, handwriting and complex documents yet?

Open source olmOCR just dropped and the results are impressive.

Tested the free demo with various documents, including a handwritten Claes Oldenburg letter. The speed is impressive: 3000 tokens/second on your own GPU - that's 1/32 the cost of GPT-4o ($190/million pages). Game-changer for content extraction and digital archives.

To achieve this, Ai2 trained a 7B vision language model on 260K pages from 100K PDFs using "document anchoring" - combining PDF metadata with page images.

Best part: it actually understands document structure (columns, tables, equations) instead of just jumbling everything together like most OCR tools. Their human eval results back this up.

👉 Try the demo: https://olmocr.allenai.org

Going right into the AI toolkit: JournalistsonHF/ai-toolkit

3 replies

upvoted a collection 1 day ago

Phi-4

Collection

Phi-4 family of small language and multi-modal models. • 7 items • Updated about 5 hours ago • 82

upvoted an article 3 days ago

Article

FastRTC: The Real-Time Communication Library for Python

4 days ago

• 96

updated a Space 4 days ago

README

😻

A hub for journalists exploring AI in news media

liked a Space 4 days ago

The Essential AI Toolkit

🧰

A curated collection of AI tools for journalists & creators

posted an update 4 days ago

Post

3162

🚀 Just launched: A toolkit of 20 powerful AI tools that journalists can use right now - transcribe, analyze, create. 100% free & open-source.

Been testing all these tools myself and created a searchable collection of the most practical ones - from audio transcription to image generation to document analysis. No coding needed, no expensive subscriptions.

Some highlights I've tested personally:
- Private, on-device transcription with speaker ID in 100+ languages using Whisper
- Website scraping that just works - paste a URL, get structured data
- Local image editing with tools like Finegrain (impressive results)
- Document chat using Qwen 2.5 72B (handles technical papers well)

Sharing this early because the best tools come from the community. Drop your favorite tools in the comments or join the discussion on what to add next!

👉 JournalistsonHF/ai-toolkit

liked a model 5 days ago

moonshotai/Moonlight-16B-A3B-Instruct

Text Generation • Updated 2 days ago • 2.61k • 117

published a Space 6 days ago

The Essential AI Toolkit

🧰

A curated collection of AI tools for journalists & creators

posted an update 7 days ago

Post

3445

Trying something new to keep you ahead of the curve: The 5 AI stories of the week - a weekly curation of the most important AI news you need to know. Do you like it?

For more AI stories and deeper analysis, check out my newsletter: https://open.substack.com/pub/fdaudens/p/ai-competition-heats-up-grok-3-iphone