Jean Louis

JLouisBiz

https://www.StartYourOwnGoldMine.com

AI & ML interests

- LLM for sales, marketing, promotion - LLM for Website Revision System - increasing quality of communication with customers - helping clients access information faster - saving people from financial troubles

Recent Activity

replied to Kseniase's post about 2 hours ago

5 New implementations of Diffusion Models Diffusion models are widely used for image and video generation but remain underexplored in text generation, where autoregressive models (ARMs) dominate. Unlike ARMs, which produce tokens sequentially, diffusion models iteratively refine noise through denoising steps, offering greater flexibility and speed. Recent advancements show a shift toward using diffusion models in place of, or alongside, ARMs. Researchers also combine strengths from both methods and integrate autoregressive concepts into diffusion. Here are 5 new implementations of diffusion models: 1. Mercury family of diffusion LLMs (dLLMs) by Inception Labs -> https://www.inceptionlabs.ai/news It applies diffusion to text and code data, enabling sequence generation 10x faster than today's top LLMs. Now available Mercury Coder can run at over 1,000 tokens/sec on NVIDIA H100s. 2. Diffusion of Thoughts (DoT) -> https://huggingface.co./papers/2402.07754 Integrates diffusion models with Chain-of-Thought. DoT allows reasoning steps to diffuse gradually over time. This flexibility enables balancing between reasoning quality and computational cost. 3. LLaDA -> https://huggingface.co./papers/2502.09992 Shows diffusion models' potential in replacing ARMs. Trained with pre-training and SFT, LLaDA masks tokens, predicts them via a Transformer, and optimizes a likelihood bound. LLaDA matches key LLM skills, and surpasses GPT-4o in reversal poetry. 4. LanDiff -> https://huggingface.co./papers/2503.04606 This hybrid text-to-video model combines autoregressive and diffusion paradigms, introducing a semantic tokenizer, an LM for token generation, and a streaming diffusion model. LanDiff outperforms models like Sora. 5. General Interpolating Discrete Diffusion (GIDD) -> https://huggingface.co./papers/2503.04482 A flexible noising process with a novel diffusion ELBO enables combining masking and uniform noise, allowing diffusion models to correct mistakes, where ARMs struggle.

new activity about 18 hours ago

eaddario/Watt-Tool-8B-GGUF:Problem with the license, this is not really free software

new activity about 23 hours ago

meditsolutions/medit-one-140M-9B-tokens-checkpoint:Question on meaning of parameter of this model

View all activity

Organizations

JLouisBiz's activity

replied to Kseniase's post about 2 hours ago

Are there any Free Software diffuction LLMs on Hugging Face yet?

New activity in eaddario/Watt-Tool-8B-GGUF about 18 hours ago

Problem with the license, this is not really free software

#1 opened 5 days ago by

JLouisBiz

New activity in meditsolutions/medit-one-140M-9B-tokens-checkpoint about 23 hours ago

Question on meaning of parameter of this model

#2 opened 1 day ago by

JLouisBiz

New activity in meditsolutions/medit-one-140M-9B-tokens-checkpoint 1 day ago

Can't install

#1 opened 3 days ago by

JLouisBiz

updated a collection 3 days ago

Free Software Models

Collection

Only fully free software models as by definition: https://www.gnu.org/philosophy/free-sw.html • 56 items • Updated 3 days ago • 1

reacted to mkurman's post with 👍 3 days ago

Post

2306

MedIT One 140M Fifth checkpoint after 9B tokens
meditsolutions/medit-one-140M-9B-tokens-checkpoint

updated a collection 3 days ago

Free Software Models

Collection

Only fully free software models as by definition: https://www.gnu.org/philosophy/free-sw.html • 56 items • Updated 3 days ago • 1

reacted to as-cle-bert's post with 👍 3 days ago

Post

2422

I just released a fully automated evaluation framework for your RAG applications!📈

GitHub 👉 https://github.com/AstraBert/diRAGnosis
PyPi 👉 https://pypi.org/project/diragnosis/

It's called 𝐝𝐢𝐑𝐀𝐆𝐧𝐨𝐬𝐢𝐬 and is a lightweight framework that helps you 𝗱𝗶𝗮𝗴𝗻𝗼𝘀𝗲 𝘁𝗵𝗲 𝗽𝗲𝗿𝗳𝗼𝗿𝗺𝗮𝗻𝗰𝗲 𝗼𝗳 𝗟𝗟𝗠𝘀 𝗮𝗻𝗱 𝗿𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 𝗺𝗼𝗱𝗲𝗹𝘀 𝗶𝗻 𝗥𝗔𝗚 𝗮𝗽𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻𝘀.

You can launch it as an application locally (it's Docker-ready!🐋) or, if you want more flexibility, you can integrate it in your code as a python package📦

The workflow is simple:
🧠 You choose your favorite LLM provider and model (supported, for now, are Mistral AI, Groq, Anthropic, OpenAI and Cohere)
🧠 You pick the embedding models provider and the embedding model you prefer (supported, for now, are Mistral AI, Hugging Face, Cohere and OpenAI)
📄 You prepare and provide your documents
⚙️ Documents are ingested into a Qdrant vector database and transformed into a synthetic question dataset with the help of LlamaIndex
📊 The LLM is evaluated for the faithfulness and relevancy of its retrieval-augmented answer to the questions
📊 The embedding model is evaluated for hit rate and mean reciprocal ranking (MRR) of the retrieved documents

And the cool thing is that all of this is 𝗶𝗻𝘁𝘂𝗶𝘁𝗶𝘃𝗲 𝗮𝗻𝗱 𝗰𝗼𝗺𝗽𝗹𝗲𝘁𝗲𝗹𝘆 𝗮𝘂𝘁𝗼𝗺𝗮𝘁𝗲𝗱: you plug it in, and it works!🔌⚡

Even cooler? This is all built on top of LlamaIndex and its integrations: no need for tons of dependencies or fancy workarounds🦙
And if you're a UI lover, Gradio and FastAPI are there to provide you a seamless backend-to-frontend experience🕶️

So now it's your turn: you can either get diRAGnosis from GitHub 👉 https://github.com/AstraBert/diRAGnosis
or just run a quick and painless:

uv pip install diragnosis

To get the package installed (lightning-fast) in your environment🏃‍♀️

Have fun and feel free to leave feedback and feature/integrations requests on GitHub issues✨

New activity in marathi-llm/MahaMarathi-7B-v24.01-Base 3 days ago

You got a serious licensing problem

#8 opened 3 days ago by

JLouisBiz

New activity in Tower-Babel/Babel-9B-Chat 3 days ago

Can you publish it under free software license like MIT, Apache 2.0 or some other?

#2 opened 3 days ago by

JLouisBiz

updated a collection 3 days ago

Free Software Models

Collection

Only fully free software models as by definition: https://www.gnu.org/philosophy/free-sw.html • 56 items • Updated 3 days ago • 1

New activity in perplexity-ai/r1-1776 3 days ago

For anyone who is wondering what is going on here with all the "reports"

#168 opened 16 days ago by

ufwd1984

New activity in utter-project/EuroLLM-9B-Instruct 3 days ago

Disable "gated access", it is Apache 2

#6 opened 3 months ago by

kno10

replied to ZennyKenny's post 4 days ago

Meta’s LLaMa 2 license is not Open Source – Open Source Initiative
https://opensource.org/blog/metas-llama-2-license-is-not-open-source

Not free, can't use. Can you use some free as in freedom model and make such a model?

liked a model 5 days ago

suayptalha/Luminis-phi-4

Text Generation • Updated 15 days ago • 2.49k • 11

New activity in perplexity-ai/r1-1776 5 days ago

USA/West Propaganda hugging face of huggingface

#230 opened 13 days ago by

devops724

replied to sequelbox's post 5 days ago

That is long time 18 hours to get an answer

reacted to AdinaY's post with 👍 5 days ago

Post

2681

CogView-4 is out🔥🚀 The SoTa OPEN text to image model by ZhipuAI

Model: THUDM/CogView4-6B
Demo: THUDM-HF-SPACE/CogView4

✨ 6B with Apache2.0
✨ Supports Chinese & English Prompts by ANY length
✨ Generate Chinese characters within images
✨ Creates images at any resolution within a given range

replied to eaddario's post 5 days ago

Thanks, how can I reproduce it? I am using llama.cpp, do you maybe have a recipe?

updated a collection 6 days ago

Free Software Models

Collection

Only fully free software models as by definition: https://www.gnu.org/philosophy/free-sw.html • 56 items • Updated 3 days ago • 1