Clelia Astra Bertelli

as-cle-bert

https://www.cleliasportfolio.xyz

as-cle-bert's activity

posted an update 2 days ago

I just released a fully automated evaluation framework for your RAG applications!📈

GitHub 👉 https://github.com/AstraBert/diRAGnosis
PyPI 👉 https://pypi.org/project/diragnosis/

It's called 𝐝𝐢𝐑𝐀𝐆𝐧𝐨𝐬𝐢𝐬 and is a lightweight framework that helps you 𝗱𝗶𝗮𝗴𝗻𝗼𝘀𝗲 𝘁𝗵𝗲 𝗽𝗲𝗿𝗳𝗼𝗿𝗺𝗮𝗻𝗰𝗲 𝗼𝗳 𝗟𝗟𝗠𝘀 𝗮𝗻𝗱 𝗿𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 𝗺𝗼𝗱𝗲𝗹𝘀 𝗶𝗻 𝗥𝗔𝗚 𝗮𝗽𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻𝘀.

You can launch it as an application locally (it's Docker-ready!🐋) or, if you want more flexibility, you can integrate it in your code as a Python package📦

The workflow is simple:
🧠 You choose your favorite LLM provider and model (supported, for now, are Mistral AI, Groq, Anthropic, OpenAI and Cohere)
🧠 You pick the embedding models provider and the embedding model you prefer (supported, for now, are Mistral AI, Hugging Face, Cohere and OpenAI)
📄 You prepare and provide your documents
⚙️ Documents are ingested into a Qdrant vector database and transformed into a synthetic question dataset with the help of LlamaIndex
📊 The LLM is evaluated for the faithfulness and relevancy of its retrieval-augmented answer to the questions
📊 The embedding model is evaluated for hit rate and mean reciprocal rank (MRR) of the retrieved documents
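
To make the two retrieval metrics in the last step concrete, here is a minimal sketch in plain Python. It is not the diRAGnosis implementation: the function names and the toy document IDs are assumptions made for illustration. Hit rate is the share of questions whose source document shows up among the retrieved ones, and MRR averages 1/rank of that document (counting 0 when it is missing).

```python
from typing import Sequence


def hit_rate(retrieved: Sequence[Sequence[str]], expected: Sequence[str]) -> float:
    """Fraction of queries whose expected document appears in the retrieved list."""
    if not expected:
        raise ValueError("at least one query is required")
    hits = sum(1 for docs, target in zip(retrieved, expected) if target in docs)
    return hits / len(expected)


def mean_reciprocal_rank(retrieved: Sequence[Sequence[str]], expected: Sequence[str]) -> float:
    """Average of 1/rank of the expected document, counting 0 when it is missing."""
    if not expected:
        raise ValueError("at least one query is required")
    total = 0.0
    for docs, target in zip(retrieved, expected):
        ranked = list(docs)
        if target in ranked:
            total += 1.0 / (ranked.index(target) + 1)  # ranks start at 1
    return total / len(expected)


# Hypothetical toy data: three questions, each with a ranked list of document IDs.
retrieved = [["d1", "d2", "d3"], ["d5", "d4", "d6"], ["d7", "d8", "d9"]]
expected = ["d1", "d4", "d0"]

print(hit_rate(retrieved, expected))              # 2 of the 3 questions are hits
print(mean_reciprocal_rank(retrieved, expected))  # (1 + 1/2 + 0) / 3
```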

And the cool thing is that all of this is 𝗶𝗻𝘁𝘂𝗶𝘁𝗶𝘃𝗲 𝗮𝗻𝗱 𝗰𝗼𝗺𝗽𝗹𝗲𝘁𝗲𝗹𝘆 𝗮𝘂𝘁𝗼𝗺𝗮𝘁𝗲𝗱: you plug it in, and it works!🔌⚡

Even cooler? This is all built on top of LlamaIndex and its integrations: no need for tons of dependencies or fancy workarounds🦙
And if you're a UI lover, Gradio and FastAPI are there to provide you with a seamless backend-to-frontend experience🕶️

So now it's your turn: you can either get diRAGnosis from GitHub 👉 https://github.com/AstraBert/diRAGnosis
or just run a quick and painless:

uv pip install diragnosis

To get the package installed (lightning-fast) in your environment🏃‍♀️

Have fun and feel free to leave feedback and feature/integration requests on GitHub issues✨

commented on streamlit_supabase_auth_ui 3 days ago

Hi there, just wanted to reach out here as well, so that people who see our conversation know that this feature has been integrated: you can now find it in v0.1.0 of the package, already installable via pip.
Have fun!

commented on streamlit_supabase_auth_ui 16 days ago

I did not specify any configuration, but I'm pretty sure we could play around with Supabase and set a login/logout status for the user (like saying: the user last logged in at time X and logged out at time Y; if Y > X, then the user can log in again, else they cannot).
If you want, I can put it in the roadmap for the next release of the package: then I would ask you to open an issue here: https://github.com/AstraBert/streamlit_supabase_auth_ui/issues so that I can add it to the milestone for v0.1.0 :)
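
The login/logout check described above could be sketched like this. This is only an illustration of the comparison, not code from the package: the function name is hypothetical, and the handling of users who have never logged in or never logged out is an assumption.

```python
from datetime import datetime
from typing import Optional


def can_log_in(last_login: Optional[datetime], last_logout: Optional[datetime]) -> bool:
    """Allow a new login only when the previous session has been closed."""
    if last_login is None:
        return True   # assumption: a user who never logged in may log in
    if last_logout is None:
        return False  # assumption: logged in but never logged out means a session is open
    return last_logout > last_login  # Y > X: the last session was closed


# Hypothetical timestamps: login at 09:00 (X), logout at 10:00 (Y).
x = datetime(2025, 1, 1, 9, 0)
y = datetime(2025, 1, 1, 10, 0)
print(can_log_in(x, y))  # logout is later than login
print(can_log_in(y, x))  # login is later than logout
```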

commented on streamlit_supabase_auth_ui 16 days ago

I could not find it either back in the day, when I wanted to suppress it, but my suspicion is that it is linked to some not-so-up-to-date portions of the code (the code is based on a repo that used Streamlit 1.34, I believe). Nevertheless, what I did in my personal projects was suppress all the warnings with:

from warnings import filterwarnings
filterwarnings(action="ignore")
# source -> https://www.geeksforgeeks.org/how-to-disable-python-warnings/
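
A narrower variant, in case silencing every warning is too broad: the standard library also allows ignoring a single category inside a limited scope. This is a sketch that assumes the warning in question is a DeprecationWarning, which may not be the case here; `noisy` is a hypothetical stand-in for the call that triggers it.

```python
import warnings


def noisy() -> int:
    """Hypothetical stand-in for a call that emits a deprecation warning."""
    warnings.warn("this API is outdated", DeprecationWarning)
    return 42


# Ignore only DeprecationWarning, and only inside this block;
# the previous filters are restored when the block exits.
with warnings.catch_warnings():
    warnings.simplefilter("ignore", category=DeprecationWarning)
    result = noisy()
```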

Hope this helps!

commented on streamlit_supabase_auth_ui 17 days ago

Hi! Yes, the code is open and you can modify it for your projects :)
If you want to change the language of the components, you just need to modify the widgets.py script, i.e. https://github.com/AstraBert/streamlit_supabase_auth_ui/blob/main/streamlit_supabase_auth_ui/widgets.py

posted an update 18 days ago

I built an AI agent app in less than 8 hours🤯
And, believe me, this is 𝗻𝗼𝘁 clickbait❌

GitHub 👉 https://github.com/AstraBert/PapersChat
Demo 👉 as-cle-bert/PapersChat

The app is called 𝐏𝐚𝐩𝐞𝐫𝐬𝐂𝐡𝐚𝐭, and it is aimed at 𝗺𝗮𝗸𝗶𝗻𝗴 𝗰𝗵𝗮𝘁𝘁𝗶𝗻𝗴 𝘄𝗶𝘁𝗵 𝘀𝗰𝗶𝗲𝗻𝘁𝗶𝗳𝗶𝗰 𝗽𝗮𝗽𝗲𝗿𝘀 𝗲𝗮𝘀𝗶𝗲𝗿.

𝐇𝐞𝐫𝐞 𝐢𝐬 𝐰𝐡𝐚𝐭 𝐭𝐡𝐞 𝐚𝐩𝐩 𝐝𝐨𝐞𝐬:

📄 Parses the papers that you upload thanks to LlamaIndex🦙 (either with LlamaParse or with simpler, local methods)
📄 Embeds documents both with a sparse and with a dense encoder to enable hybrid search
📄 Uploads the embeddings to Qdrant
⚙️ Activates an Agent based on mistralai/Mistral-Small-24B-Instruct-2501 that will reply to your prompt
🧠 Retrieves information relevant to your question from the documents
🧠 If no relevant information is found, it searches PubMed and arXiv databases
🧠 Returns a grounded answer to your prompt
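
The retrieve-then-fall-back part of the list above can be sketched as follows. This is not the PapersChat code: the function names are hypothetical, and the two search callables are stubs standing in for the Qdrant retrieval and the PubMed/arXiv search.

```python
from typing import Callable, List


def gather_context(
    question: str,
    search_documents: Callable[[str], List[str]],
    search_literature: Callable[[str], List[str]],
) -> List[str]:
    """Prefer passages from the uploaded papers; fall back to external search."""
    passages = search_documents(question)
    if passages:
        return passages
    return search_literature(question)


# Hypothetical stubs, used only to show the control flow.
def local_search(question: str) -> List[str]:
    return ["passage from an uploaded paper"] if "protein" in question else []


def external_search(question: str) -> List[str]:
    return ["abstract found through an external database"]


print(gather_context("What does the protein bind to?", local_search, external_search))
print(gather_context("What is dark matter?", local_search, external_search))
```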

𝐇𝐨𝐰 𝐝𝐢𝐝 𝐈 𝐦𝐚𝐧𝐚𝐠𝐞 𝐭𝐨 𝐦𝐚𝐤𝐞 𝐭𝐡𝐢𝐬 𝐚𝐩𝐩𝐥𝐢𝐜𝐚𝐭𝐢𝐨𝐧 𝐢𝐧 𝟖 𝐡𝐨𝐮𝐫𝐬?

Three key points:

- LlamaIndex🦙 provides countless integrations with LLM providers, text embedding models and vectorstore services, and takes care of the internal architecture of the Agent. You just plug it in, and it works!🔌⚡
- Qdrant is a vector database service that is extremely easy to set up and use: you just need a one-line Docker command😉
- Gradio makes frontend development painless and fast, while still providing modern and responsive interfaces🏗️

And a bonus point:

- Deploying the demo app couldn't be easier if you use Gradio-based Hugging Face Spaces🤗

So, no more excuses: build your own AI agent today and do it fast, (almost) for free and effortlessly🚀

And if you need a starting point, the code for PapersChat is open and fully reproducible on GitHub 👉 https://github.com/AstraBert/PapersChat

updated a Space 19 days ago

PapersChat

📚

Chatting with scientific papers made easy

liked a Space 19 days ago

PapersChat

📚

Chatting with scientific papers made easy

published a Space 19 days ago

PapersChat

📚

Chatting with scientific papers made easy

posted an update 22 days ago

𝐒𝐜𝐢𝐍𝐞𝐰𝐬𝐁𝐨𝐭 - 𝐑𝐞𝐩𝐨𝐫𝐭 𝐝𝐚𝐢𝐥𝐲 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐧𝐞𝐰𝐬 𝐨𝐧 𝐁𝐥𝐮𝐞𝐒𝐤𝐲

GitHub 👉 https://github.com/AstraBert/SciNewsBot
BlueSky 👉 https://bsky.app/profile/sci-news-bot.bsky.social

Hi there HF Community!🤗
I just created a very simple AI-powered bot that shares fact-checked news about Science, Environment, Energy and Technology on BlueSky :)

The bot takes news from Google News, filters out the sources that are not represented in the Media Bias Fact Check database, and then evaluates the reliability of the source based on the MBFC metrics. After that, it creates a catchy headline for the article and publishes the post on BlueSky📰
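
The source-filtering step can be sketched like this. It is not the SciNewsBot code: the ratings table, the domains in it and the accepted rating labels are made up for illustration and only stand in for the Media Bias Fact Check data.

```python
from typing import Dict, List
from urllib.parse import urlparse

# Hypothetical ratings table standing in for the MBFC database.
RATINGS: Dict[str, str] = {
    "example-science.org": "high",
    "example-tabloid.com": "low",
}
ACCEPTED = {"high", "very high"}  # assumption: which ratings count as reliable


def reliable_articles(urls: List[str]) -> List[str]:
    """Keep only articles whose source is rated, and rated as reliable."""
    kept = []
    for url in urls:
        domain = urlparse(url).netloc.lower()
        if domain.startswith("www."):
            domain = domain[4:]
        rating = RATINGS.get(domain)
        if rating is None:
            continue  # source not represented in the database: filtered out
        if rating in ACCEPTED:
            kept.append(url)
    return kept


articles = [
    "https://www.example-science.org/new-exoplanet",
    "https://example-tabloid.com/miracle-cure",
    "https://unknown-blog.net/post",
]
print(reliable_articles(articles))
```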

The cool thing? SciNewsBot is open-source and cheap to maintain, as it is based on mistralai/Mistral-Small-24B-Instruct-2501 (via Mistral API). You can reproduce it locally, spinning it up on your machine, and even launch it in the cloud through a comfy Docker setup🐋

Have fun and spread Science!✨

posted an update 25 days ago

𝐏𝐡𝐢𝐐𝐰𝐞𝐧𝐒𝐓𝐄𝐌 - 𝐚 𝐫𝐞𝐚𝐬𝐨𝐧𝐢𝐧𝐠 𝐚𝐬𝐬𝐢𝐬𝐭𝐚𝐧𝐭 𝐟𝐨𝐫 𝐲𝐨𝐮𝐫 𝐒𝐓𝐄𝐌 𝐞𝐝𝐮𝐜𝐚𝐭𝐢𝐨𝐧

Demo 👉 https://pqstem.org
GitHub 👉 https://github.com/AstraBert/PhiQwenSTEM

Hello HF community!🤗
Ever struggled with some complex Maths problem or with a very hard Physics question? Well, fear no more, because now you can rely on PhiQwenSTEM, an assistant specialized in answering STEM-related questions!
The assistant can count on a knowledge base of 𝟭𝟱𝗸+ 𝘀𝗲𝗹𝗲𝗰𝘁𝗲𝗱 𝗦𝗧𝗘𝗠 𝗾𝘂𝗲𝘀𝘁𝗶𝗼𝗻-𝗮𝗻𝘀𝘄𝗲𝗿 𝗽𝗮𝗶𝗿𝘀 spanning the domains of Chemistry, Physics, Mathematics and Biochemistry (from EricLu/SCP-116K). It also relies on the combined power of microsoft/Phi-3.5-mini-instruct and Qwen/QwQ-32B-Preview to produce reliable and reasoned answers.
For the next 30 days, you will be able to try the web demo for free: https://pqstem.org
In the GitHub repo you can find all the information to reproduce PhiQwenSTEM 𝗼𝗻 𝘆𝗼𝘂𝗿 𝗹𝗼𝗰𝗮𝗹 𝗺𝗮𝗰𝗵𝗶𝗻𝗲, 𝗯𝗼𝘁𝗵 𝘃𝗶𝗮 𝘀𝗼𝘂𝗿𝗰𝗲 𝗰𝗼𝗱𝗲 𝗮𝗻𝗱 𝘄𝗶𝘁𝗵 𝗮 𝗰𝗼𝗺𝗳𝘆 𝗗𝗼𝗰𝗸𝗲𝗿🐋 𝘀𝗲𝘁𝘂𝗽: https://github.com/AstraBert/PhiQwenSTEM

liked a dataset about 1 month ago

EricLu/SCP-116K

Viewer • Updated 30 days ago • 117k • 959 • 69

upvoted an article about 1 month ago

Article

Why we (don't) need export control

Feb 1 • 8

posted an update about 1 month ago

Hi Hugging Face community!🤗

I just published an article in which I try to articulate some counter-points to Dario Amodei's post "On DeepSeek and Export Controls"👉 https://huggingface.co./blog/as-cle-bert/why-we-dont-need-export-control

I try to address several key passages of the third section of Amodei's post (https://darioamodei.com/on-deepseek-and-export-controls), bringing my perspective on the importance of open source, open knowledge and multipolarity in a field as crucial for our future as Artificial Intelligence.

Happy reading!✨