Clelia Astra Bertelli's picture

Clelia Astra Bertelli

as-cle-bert

AI & ML interests

Biology + Artificial Intelligence = โค๏ธ | AI for sustainable development, sustainable development for AI | Researching on Machine Learning Enhancement | I love automation for everyday things | Blogger | Open Source

Recent Activity

posted an update 2 days ago
I just released a fully automated evaluation framework for your RAG applications!๐Ÿ“ˆ GitHub ๐Ÿ‘‰ https://github.com/AstraBert/diRAGnosis PyPi ๐Ÿ‘‰ https://pypi.org/project/diragnosis/ It's called ๐๐ข๐‘๐€๐†๐ง๐จ๐ฌ๐ข๐ฌ and is a lightweight framework that helps you ๐—ฑ๐—ถ๐—ฎ๐—ด๐—ป๐—ผ๐˜€๐—ฒ ๐˜๐—ต๐—ฒ ๐—ฝ๐—ฒ๐—ฟ๐—ณ๐—ผ๐—ฟ๐—บ๐—ฎ๐—ป๐—ฐ๐—ฒ ๐—ผ๐—ณ ๐—Ÿ๐—Ÿ๐— ๐˜€ ๐—ฎ๐—ป๐—ฑ ๐—ฟ๐—ฒ๐˜๐—ฟ๐—ถ๐—ฒ๐˜ƒ๐—ฎ๐—น ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐—ถ๐—ป ๐—ฅ๐—”๐—š ๐—ฎ๐—ฝ๐—ฝ๐—น๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป๐˜€. You can launch it as an application locally (it's Docker-ready!๐Ÿ‹) or, if you want more flexibility, you can integrate it in your code as a python package๐Ÿ“ฆ The workflow is simple: ๐Ÿง  You choose your favorite LLM provider and model (supported, for now, are Mistral AI, Groq, Anthropic, OpenAI and Cohere) ๐Ÿง  You pick the embedding models provider and the embedding model you prefer (supported, for now, are Mistral AI, Hugging Face, Cohere and OpenAI) ๐Ÿ“„ You prepare and provide your documents โš™๏ธ Documents are ingested into a Qdrant vector database and transformed into a synthetic question dataset with the help of LlamaIndex ๐Ÿ“Š The LLM is evaluated for the faithfulness and relevancy of its retrieval-augmented answer to the questions ๐Ÿ“Š The embedding model is evaluated for hit rate and mean reciprocal ranking (MRR) of the retrieved documents And the cool thing is that all of this is ๐—ถ๐—ป๐˜๐˜‚๐—ถ๐˜๐—ถ๐˜ƒ๐—ฒ ๐—ฎ๐—ป๐—ฑ ๐—ฐ๐—ผ๐—บ๐—ฝ๐—น๐—ฒ๐˜๐—ฒ๐—น๐˜† ๐—ฎ๐˜‚๐˜๐—ผ๐—บ๐—ฎ๐˜๐—ฒ๐—ฑ: you plug it in, and it works!๐Ÿ”Œโšก Even cooler? This is all built on top of LlamaIndex and its integrations: no need for tons of dependencies or fancy workarounds๐Ÿฆ™ And if you're a UI lover, Gradio and FastAPI are there to provide you a seamless backend-to-frontend experience๐Ÿ•ถ๏ธ So now it's your turn: you can either get diRAGnosis from GitHub ๐Ÿ‘‰ https://github.com/AstraBert/diRAGnosis or just run a quick and painless: ```bash uv pip install diragnosis ``` To get the package installed (lightning-fast) in your environment๐Ÿƒโ€โ™€๏ธ Have fun and feel free to leave feedback and feature/integrations requests on GitHub issuesโœจ
commented on their article 3 days ago
streamlit_supabase_auth_ui
commented on their article 16 days ago
streamlit_supabase_auth_ui
View all activity

Organizations

Social Post Explorers's profile picture Hugging Face Discord Community's profile picture GreenFit AI's profile picture

as-cle-bert's activity

posted an update 2 days ago
view post
Post
2387
I just released a fully automated evaluation framework for your RAG applications!๐Ÿ“ˆ

GitHub ๐Ÿ‘‰ https://github.com/AstraBert/diRAGnosis
PyPi ๐Ÿ‘‰ https://pypi.org/project/diragnosis/

It's called ๐๐ข๐‘๐€๐†๐ง๐จ๐ฌ๐ข๐ฌ and is a lightweight framework that helps you ๐—ฑ๐—ถ๐—ฎ๐—ด๐—ป๐—ผ๐˜€๐—ฒ ๐˜๐—ต๐—ฒ ๐—ฝ๐—ฒ๐—ฟ๐—ณ๐—ผ๐—ฟ๐—บ๐—ฎ๐—ป๐—ฐ๐—ฒ ๐—ผ๐—ณ ๐—Ÿ๐—Ÿ๐— ๐˜€ ๐—ฎ๐—ป๐—ฑ ๐—ฟ๐—ฒ๐˜๐—ฟ๐—ถ๐—ฒ๐˜ƒ๐—ฎ๐—น ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐—ถ๐—ป ๐—ฅ๐—”๐—š ๐—ฎ๐—ฝ๐—ฝ๐—น๐—ถ๐—ฐ๐—ฎ๐˜๐—ถ๐—ผ๐—ป๐˜€.

You can launch it as an application locally (it's Docker-ready!๐Ÿ‹) or, if you want more flexibility, you can integrate it in your code as a python package๐Ÿ“ฆ

The workflow is simple:
๐Ÿง  You choose your favorite LLM provider and model (supported, for now, are Mistral AI, Groq, Anthropic, OpenAI and Cohere)
๐Ÿง  You pick the embedding models provider and the embedding model you prefer (supported, for now, are Mistral AI, Hugging Face, Cohere and OpenAI)
๐Ÿ“„ You prepare and provide your documents
โš™๏ธ Documents are ingested into a Qdrant vector database and transformed into a synthetic question dataset with the help of LlamaIndex
๐Ÿ“Š The LLM is evaluated for the faithfulness and relevancy of its retrieval-augmented answer to the questions
๐Ÿ“Š The embedding model is evaluated for hit rate and mean reciprocal ranking (MRR) of the retrieved documents

And the cool thing is that all of this is ๐—ถ๐—ป๐˜๐˜‚๐—ถ๐˜๐—ถ๐˜ƒ๐—ฒ ๐—ฎ๐—ป๐—ฑ ๐—ฐ๐—ผ๐—บ๐—ฝ๐—น๐—ฒ๐˜๐—ฒ๐—น๐˜† ๐—ฎ๐˜‚๐˜๐—ผ๐—บ๐—ฎ๐˜๐—ฒ๐—ฑ: you plug it in, and it works!๐Ÿ”Œโšก

Even cooler? This is all built on top of LlamaIndex and its integrations: no need for tons of dependencies or fancy workarounds๐Ÿฆ™
And if you're a UI lover, Gradio and FastAPI are there to provide you a seamless backend-to-frontend experience๐Ÿ•ถ๏ธ

So now it's your turn: you can either get diRAGnosis from GitHub ๐Ÿ‘‰ https://github.com/AstraBert/diRAGnosis
or just run a quick and painless:

uv pip install diragnosis


To get the package installed (lightning-fast) in your environment๐Ÿƒโ€โ™€๏ธ

Have fun and feel free to leave feedback and feature/integrations requests on GitHub issuesโœจ
commented on streamlit_supabase_auth_ui 3 days ago
view reply

Hi there, just wanted to reach out also here, so that if people see our conversation know that this feature has been integrated: you can now find it in the v0.1.0 of the package, already installable via pip.
Have fun!

commented on streamlit_supabase_auth_ui 16 days ago
view reply

I did not specify any configuration, but I'm pretty sure we could play around with Supabase and set a login/logout status for the user (like saying: the user last logged in at time X and logged out at time Y; if Y > X, then the user can login in again, else they cannot).
If you want, I can put it in the roadmap for the next release of the package: then I would ask you to open an issue here: https://github.com/AstraBert/streamlit_supabase_auth_ui/issues so that I can add it to the milestone for v0.1.0 :)

commented on streamlit_supabase_auth_ui 16 days ago
view reply

I could not find it either back in the days, when I wanted to suppress it, but my suspicion is that is linked to some not-so-up-to-date portions of the code (the code is based on a repo that used Streamlit 1.34, I believe). Nevertheless, what I did in my personal projects was suppressing all the warnings with:

from warnings import filterwarnings
filterwarnings(action="ignore")
# source -> https://www.geeksforgeeks.org/how-to-disable-python-warnings/

Hope this helps!

commented on streamlit_supabase_auth_ui 17 days ago
posted an update 18 days ago
view post
Post
2365
I built an AI agent app in less than 8 hours๐Ÿคฏ
And, believe me, this is ๐—ป๐—ผ๐˜ clickbaitโŒ

GitHub ๐Ÿ‘‰ https://github.com/AstraBert/PapersChat
Demo ๐Ÿ‘‰ as-cle-bert/PapersChat

The app is called ๐๐š๐ฉ๐ž๐ซ๐ฌ๐‚๐ก๐š๐ญ, and it is aimed at ๐—บ๐—ฎ๐—ธ๐—ถ๐—ป๐—ด ๐—ฐ๐—ต๐—ฎ๐˜๐˜๐—ถ๐—ป๐—ด ๐˜„๐—ถ๐˜๐—ต ๐˜€๐—ฐ๐—ถ๐—ฒ๐—ป๐˜๐—ถ๐—ณ๐—ถ๐—ฐ ๐—ฝ๐—ฎ๐—ฝ๐—ฒ๐—ฟ๐˜€ ๐—ฒ๐—ฎ๐˜€๐—ถ๐—ฒ๐—ฟ.

๐‡๐ž๐ซ๐ž ๐ข๐ฌ ๐ฐ๐ก๐š๐ญ ๐ญ๐ก๐ž ๐š๐ฉ๐ฉ ๐๐จ๐ž๐ฌ:

๐Ÿ“„ Parses the papers that you upload thanks to LlamaIndex๐Ÿฆ™ (either with LlamaParse or with simpler, local methods)
๐Ÿ“„ Embeds documents both with a sparse and with a dense encoder to enable hybrid search
๐Ÿ“„ Uploads the embeddings to Qdrant
โš™๏ธ Activates an Agent based on mistralai/Mistral-Small-24B-Instruct-2501 that will reply to your prompt
๐Ÿง  Retrieves information relevant to your question from the documents
๐Ÿง  If no relevant information is found, it searches PubMed and arXiv databases
๐Ÿง  Returns a grounded answer to your prompt

๐‡๐จ๐ฐ ๐๐ข๐ ๐ˆ ๐ฆ๐š๐ง๐š๐ ๐ž ๐ญ๐จ ๐ฆ๐š๐ค๐ž ๐ญ๐ก๐ข๐ฌ ๐š๐ฉ๐ฉ๐ฅ๐ข๐œ๐š๐ญ๐ข๐จ๐ง ๐ข๐ง ๐Ÿ– ๐ก๐จ๐ฎ๐ซ๐ฌ?

Three key points:

- LlamaIndex๐Ÿฆ™ provides countless integrations with LLM providers, text embedding models and vectorstore services, and takes care of the internal architecture of the Agent. You just plug it in, and it works!๐Ÿ”Œโšก
- Qdrant is a vector database service extremely easy to set up and use: you just need a one-line Docker command๐Ÿ˜‰
- Gradio makes frontend development painless and fast, while still providing modern and responsive interfaces๐Ÿ—๏ธ

And a bonus point:

- Deploying the demo app couldn't be easier if you use Gradio-based Hugging Face Spaces๐Ÿค—

So, no more excuses: build your own AI agent today and do it fast, (almost) for free and effortlessly๐Ÿš€

And if you need a starting point, the code for PapersChat is open and fully reproducible on GitHub ๐Ÿ‘‰ https://github.com/AstraBert/PapersChat
posted an update 22 days ago
view post
Post
1388
๐’๐œ๐ข๐๐ž๐ฐ๐ฌ๐๐จ๐ญ - ๐‘๐ž๐ฉ๐จ๐ซ๐ญ ๐๐š๐ข๐ฅ๐ฒ ๐’๐œ๐ข๐ž๐ง๐œ๐ž ๐ง๐ž๐ฐ๐ฌ ๐จ๐ง ๐๐ฅ๐ฎ๐ž๐’๐ค๐ฒ

GitHub ๐Ÿ‘‰ https://github.com/AstraBert/SciNewsBot
BlueSky ๐Ÿ‘‰ https://bsky.app/profile/sci-news-bot.bsky.social

Hi there HF Community!๐Ÿค—
I just created a very simple AI-powered bot that shares fact-checked news about Science, Environment, Energy and Technology on BlueSky :)

The bot takes news from Google News, filters out the sources that are not represented in the Media Bias Fact Check database, and then evaluates the reliability of the source based on the MBFC metrics. After that, it creates a catchy headline for the article and publishes the post on BlueSky๐Ÿ“ฐ

The cool thing? SciNewsBot is open-source and is cheap to maintain, as it is based on mistralai/Mistral-Small-24B-Instruct-2501 (via Mistral API). You can reproduce it locally, spinning it up on your machine, and even launch it on cloud through a comfy Docker setup๐Ÿ‹

Have fun and spread Science!โœจ
posted an update 25 days ago
view post
Post
2752
๐๐ก๐ข๐๐ฐ๐ž๐ง๐’๐“๐„๐Œ - ๐š ๐ซ๐ž๐š๐ฌ๐จ๐ง๐ข๐ง๐  ๐š๐ฌ๐ฌ๐ข๐ฌ๐ญ๐š๐ง๐ญ ๐Ÿ๐จ๐ซ ๐ฒ๐จ๐ฎ๐ซ ๐’๐“๐„๐Œ ๐ž๐๐ฎ๐œ๐š๐ญ๐ข๐จ๐ง

Demo ๐Ÿ‘‰ https://pqstem.org
GitHub ๐Ÿ‘‰ https://github.com/AstraBert/PhiQwenSTEM

Hello HF community!๐Ÿค—
Ever struggled with some complex Maths problem or with a very hard Physics question? Well, fear no more, because now you can rely on PhiQwenSTEM, an assistant specialized in answering STEM-related question!
The assistant can count on a knowledge base of ๐Ÿญ๐Ÿฑ๐—ธ+ ๐˜€๐—ฒ๐—น๐—ฒ๐—ฐ๐˜๐—ฒ๐—ฑ ๐—ฆ๐—ง๐—˜๐—  ๐—พ๐˜‚๐—ฒ๐˜€๐˜๐—ถ๐—ผ๐—ป-๐—ฎ๐—ป๐˜€๐˜„๐—ฒ๐—ฟ ๐—ฝ๐—ฎ๐—ถ๐—ฟ๐˜€ spanning the domains of Chemistry, Physics, Matemathics and Biochemistry (from EricLu/SCP-116K). It also relies on the combined power of microsoft/Phi-3.5-mini-instruct and Qwen/QwQ-32B-Preview to produce reliable and reasoned answers.
For the next 30 days, you will be able to try for free the web demo: https://pqstem.org
In the GitHub repo you can find all the information to reproduce PhiQwenSTEM ๐—ผ๐—ป ๐˜†๐—ผ๐˜‚๐—ฟ ๐—น๐—ผ๐—ฐ๐—ฎ๐—น ๐—บ๐—ฎ๐—ฐ๐—ต๐—ถ๐—ป๐—ฒ, ๐—ฏ๐—ผ๐˜๐—ต ๐˜ƒ๐—ถ๐—ฎ ๐˜€๐—ผ๐˜‚๐—ฟ๐—ฐ๐—ฒ ๐—ฐ๐—ผ๐—ฑ๐—ฒ ๐—ฎ๐—ป๐—ฑ ๐˜„๐—ถ๐˜๐—ต ๐—ฎ ๐—ฐ๐—ผ๐—บ๐—ณ๐˜† ๐——๐—ผ๐—ฐ๐—ธ๐—ฒ๐—ฟ๐Ÿ‹ ๐˜€๐—ฒ๐˜๐˜‚๐—ฝ: https://github.com/AstraBert/PhiQwenSTEM
upvoted an article about 1 month ago
posted an update about 1 month ago
view post
Post
1033
Hi HuggingFace community!๐Ÿค—

I just published an article in which I try to articulate some counter-points to Dario Amodei's post "On DeepSeek and Export Control"๐Ÿ‘‰ https://huggingface.co./blog/as-cle-bert/why-we-dont-need-export-control

I try to address several key passages of the third section from Amodei's post (https://darioamodei.com/on-deepseek-and-export-controls), bringing my perspective on the importance of open source, open knowledge and multipolarity in a crucial field for our future such as Artificial Intelligence.

Happy reading!โœจ
published an article about 1 month ago