AI & ML interests

Critical Thinking 🤝 Generative AI

Recent Activity

ggbetz  updated a Space 2 days ago
logikon/open_cot_leaderboard
ggbetz  updated a Space 4 months ago
logikon/open_cot_leaderboard
ggbetz  updated a collection 4 months ago
Critical Thinking 🤝 LLMs Papers
View all activity

Articles

logikon's activity

ggbetz 
posted an update 24 days ago
ggbetz 
posted an update 6 months ago
view post
Post
1496
Hi, just a brief follow-up on our Guided Reasoning (GuiR) system:

I've created a template space that facilitates testing:

1. Duplicate space logikon/guir-chat
2. Setup your own inference servers and provide details in config file
3. Add api keys as secrets
4. Your personal GuiR playground is ready

Cheers, Gregor
ggbetz 
posted an update 6 months ago
view post
Post
1202
🧭 Guided Reasoning

👋Hi everyone,

We've been releasing Guided Reasoning:

Our AI guides walk your favorite LLM through complex reasoning problems.

🎯 Goals:

1️⃣ Reliability. AIs consistently follow reasoning methods.
2️⃣ Self-explainability. AIs see reasoning protocols and can explain internal deliberation.
3️⃣ Contestability. Users may amend AI reasoning and revise plausibility assessments.

Try out Guided Reasoning with our light demo chatbot, powered by 🤗 HuggingFace's free Inference Api and small LLMs. (Sorry for poor latency and limited availability -- we are currently searching for 💸 compute sponsors to run more powerful models, faster, and optimize guided reasoning performance.)

Built on top of Logikon's open-source AI reasoning analytics.

Demo chat app: logikon/benjamin-chat
Github: https://github.com/logikon-ai/logikon
Technical report: https://arxiv.org/abs/2408.16331

➡️ Check it out and get involved! Looking forward to hearing from you.
ggbetz 
posted an update 11 months ago
view post
Post
1446
🥇Open CoT Leaderboard

We're delighted to announce the [Open CoT Leaderboard]( logikon/open_cot_leaderboard) on 🤗 Spaces.

Unlike other LLM performance leaderboards, the Open CoT Leaderboard is not tracking absolute benchmark accuracies, but relative **accuracy gains** due to **chain-of-thought**.

Eval datasets that underpin the leaderboard are hosted [here](https://huggingface.co./cot-leaderboard).

Feedback and suggestions more than welcome.

@clefourrier
·