Joseph [open/acc] Pollack

Tonic

AI & ML interests

🤖Making robots to help people learn things quicker 👩🏻‍🚀🚀

Recent Activity

updated a Space about 10 hours ago
fffiloni/Sa2VA-simple-demo
liked a Space about 10 hours ago
fffiloni/Sa2VA-simple-demo

Organizations

MISATO-dataset, Masakhane NLP, LangChain Agents Hub, LangChain Chains Hub, BigScience Biomedical Datasets, LangChainDatasets, OpenVINO Toolkit, Gradio-Blocks-Party, DeepGHS, The introspector project, Pseudo Lab, LangChain Hub Prompts, The Waifu Research Department, Blog-explorers, Tonic AI, OpenLLM France, Multi🤖Transformers, Qwen, Team Tonic, That Time I got Reincarnated as a Hugging Face Organization, ZeroGPU Explorers, SaprotHub, The Hydra Project, Copyleft Cultivars, Argilla Explorers, the collabage patch, Social Post Explorers, C4AI Community, AIffl : AI For French Language, M4-ai, takara.ai, Dev Mode Explorers, Quasar Research, Chinese LLMs on Hugging Face, Hugging Face for Legal, Hugging Face Discord Community, Seq-to-Pheno, Data Tonic (Alignment Lab), Nerdy Face, Intelligent Estate, open/ acc, Mistral AI Game Jam, La Mousse

Tonic's activity

New activity in fffiloni/Sa2VA-simple-demo about 10 hours ago

fixes 500 error for some users

#1 opened about 10 hours ago by
Tonic
reacted to chansung's post with 👍 about 20 hours ago
A simple summary of "Evolving Deeper LLM Thinking" (Google DeepMind)

The process starts by posing a question.
1) The LLM generates initial responses.
2) The generated responses are evaluated against specific criteria by a program-based checker.
3) The LLM critiques the evaluated results.
4) The LLM refines the responses based on the evaluation, critique, and original responses.

The refined response is then fed back into step 2). If it meets the criteria, the process ends. Otherwise, the algorithm generates more responses based on the refined ones (some are discarded, some are kept, and some may be merged).
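The loop described above can be sketched in a few lines of Python. This is only a toy illustration under loud assumptions, not the paper's implementation: there is no real LLM here, and `evolve`, `Evaluation`, and all the `toy_*` functions are hypothetical names invented for this example. The "responses" are plain integers, the program-based checker tests whether a candidate equals a target number, and the merge step is left out for brevity.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Optional, Tuple


@dataclass
class Evaluation:
    passed: bool    # did the candidate satisfy the checker?
    score: float    # higher is better; used to decide which candidates survive
    feedback: Any   # checker output handed to the critique step


def evolve(
    generate: Callable[[], List[Any]],
    check: Callable[[Any], Evaluation],
    critique: Callable[[Any, Evaluation], Any],
    refine: Callable[[Any, Evaluation, Any], Any],
    keep: int = 2,
    max_rounds: int = 20,
) -> Optional[Tuple[Any, int]]:
    """Return (solution, round_index), or None if no candidate passes in time."""
    population = generate()                                   # step 1
    for round_index in range(max_rounds):
        evaluated = [(cand, check(cand)) for cand in population]  # step 2
        for cand, result in evaluated:
            if result.passed:
                return cand, round_index                      # criteria met: stop
        # Discard the weakest candidates and keep the best `keep` of them.
        evaluated.sort(key=lambda pair: pair[1].score, reverse=True)
        survivors = evaluated[:keep]
        # Steps 3 and 4: critique each survivor, then refine it.
        population = [
            refine(cand, result, critique(cand, result))
            for cand, result in survivors
        ]
    return None


# --- Toy stand-ins for the LLM calls (all hypothetical) ---
TARGET = 42


def toy_generate() -> List[int]:
    return [0, 10, 100, 70]


def toy_check(candidate: int) -> Evaluation:
    error = TARGET - candidate
    return Evaluation(passed=error == 0, score=-abs(error), feedback=error)


def toy_critique(candidate: int, result: Evaluation) -> str:
    return "raise" if result.feedback > 0 else "lower"


def toy_refine(candidate: int, result: Evaluation, advice: str) -> int:
    # Move at least 1 and at most half the remaining error, so no overshoot.
    step = max(1, abs(result.feedback) // 2)
    return candidate + step if advice == "raise" else candidate - step


if __name__ == "__main__":
    print(evolve(toy_generate, toy_check, toy_critique, toy_refine))
```

In a real setup, each of `generate`, `critique`, and `refine` would be an LLM call, which is where the large number of API calls comes from.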

With this process, the method performed very well on complex scheduling problems (travel planning, meeting scheduling, etc.). It is a viable way to find highly effective solutions in specific scenarios.

However, there are two major drawbacks:
🤔 It requires an excessive number of API calls. (The cost might not be very high, but the latency is significant.)
🤔 The evaluator is program-based. (This limits its use as a general method. It could potentially be reimplemented with an LLM as the judge, but that would add API costs for evaluation.)

https://arxiv.org/abs/2501.09891
liked a Space 2 days ago