Nathan Habib

SaylorTwift

AI & ML interests

None yet

Recent Activity

Articles

Organizations

Hugging Face's profile picture Evaluation datasets's profile picture HuggingFaceGECLM's profile picture BigCode's profile picture Hugging Face H4's profile picture BigCode Data's profile picture InternLM's profile picture Hugging Face Smol Cluster's profile picture Open LLM Leaderboard's profile picture huggingPartyParis's profile picture Qwen's profile picture gg-hf's profile picture Nanotron Research's profile picture HuggingFaceFW's profile picture HF-contamination-detection's profile picture Top Contributors: Dataset Downloads's profile picture hsramall's profile picture La Leaderboard's profile picture gg-tt's profile picture HuggingFaceEval's profile picture Novel Challenge's profile picture LLHF's profile picture SLLHF's profile picture lbhf's profile picture Lighteval testing org's profile picture open-llm-leaderboard-react's profile picture Prompt Leaderboard's profile picture

SaylorTwift's activity

reacted to elliesleightholm's post with 🤗 about 1 month ago
posted an update about 1 month ago
reacted to Symbol-LLM's post with 🔥 about 1 month ago
view post
Post
945
🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning !

📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection

🔗 Link: Vision-Language Models Can Self-Improve Reasoning via Reflection (2411.00855)

😇Takeaways:

- We found that VLMs can self-improve reasoning performance through a reflection mechanism, and importantly, this approach can scale through test-time computing.

- Evaluation on comprehensive and diverse Vision-Language reasoning tasks are included !
reacted to cfahlgren1's post with ❤️ about 1 month ago
view post
Post
3086
You can clean and format datasets entirely in the browser with a few lines of SQL.

In this post, I replicate the process @mlabonne used to clean the new microsoft/orca-agentinstruct-1M-v1 dataset.

The cleaning process consists of:
- Joining the separate splits together / add split column
- Converting string messages into list of structs
- Removing empty system prompts

https://huggingface.co./blog/cfahlgren1/the-beginners-guide-to-cleaning-a-dataset

Here's his new cleaned dataset: mlabonne/orca-agentinstruct-1M-v1-cleaned
  • 1 reply
·