Pierre Dupont's picture
2 2

Pierre Dupont

Hous94
Β·

AI & ML interests

None yet

Recent Activity

Organizations

hackathon team's profile picture

Hous94's activity

published a Space 17 days ago
updated a Space 17 days ago
upvoted 2 articles 6 months ago
view article
Article

Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap

β€’ 11
view article
Article

The 5 Most Under-Rated Tools on Hugging Face

β€’ 86
reacted to davidberenstein1957's post with πŸ‘€ 7 months ago
view post
Post
1769
πŸ“£ Introducing Dataset Viber: your chill repo for data collection, annotation and vibe checks! πŸŽ‰

I've cooked up Dataset Viber, a set of cool tools designed to make data preparation for AI models easier, more approachable and enjoyable for standalone AI engineers and enthusiasts.

πŸ”§ What Dataset Viber offers:
- CollectorInterface: Lazily collect model interaction data without human annotation
- AnnotatorInterface: Annotate your data with models in the loop
- BulkInterface: Explore data distribution and annotate in bulk
- Embedder: Efficiently embed data with ONNX-optimized speeds

🎯 Key features:
- Supports various tasks for text, chat, and image modalities
- Runs in .ipynb notebooks
- Logs data to local CSV or directly to Hugging Face Hub
- Easy to install via pip: pip install dataset-viber

It's not designed for team collaboration or production use, but rather as a fun and efficient toolkit for individual projects.

Want to give it a try? Check out the repository link https://github.com/davidberenstein1957/dataset-viber/.

I'm excited to hear your feedback and learn how you vibe with your data. Feel free to open an issue or reach out if you have any questions or suggestions!

Some shoutouts:
- Gradio for the amazing backbone
- Daniel van Strien for some initial presentations I did on vibe checks
- Emily Omier for the workshop on structuring GitHub repo READMEs
- Hamel Husain for keeping mentioning that people should look at their data.
- Philipp Schmid for his code for ONNX feature-extractors
- Ben Burtenshaw for the first PR
  • 1 reply
Β·