victor (Victor Mustar)

liked a model about 18 hours ago

CohereForAI/aya-vision-8b

Image-Text-to-Text • Updated 7 days ago • 144k • 233

liked a model about 21 hours ago

motimalu/wan-flat-color-v2

Text-to-Image • Updated about 1 hour ago • 9 • 9

liked 4 Spaces about 23 hours ago

39

Stable Diffusion 3.5 Large TurboX

🏃

SD3.5 in 8-steps with TensorArt TurboX

23

Tight Inversion Pulid Demo

🐨

Generate high-quality edited images from portrait prompts and IDs

25

Skyreels A1 Talking Head

😻

Audio to Talking Face

130

Spark TTS

🌖

A text-to-speech model powered by SparkAudio and Mobvoi.

upvoted a collection 1 day ago

YuLan-Mini

Collection

A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 18 days ago • 15

liked 6 models 4 days ago

reacted to AdinaY's post with 👀 5 days ago

Post

1623

Qilin 🔥a large scale multimodal dataset for search, recommendation and RAG research, released by Xiaohongshu & Tsinghua University

Dataset: THUIR/Qilin
Paper: Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions (2503.00501)

✨Multiple content modalities (text, images, video thumbnails)
✨Rich user interaction data ( from Xiaohongshu’s 300M+ MAUs, 70%+ search penetration)
✨Comprehensive evaluation metrics
✨Support for RAG system development

reacted to albertvillanova's post with 🔥 5 days ago

Post

3695

🚀 Big news for AI agents! With the latest release of smolagents, you can now securely execute Python code in sandboxed Docker or E2B environments. 🦾🔒

Here's why this is a game-changer for agent-based systems: 🧵👇

1️⃣ Security First 🔐
Running AI agents in unrestricted Python environments is risky! With sandboxing, your agents are isolated, preventing unintended file access, network abuse, or system modifications.

2️⃣ Deterministic & Reproducible Runs 📦
By running agents in containerized environments, you ensure that every execution happens in a controlled and predictable setting—no more environment mismatches or dependency issues!

3️⃣ Resource Control & Limits 🚦
Docker and E2B allow you to enforce CPU, memory, and execution time limits, so rogue or inefficient agents don’t spiral out of control.

4️⃣ Safer Code Execution in Production 🏭
Deploy AI agents confidently, knowing that any generated code runs in an ephemeral, isolated environment, protecting your host machine and infrastructure.

5️⃣ Easy to Integrate 🛠️
With smolagents, you can simply configure your agent to use Docker or E2B as its execution backend—no need for complex security setups!

6️⃣ Perfect for Autonomous AI Agents 🤖
If your AI agents generate and execute code dynamically, this is a must-have to avoid security pitfalls while enabling advanced automation.

⚡ Get started now: https://github.com/huggingface/smolagents

What will you build with smolagents? Let us know! 🚀💡

reacted to Yehor's post with 👍 5 days ago

Post

2804

Published a stable version of Ukrainian Text-to-Speech library on GitHub and PyPI.

Features:

- Multi-speaker model: 2 female (Tetiana, Lada) + 1 male (Mykyta) voices;
- Fine-grained control over speech parameters, including duration, fundamental frequency (F0), and energy;
- High-fidelity speech generation using the RAD-TTS++ acoustic model;
- Fast vocoding using Vocos;
- Synthesizes long sentences effectively;
- Supports a sampling rate of 44.1 kHz;
- Tested on Linux environments and Windows/WSL;
- Python API (requires Python 3.9 or later);
- CUDA-enabled for GPU acceleration.

Repository: https://github.com/egorsmkv/tts_uk

reacted to DualityAI-RebekahBogdanoff's post with 🧠 5 days ago

Post

3172

Duality.ai just released a 1000 image dataset used to train a YOLOv8 model in multiclass object detection -- and it's 100% free!
duality-robotics/YOLOv8-Multiclass-Object-Detection-Dataset

Access the full size dataset by creating an EDU account here- https://falcon.duality.ai/secure/documentation/ex3-dataset?sidebarMode=learn

Or check it out in the linked HuggingFace dataset!

What makes this dataset unique, useful, and capable of bridging the Sim2Real gap?

💠 The digital twins are not generated by AI, but instead crafted by 3D artists to be INDISTINGUISHABLE from the physical-world objects. This allows the training from this data to transfer into real-world applicability

💠 The simulation software, called FalconEditor, can easily create thousands of images with varying lighting, posing, occlusions, backgrounds, camera positions, and more. This enables robust model training.

💠 The labels are created along with the data. This not only saves large amounts of time, but also ensures the labels are incredibly accurate and reliable.

This HuggingFace dataset is a 20 image and label sample, but you can get the rest at no cost by creating a FalconCloud account here: https://falcon.duality.ai/secure/documentation/ex3-dataset?sidebarMode=learn.

Once you verify your email, the link will redirect you to the dataset page.

reacted to prithivMLmods's post with 👍 5 days ago

Post

4839

SigLIP2 Image Classification 🧤

> https://huggingface.co./blog/prithivMLmods/siglip2-finetune-image-classification

reacted to Undi95's post with 👍 5 days ago

Post

4350

Hi there!

If you want to create your own thinking model or do a better MistralThinker, I just uploaded my entire dataset made on Deepseek R1 and the axolotl config. (well I made them public)

Axolotl config : Undi95/MistralThinker-v1.1

The dataset : Undi95/R1-RP-ShareGPT3

You can also read all I did on those two discord screenshot from two days ago, I'm a little lazy to rewrite all kek.

Hope you will use them!

5 replies

·

reacted to clem's post with 🔥 5 days ago

Post

5839

Super happy to welcome Nvidia as our latest enterprise hub customer. They have almost 2,000 team members using Hugging Face, and close to 20,000 followers of their org. Can't wait to see what they'll open-source for all of us in the coming months!

Nvidia's org: https://huggingface.co./nvidia
Enterprise hub: https://huggingface.co./enterprise

Victor Mustar PRO

AI & ML interests

Recent Activity

Organizations

victor's activity

CohereForAI/aya-vision-8b

motimalu/wan-flat-color-v2

Stable Diffusion 3.5 Large TurboX

Tight Inversion Pulid Demo

Skyreels A1 Talking Head

Spark TTS

YuLan-Mini

Qwen/Qwen2.5-Coder-14B-Instruct

mlx-community/Qwen2.5.1-Coder-7B-Instruct-4bit

mlx-community/QwQ-32B-4bit

agentica-org/DeepScaleR-1.5B-Preview

pipecat-ai/smart-turn

OpenPipe/Deductive-Reasoning-Qwen-32B