Bils (Bilel Aroua)

reacted to their post with 😎🚀 about 4 hours ago

Post

256

Spatial sound experience! SonicOrbit features AI beat detection to auto-sync your rhythm.

Bils/SonicOrbit

posted an update about 4 hours ago

Post

256

Spatial sound experience! SonicOrbit features AI beat detection to auto-sync your rhythm.

Bils/SonicOrbit

reacted to their post with 👍 about 6 hours ago

Post

2001

create amazing audio ads in just a few steps
Bils/AIPromoStudio

reacted to their post with 🚀 4 days ago

Post

2001

create amazing audio ads in just a few steps
Bils/AIPromoStudio

reacted to burtenshaw's post with 👍 8 days ago

Post

5281

I made a real time voice agent with FastRTC, smolagents, and hugging face inference providers. Check it out in this space:

🔗 burtenshaw/coworking_agent

9 replies

·

reacted to freddyaboulton's post with 🚀 9 days ago

Post

3105

Getting WebRTC and Websockets right in python is very tricky. If you've tried to wrap an LLM in a real-time audio layer then you know what I'm talking about.

That's where FastRTC comes in! It makes WebRTC and Websocket streams super easy with minimal code and overhead.

Check out our org: hf.co/fastrtc

posted an update 9 days ago

Post

2001

create amazing audio ads in just a few steps
Bils/AIPromoStudio

reacted to their post with 🔥 about 1 month ago

Post

2113

🚀 We're excited to share major improvements to our Janus-Pro-7B Text-to-Image Generation Space!
🎨What's New:
1-Critical Bug Fixes
2-Enhanced Features
3-UI Improvements
4-Performance Boost
Try It Now:
Bils/DeepseekJanusPro-Image

reacted to their post with ❤️ about 1 month ago

Post

1875

🚀 Explore the powerful Janus-Pro-7B Text-to-Image Generator! Transform your prompts into stunning visuals with state-of-the-art AI.
Bils/DeepseekJanusPro-Image

2 replies

·

reacted to fdaudens's post with 🔥 about 1 month ago

Post

3388

🎯 Kokoro TTS just hit v1.0! 🚀

Small but mighty: 82M parameters, runs locally, speaks multiple languages. The best part? It's Apache 2.0 licensed!
This could unlock so many possibilities ✨

Check it out: hexgrad/Kokoro-82M

1 reply

·

reacted to hexgrad's post with ❤️ about 1 month ago

Post

8396

hexgrad/Kokoro-82M got an upgrade! ⬆️ More voices, more languages, pip install kokoro, and still 82M parameters.

GitHub: https://github.com/hexgrad/kokoro
PyPI: https://pypi.org/project/kokoro/
Space: hexgrad/Kokoro-TTS

11 replies

·

reacted to their post with 🚀 about 1 month ago

Post

2113

🚀 We're excited to share major improvements to our Janus-Pro-7B Text-to-Image Generation Space!
🎨What's New:
1-Critical Bug Fixes
2-Enhanced Features
3-UI Improvements
4-Performance Boost
Try It Now:
Bils/DeepseekJanusPro-Image

posted an update about 1 month ago

Post

2113

🚀 We're excited to share major improvements to our Janus-Pro-7B Text-to-Image Generation Space!
🎨What's New:
1-Critical Bug Fixes
2-Enhanced Features
3-UI Improvements
4-Performance Boost
Try It Now:
Bils/DeepseekJanusPro-Image

replied to their post about 1 month ago

Copy and paste your prompt in our space and view the outcome. https://huggingface.co./spaces/Bils/DeepseekJanusPro-Image

reacted to their post with 🔥🚀 about 1 month ago

Post

1875

🚀 Explore the powerful Janus-Pro-7B Text-to-Image Generator! Transform your prompts into stunning visuals with state-of-the-art AI.
Bils/DeepseekJanusPro-Image

2 replies

·

posted an update about 1 month ago

Post

1875

🚀 Explore the powerful Janus-Pro-7B Text-to-Image Generator! Transform your prompts into stunning visuals with state-of-the-art AI.
Bils/DeepseekJanusPro-Image

2 replies

·

reacted to multimodalart's post with ❤️ about 1 year ago

Post

The Stable Diffusion 3 research paper broken down, including some overlooked details! 📝

Model
📏 2 base model variants mentioned: 2B and 8B sizes

📐 New architecture in all abstraction levels:
- 🔽 UNet; ⬆️ Multimodal Diffusion Transformer, bye cross attention 👋
- 🆕 Rectified flows for the diffusion process
- 🧩 Still a Latent Diffusion Model

📄 3 text-encoders: 2 CLIPs, one T5-XXL; plug-and-play: removing the larger one maintains competitiveness

🗃️ Dataset was deduplicated with SSCD which helped with memorization (no more details about the dataset tho)

Variants
🔁 A DPO fine-tuned model showed great improvement in prompt understanding and aesthetics
✏️ An Instruct Edit 2B model was trained, and learned how to do text-replacement

Results
✅ State of the art in automated evals for composition and prompt understanding
✅ Best win rate in human preference evaluation for prompt understanding, aesthetics and typography (missing some details on how many participants and the design of the experiment)

Paper: https://stabilityai-public-packages.s3.us-west-2.amazonaws.com/Stable+Diffusion+3+Paper.pdf

3 replies

·

reacted to MehdiLeZ's post with ❤️ about 1 year ago

Post

Dear music lovers 🕺,

MusicLang Space is now live: musiclang/README

MusicLang is a controllable model for music generation:

> 🦙 Discover the LLAMA2 architecture, trained from scratch for symbolic music generation, ensuring exceptional quality;
> 👨‍🎨 Unleash your creativity by extending an existing music, or create new ones from scratch;
> 🤖 Integrate MusicLang into your applications, with an inference optimized for CPUs written in C, other integrations and optimizations coming soon.

In the space, you’ll find :

1️⃣ MusicLang foundation model: our fondation model for creating and generating original midi soundtracks musiclang/musiclang-v2;

2️⃣ MusicLang predict: our AI prediction api of the MusicLang package https://github.com/musiclang/musiclang_predict?tab=readme-ov-file;

3️⃣ MusicLang Language:a new language for tonal music. This language allows composers to load, write, transform and predict symbolic music in a simple, condensed and high level manner https://github.com/MusicLang/musiclang;

4️⃣ MusicLang Demo Space: musiclang/musiclang-predict

5️⃣ Our Colab: https://colab.research.google.com/drive/1MA2mek826c05BjbWk2nRkVv2rW7kIU_S?usp=sharing

Help us share the future of music composition! Spread the word, show your support by adding a star or contribute to our project. ⭐️✨

Music Sounds Definitely Better with You 🎶 🖤

cc @floriangardin @MehdiLeZ @reach-vb

Thanks a lot,

The MusicLang team ❤️

8 replies

·

Bilel Aroua PRO

AI & ML interests

Recent Activity

Organizations

Bils's activity