Awario
0 followers · 25 following
AI & ML interests
None yet
Recent Activity
reacted to alibabasglab's post 17 days ago
We are thrilled to present the improved "ClearerVoice-Studio", an open-source platform designed to make speech processing easy to use for everyone! Whether you're working on speech enhancement, speech separation, speech super-resolution, or target speaker extraction, this unified platform has you covered.

**Why Choose ClearerVoice-Studio?**
- Pre-Trained Models: Includes cutting-edge pre-trained models, fine-tuned on extensive, high-quality datasets. No need to start from scratch!
- Ease of Use: Designed for seamless integration with your projects, offering a simple yet flexible interface for inference and training.

**Where to Find Us?**
- GitHub Repository: ClearerVoice-Studio (https://github.com/modelscope/ClearerVoice-Studio)
- Try Our Demo: Hugging Face Space (https://huggingface.co./spaces/alibabasglab/ClearVoice)

**What Can You Do with ClearerVoice-Studio?**
- Enhance noisy speech recordings to achieve crystal-clear quality.
- Separate speech from complex audio mixtures with ease.
- Transform low-resolution audio into high-resolution audio. A full upscaled LJSpeech-1.1-48kHz dataset can be downloaded from https://huggingface.co./datasets/alibabasglab/LJSpeech-1.1-48kHz .
- Extract target speaker voices with precision using audio-visual models.

**Join Us in Growing ClearerVoice-Studio!**
We believe in the power of open-source collaboration. By starring our GitHub repository and sharing ClearerVoice-Studio with your network, you can help us grow this community-driven platform.

**Support us by:**
- Starring it on GitHub.
- Exploring and contributing to our codebase.
- Sharing your feedback and use cases to make the platform even better.
- Joining our community discussions to exchange ideas and innovations.

Together, let's push the boundaries of speech processing! Thank you for your support! 💖
reacted to sanaka87's post with 🔥 17 days ago
Excited to Share Our Latest Work: 3DIS & 3DIS-FLUX for Multi-Instance Layout-to-Image Generation! ❤️❤️❤️

Daily Paper: https://huggingface.co./papers/2501.05131#community
Code is now open source!
Project Website: https://limuloo.github.io/3DIS/
GitHub Repository: https://github.com/limuloo/3DIS
3DIS Paper: https://arxiv.org/abs/2410.12669
3DIS-FLUX Tech Report: https://arxiv.org/abs/2501.05131

🔥 Why 3DIS & 3DIS-FLUX?
Current SOTA multi-instance generation methods are typically adapter-based, requiring additional control modules trained on pre-trained models for layout and instance attribute control. However, with the emergence of more powerful models like FLUX and SD3.5, these methods demand constant retraining and extensive resources.

✨ Our Solution: 3DIS
We introduce a decoupled approach that only requires training a low-resolution Layout-to-Depth model to convert layouts into coarse-grained scene depth maps. Leveraging community and company pre-trained models like ControlNet + SAM2, we enable training-free controllable image generation on high-resolution models such as SDXL and FLUX.

Benefits of Our Decoupled Multi-Instance Generation:
1. Enhanced Control: By constructing scenes using depth maps in the first stage, the model focuses on coarse-grained scene layout, improving control over instance placement.
2. Flexibility & Preservation: The second stage employs training-free rendering methods, allowing seamless integration with various models (e.g., fine-tuned weights, LoRA) while maintaining the generative capabilities of pre-trained models.

Join us in advancing Layout-to-Image Generation! Follow and star our repository to stay updated!
upvoted a paper 23 days ago
TransPixar: Advancing Text-to-Video Generation with Transparency
Organizations
None yet
Awario's activity
liked 4 models 23 days ago
wileewang/TransPixar • Text-to-Video • Updated 20 days ago • 1.9k • 32
JeffreyXiang/TRELLIS-image-large • Image-to-3D • Updated Dec 6, 2024 • 697k • 342
CiaraRowles/IP-Adapter-Instruct • Image-to-Image • Updated Aug 13, 2024 • 93 • 49
BytedanceDouyinContent/SAIL-VL-2B • Updated 18 days ago • 244 • 25
liked a model about 2 months ago
TencentARC/BrushEdit • Image-to-Image • Updated Dec 16, 2024 • 25
liked a Space about 2 months ago
Running on Zero • 771 • Flux Style Shaping • Optical illusions and style transfer with FLUX
liked 3 models 2 months ago
xyfJASON/ctrlora • Updated Oct 15, 2024 • 7
black-forest-labs/FLUX.1-Redux-dev • Updated Nov 25, 2024 • 73.5k • 379
black-forest-labs/FLUX.1-Depth-dev-lora • Updated Dec 11, 2024 • 8.41k • 142
liked 2 Spaces 4 months ago
Running • 626 • Qwen2.5
Running • 606 • Qwen2-VL-72B
liked a Space 7 months ago
Running on Zero • 302 • ⚡ Video Transcription Smart Summary
liked a Space 8 months ago
Sleeping • 1 • OpenAI-Whisper Audio2Text WebUI
liked 3 Spaces 9 months ago
Runtime error • 8 • Depth Anything
Running on Zero • 10 • ZeST • Zero-Shot Material Transfer from a Single Image
Sleeping • 177 • ZeST • Zero-Shot Material Transfer from a Single Image
liked a Space 10 months ago
Running • 339 • TransferAnything
liked 3 Spaces about 1 year ago
Runtime error • 291 • T2I-Adapter-SDXL
Runtime error • 39 • Cross Image Attention
Running on A10G • 510 • Unofficial SDXL Turbo Img2Img Txt2Img