Detect and estimate poses in images
Segment objects in videos with point clicks
Request evaluation for speech models
Talk to Qwen2Audio with Gradio and WebRTC ā”ļø