SynCD
Image generator/identifier/reposer
Generate click coordinates from image and instruction
Text-to-3D and Image-to-3D Generation
Detect and annotate poses in images and videos
FitDiT is a high-fidelity virtual try-on model.
Find similar images from a dataset
Create 3D models from images
β¨[With v1.0.0] Accelerated TTS on Kokoro-82M
Transform research papers and mathematical concepts into stu
Extract clothing from images using a mask
Audio Conditioned LipSync with Latent Diffusion Models
Gaze detection using Moondream
Extend images to custom sizes and alignments