Demucs Music Source Separation
Free Lunch towards Style-Preserving in Text-to-Image Generation by InstantX team, with ControlNet
sahil2801/replit-code-instruct-glaive
InstantID: Zero-shot Identity-Preserving Generation in Seconds. Uses Juggernaut-XL v8 as the base model to encourage photorealism
Qwen-14B-Chat is a Transformer-based large language model pretrained on a large volume of data, including web text, books, code, etc.
SDXL trained on a variety of spectrograms
Create cool looking gold metal things!
360 Panorama SDXL image with inpainted wrapping seam
High-Fidelity Text-to-3D Generation via Interval Score Matching
Qwen-VL-Chat but with raw ChatML prompt interface and streaming
BLIP3 (XGen-MM) is a series of foundational Large Multimodal Models (LMMs) developed by Salesforce AI Research
DO NOT USE - Broken - Only Public For API Usage & Debugging
Generate pixel art sprite sheets from four different angles with Stable Diffusion
PaliGemma 3B, an open VLM by Google, pre-trained with 224×224 input images and 128-token input/output text sequences
ControlNet 1.1 lineart x realistic-vision-v2.0 (updated to v5)
SDXL Inpainting developed by the HF Diffusers team
Conceptual image-to-image model for Stable Diffusion 1.5
PyTorch version of Lightweight OpenPose as introduced in "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose"
An example model created from the CLI
Powerful open-source visual language model