gitok - Develop and Earn

Demucs Music Source Separation

Free Lunch towards Style-Preserving in Text-to-Image Generation by InstantX team, with ControlNet

sahil2801/replit-code-instruct-glaive

InstantID: Zero-shot Identity-Preserving Generation in Seconds. Uses Juggernaut-XL v8 as the base model to encourage photorealism

Qwen-14B-Chat is a Transformer-based large language model pretrained on a large volume of data, including web text, books, and code.

SDXL trained on a variety of spectrograms

Create cool looking gold metal things!

360 Panorama SDXL image with inpainted wrapping seam

High-Fidelity Text-to-3D Generation via Interval Score Matching

Qwen-VL-Chat but with raw ChatML prompt interface and streaming

BLIP3 (XGen-MM) is a series of foundational Large Multimodal Models (LMMs) developed by Salesforce AI Research

DO NOT USE - Broken - Only Public For API Usage & Debugging

Generate pixel art sprite sheets from four different angles with Stable Diffusion

PaliGemma 3B, an open VLM by Google, pre-trained with 224×224 input images and 128-token input/output text sequences

ControlNet 1.1 lineart x realistic-vision-v2.0 (updated to v5)

SDXL Inpainting developed by the HF Diffusers team

Conceptual image-to-image model for Stable Diffusion 1.5

PyTorch version of Lightweight OpenPose as introduced in "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose"

An example model created from the CLI

Powerful open-source visual language model