gitok - Develop and Earn

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

image to image style transfer using STROTSS loss

SDXL LoRA finetuned on Vermeer paintings

DreamBooth safetensors model use RealVisXL

A latent text-to-image diffusion model capable of generating photo-realistic images given any text input

Morphs vector paths towards a text prompt

A model for testing pydantic cog that yields images one word at a time.

Multi-stage text-to-video generation

ControlNet Line Art Anime

Lightweight multimodal model for visual question answering, reasoning and captioning

Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Mediapipe Blendshape Labeler - Predicts the blend shapes of an image.

A Vision-Language Model with An Ensemble of Experts

Detects protein in DAB image

Real-ESRGAN for image upscaling on an A100

Upscale images with the latent diffusion superresolution model

T4 GPU, negative embeddings, img2img, inpainting, safety checker, KarrasDPM, pruned fp16 safetensor

Diffusion-based semantic image editing with mask guidance

Turn two prompts into one image

Paintings in the style of selected artists with weights, from the Construction Series of GymDreams8.