pheme@lucataco

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

strotss@eleazhong

image to image style transfer using STROTSS loss

sdxl-vermeer@georgedavila

SDXL LoRA finetuned on Vermeer paintings

custum_model_safetonsors@zelenioncode

DreamBooth safetensors model use RealVisXL

stable-diffusion@stability-ai

A latent text-to-image diffusion model capable of generating photo-realistic images given any text input

clipdraw-interactive@evilstreak

Morphs vector paths towards a text prompt

haiku-progressive-image@zeke

A model for testing pydantic cog that yields images one word at a time.

damo-text-to-video@cjwbw

Multi-stage text-to-video generation

colorize-line-art@camenduru

ControlNet Line Art Anime

bunny-phi-2-siglip@adirik

Lightweight multimodal model for visual question answering, reasoning and captioning

show-1@cjwbw

Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

v-sekai.mediapipe-labeler@fire

Mediapipe Blendshape Labeler - Predicts the blend shapes of an image.

prismer@nvlabs

A Vision-Language Model with An Ensemble of Experts

dab_stain_analyser@theleelab

Detects protein in DAB image

real-esrgan-a100@daanelson

Real-ESRGAN for image upscaling on an A100

latent-sr@nightmareai

Upscale images with the latent diffusion superresolution model

dreamshaper-v7@pagebrain

T4 GPU, negative embeddings, img2img, inpainting, safety checker, KarrasDPM, pruned fp16 safetensor

diffedit-stable-diffusion@cjwbw

Diffusion-based semantic image editing with mask guidance

twinpainting@andreasjansson

Turn two prompts into one image

gdmjp2@gymdreams8

Paintings in the style of selected artists with weights, from the Construction Series of GymDreams8.