Image Generation & AI Art
129 repos
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
21 Lessons, Get Started Building with Generative AI
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate visual media using the latest AI-driven technologies. It offers an industry-leading WebUI and serves as the foundation for multiple commercial products.
A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro 🍌, models based on Gemini-2.5-Flash-Image. We also openly release Nano-consistent-150K to support the community's development of image generation and unified models (see our blog on the website).
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
Enlightened library to convert HTML and CSS to SVG
🍌 World's largest Nano Banana Pro prompt library — 10,000+ curated prompts with preview images, 16 languages. Google Gemini AI image generation. Free & open source.
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Uncensored, open-source alternative to Higgsfield AI, Freepik AI, Krea AI, Openart AI — Free, unrestricted AI image & video generation studio with 200+ models (Flux, Midjourney, Kling, Sora, Veo). No content filters. Self-hosted, MIT licensed.
Stable Diffusion built-in to Blender
Multi-Platform Package Manager for Stable Diffusion
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Supporting OpenAI GPT-OSS, IBM Granite-4, Qwen-3-VL, Gemma-3n, Ministral-3, and more.
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
A repository of models, textual inversions, and more
SD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing
AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. Includes AI personas, AGI functions, world-class Beam multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.
The world's first open-source multimodal creative assistant: a privacy-first alternative to Canva and Manus that can run locally.
SD-Trainer: LoRA & DreamBooth training scripts & GUI for diffusion models, built on kohya-ss's trainer.
This repository contains hand-curated resources for prompt engineering, with a focus on Generative Pre-trained Transformers (GPT), ChatGPT, PaLM, etc.
Free prompt engineering online course. ChatGPT and Midjourney tutorials are now included!
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.
Examples of ComfyUI workflows
SwarmUI (formerly StableSwarmUI): a modular Stable Diffusion web user interface, with an emphasis on making power tools easily accessible, high performance, and extensibility.
ComfyUI's ControlNet Auxiliary Preprocessors
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
GGUF Quantization support for native ComfyUI models
LTX-Video Support for ComfyUI
🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python
Improved AnimateDiff for ComfyUI and Advanced Sampling Support
World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.
Dead simple FLUX LoRA training UI with LOW VRAM support
Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.
Custom node pack for ComfyUI that helps conveniently enhance images through Detector, Detailer, Upscaler, Pipe, and more.
Optimizations and integrations of commonly used nodes to make ComfyUI easier to use.
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Official SeedVR2 Video Upscaler for ComfyUI
A powerful tool that translates ComfyUI workflows into executable Python code.
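For context, a ComfyUI workflow in API format is plain JSON: each node is keyed by an id and carries a `class_type` plus an `inputs` map, where a two-element list like `["1", 0]` links an input to another node's output. A minimal illustrative sketch of inspecting such a graph (the node names and values below are examples, not taken from any specific workflow):

```python
import json

# A tiny ComfyUI workflow in API format: the sampler's "model" input
# references output 0 of the checkpoint-loader node.
workflow = {
    "1": {
        "class_type": "CheckpointLoaderSimple",
        "inputs": {"ckpt_name": "example-model.safetensors"},
    },
    "2": {
        "class_type": "KSampler",
        "inputs": {"model": ["1", 0], "seed": 42, "steps": 20},
    },
}

def linked_inputs(graph):
    """Return (node_id, input_name, source_id) for every node-to-node link."""
    links = []
    for node_id, node in graph.items():
        for name, value in node["inputs"].items():
            # Links are encoded as [source_node_id, output_index].
            if isinstance(value, list) and len(value) == 2:
                links.append((node_id, name, value[0]))
    return links

print(json.dumps(workflow, indent=2))
print(linked_inputs(workflow))  # [('2', 'model', '1')]
```

A workflow-to-Python translator walks exactly this structure, topologically ordering nodes by their links before emitting code.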
Your Automatic Prompt Engineering Assistant for GenAI Applications
JoyAI-Image is the unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image editing.
A ComfyUI custom node designed for advanced image background removal and object, face, clothes, and fashion segmentation, utilizing multiple models including RMBG-2.0, INSPYRENET, BEN, BEN2, BiRefNet, SDMatte, SAM, SAM2, SAM3 and GroundingDINO.
Curated GPT-Image-2 prompts for the OpenAI API — portraits, posters, UI mockups, game screenshots, character sheets, and more. Ready-to-use prompts for gpt-image-2.
Nodes related to video workflows
An open-source Vercel-like deployment platform for ComfyUI.
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
A ComfyUI extension for organizing and managing all your workflows and models in one place: seamlessly switch between workflows, import and export them, reuse subworkflows, install models, and browse your models in a single workspace.
PALLAIDIUM — a generative AI movie studio, seamlessly integrated into the Blender Video Editor (VSE), enabling end-to-end production from script to screen and back.
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
A full-featured image/video management app with AI-powered organization and semantic search. Supports metadata from SD-webui, ComfyUI, Fooocus, NovelAI, StableSwarmUI, and more. Available as standalone app, SD-webui extension, or library.
Simple shell script to use OpenAI's ChatGPT and DALL-E from the terminal. No Python or JS required. Formerly https://gptshell.cc
[WIP] The all-in-one inference optimization solution for ComfyUI: universal, flexible, and fast. https://wavespeed.ai/
Examples of programs built using Modal
Portable ComfyUI installer for Windows, macOS and Linux 🔹 Nvidia GPU support 🔹 Pixaroma Community Edition
LoRA Manager for ComfyUI - A powerful extension for organizing, previewing, and integrating LoRA models with metadata and workflow support.
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
Metadata-indexer and Viewer for AI-generated images
ControlNet scheduling and masking nodes with sliding context support
A ComfyUI custom node integration for local multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools
The most powerful and modular Stable Diffusion GUI, API, and backend with a graph/nodes interface. Now ZLUDA-enhanced for better AMD GPU performance.
This repository offers various extension nodes for ComfyUI. Nodes here have different characteristics compared to those in the ComfyUI Impact Pack, which has grown too large.
Command Line Interface for Managing ComfyUI
Qwen-Image text to image lora trainer
Open WebUI Tools is a modular toolkit designed to extend and enrich your Open WebUI instance, turning it into a powerful AI workstation. With a suite of over 15 specialized tools, function pipelines, and filters, this project supports academic research, agentic autonomy, multimodal creativity, workflows, and more.
The most advanced Nano Banana image generator and editor application. Your central hub for AI image generation and revisions. Intuitive UI features reference images, editing with image masks, version history, and more. Powered by the Gemini 2.5 Flash Image API.
An image/video/workflow browser and manager for ComfyUI.
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
This custom node lets you train LoRA directly in ComfyUI!
ComfyUI nodes for WanAnimate model input preprocessing
Huge AI models catalog. A curated list of AI tools, platforms, and resources across various domains.
A Stable Diffusion WebUI training-aid extension that helps you quickly and visually train models such as LoRA.
Start- and end-frame video generation nodes based on Kijai's modified Wan2.1 nodes.
On-device AI for Android — LLM chat (GGUF/llama.cpp), vision models (VLM), image generation (Stable Diffusion), tool calling, AI personas, RAG knowledge packs, TTS/STT. Fully offline, zero subscriptions, open-source.
ComfyUI custom nodes and web utilities for real-time AI generation and interaction
This extension serves as a complement to the Impact Pack, offering features that are not deemed suitable for inclusion by default in the ComfyUI Impact Pack
lightweight Python-based MCP (Model Context Protocol) server for local ComfyUI
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
AI video generation SDK — JSX for videos. One API for Kling, Flux, ElevenLabs, Sora. Built on Vercel AI SDK.
Seamlessly integrate state-of-the-art transformer models into robotics stacks
LDSR custom node for ComfyUI
Welcome to the ChatGPT Prompts Library! This repository contains a diverse collection of over 100,000 prompts tailored for ChatGPT. Our prompts cover a wide range of topics, including marketing, business, fun, and much more.
Transcribe audio and add subtitles to videos using Whisper in ComfyUI
An AI-powered storytelling video generator that takes user input as a story prompt, generates a story using OpenAI's GPT-3, creates images using OpenAI's DALL-E, adds voiceover using ElevenLabs API, and combines the elements into a video.
Run Replicate models as nodes in ComfyUI
Upscale your videos up to 4k on free google colab using Real-ESRGAN
An extensive node suite for ComfyUI with over 210 new nodes
Natural language → ComfyUI workflow JSON. 34 built-in templates, 360+ node definitions, auto model download. Supports txt2img, img2img, txt2vid, img2vid, audio, 3D generation across SD1.5/SDXL/SD3/FLUX/Wan2.2/HunyuanVideo/LTXV/Mochi/Cosmos + LLM integration. Works as a skill for Claude Code, Cursor, and other AI coding agents.
Custom nodes for using fal API.
Text generation via AI APIs.
VividNode: Multi-purpose Text & Image Generation Desktop Chatbot (supporting various models including GPT).
A cross-platform desktop application for running AI models from [WaveSpeedAI](https://wavespeed.ai), as well as many free local AI models including Z-Image.
HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.
A memory-efficient implementation for upscaling videos in ComfyUI using non-diffusion upscaling models. This custom node is designed to handle large video frame sequences without memory bottlenecks.
A DALL-E 3 localhost web UI exposing advanced settings like style (vivid vs. natural) and quality (standard vs. HD). Also handy when ChatGPT's DALL-E throttles you for the day, if you're willing to pay the API call costs. Comes integrated with Prompt Inspirer!
A visual node-based editor for building, sharing, and executing complex AI workflows with Fal.ai and Replicate.
Node to enable seamless multiuser workflow collaboration
ComfyUI Chatterbox TTS & Voice Conversion Node
🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic evaluations of text-to-image models and supports customization with user-defined metrics, datasets, and visualizations.
A powerful ComfyUI node for text-based image editing using Black Forest Labs' Flux Kontext API.
A ComfyUI custom node for Google's Gemini 2.5 Flash Image (aka "Nano Banana") model - the state-of-the-art image generation and editing AI that went viral for its incredible quality and capabilities.
Stable Diffusion Desktop client for Windows, macOS, and Linux built in Embarcadero Delphi.
An Extensive AI & Camera Metadata Viewer
A ComfyUI custom node for audio subtitling based on WhisperX and translators.
Replicate Flux LoRA image editor.
Installation script for AI applications using ROCm on Linux.
MCP server for Fal.ai - Generate images, videos, music and audio with Claude
ComfyUI with AMD ROCm support for GPU-accelerated AI image generation on AMD RX 6000/7000+ GPUs
Self-correcting image generation for Gemini's Nano Banana model
AI-powered image generation using Google Gemini, integrated with Claude Code via Skills or Claude.ai via MCP (Model Context Protocol).
Learn how multimodal AI merges text, image, and audio for smarter models
ROCm-optimized ComfyUI nodes.
Prompt Management System for Interaction with the ChatGPT API
Generate videos from text using various Stable Diffusion Models via Text2Video-Zero.
Linux virtual keyboard driver which types what you say using Deepgram Flux STT API
AMD GPU Monitor for ComfyUI
Agent-native image-editing SDK for Claude Code. 21 MCP tools + /decompose skill — semantic layer splits, L1–L5 cultural scoring, region inpaint. Powered by ComfyUI, Gemini, or mock.
The ultimate PyQt6 application that integrates the power of OpenAI, Google Gemini, Claude, and other open-source AI models
The TeamAI application lets users create a team of AI-powered assistants with individual capabilities and personas. The assistants solve the user's requested task as a team effort, each bot contributing its respective capabilities. Supported providers are Ollama and OpenAI.
LTX-2.3 video generation skill — setup, inference, prompting, ComfyUI integration for Lightricks 22B DiT audio-video model