C ComfyAtlas

Published May 19, 2026

Understanding ComfyUI Nodes: A Visual Guide for Beginners

What ComfyUI nodes actually are, how data flows between them, and the seven core nodes that make up the default text-to-image workflow. Written for people who just opened ComfyUI for the first time.

The first time you open ComfyUI you see a graph of boxes connected by colored wires. It looks intimidating. It is not.

A ComfyUI workflow is just a chain of small operations. Each box (a node) takes some inputs, does one thing, and passes its output to the next box. Once you know what the seven core nodes do, you can read any text-to-image workflow on the internet.

This guide walks through the default workflow node by node. It assumes you have ComfyUI installed and a checkpoint loaded. If not, see the installation guide.

How a node works

Every node has three parts:

A wire connects an output socket on one node to an input socket on another. Data flows left to right.

When you click Queue Prompt, ComfyUI starts at the rightmost nodes (Save Image), traces backward to find what they need, and runs every node in dependency order. You don’t tell it the order. The graph is the order.

The seven data types

Sockets are color-coded. The color tells you what kind of data flows through that wire. You can only connect outputs to inputs of the same color.

ColorTypeWhat it is
PurpleMODELThe diffusion model itself (the U-Net)
YellowCLIPThe text encoder
RedVAEThe variational autoencoder (latent ↔ pixel converter)
OrangeCONDITIONINGAn encoded prompt — the result of running text through CLIP
PinkLATENTAn image in latent space (compressed, what the model actually works on)
BlueIMAGEA regular RGB image you can see
GrayINT / FLOAT / STRINGPlain numbers and text

These exact colors may shift between ComfyUI versions, but the categories are stable.

The reason there are so many types is the diffusion process itself. A model doesn’t operate on pixels. It operates on a compressed representation called a latent. The VAE compresses pixels into latents and decompresses them back. The model takes a latent plus a CONDITIONING (encoded prompt) and produces a less-noisy latent, repeated many times.

The default workflow, node by node

When you first launch ComfyUI, the graph that loads is a complete text-to-image pipeline. Here is what each node does.

1. Load Checkpoint

This is the entry point. It reads a checkpoint file from models/checkpoints/ and outputs three things:

A checkpoint is a single file (usually .safetensors) that packages all three. SDXL checkpoints, SD 1.5 checkpoints, and FLUX checkpoints all use the same Load Checkpoint node. The model knows what it is internally.

The dropdown widget lists every checkpoint in your models/checkpoints/ folder. If you just added a new model, hit the refresh icon at the top right of ComfyUI to see it.

2. CLIP Text Encode (Prompt) — positive

This node takes:

It outputs CONDITIONING — the prompt encoded into the format the model needs.

Two of these nodes are wired up by default. One is the positive prompt (what you want), the other is the negative prompt (what you don’t want).

The text you type here is sent through CLIP’s text tokenizer and encoder. The model never sees your raw text. It sees a vector representation.

3. CLIP Text Encode (Prompt) — negative

Same node, second copy, used for the negative prompt. Common starting negative prompts include things like blurry, low quality, watermark. Some newer models (FLUX, SD3) ignore negative prompts entirely — leave it empty.

4. Empty Latent Image

This node creates a blank latent canvas. It has three widgets:

It outputs a LATENT — a tensor full of zeros at the requested dimensions. The KSampler will fill it with noise and then iteratively denoise it.

Native resolutions to know:

Going much larger than native produces tiling artifacts. Going much smaller produces deformed faces. Stick close to native for the first run.

5. KSampler

This is the workhorse. It takes:

And widgets:

The output is a LATENT — your generated image, but still in compressed latent form.

6. VAE Decode

The latent the KSampler produced is not viewable. It has 4 channels at 1/8 the resolution of the final image. This node converts it back to pixels.

Inputs:

Output: IMAGE — a normal RGB image you can finally look at.

7. Save Image

The terminal node. It takes an IMAGE and writes it to ComfyUI/output/. The filename_prefix widget controls the start of the filename.

There is also a Preview Image node that displays the image in the browser without saving. Useful while you’re tuning prompts.

Reading the data flow

Look at the default workflow as a sentence:

Load Checkpoint provides MODEL, CLIP, and VAE. ↓ The CLIP feeds two CLIP Text Encode nodes (positive and negative prompts), which output two CONDITIONINGs. ↓ Empty Latent Image produces a blank canvas. ↓ KSampler combines MODEL + CONDITIONINGs + LATENT and runs denoising. ↓ VAE Decode turns the resulting latent into pixels using the VAE. ↓ Save Image writes the pixels to disk.

Once you can read this in your head, you can read any workflow.

Adding nodes to the canvas

Three ways to add a node:

  1. Right-click on empty canvas → Add Node → drill through the menu by category
  2. Double-click on empty canvas → search box. Start typing KSampler, CLIP, etc. — fastest method
  3. Drag a wire from a socket into empty space → release. ComfyUI shows only nodes with a matching socket type

The double-click search is the most useful. Memorize it.

Connecting and disconnecting wires

If you try to connect mismatched types, the wire snaps back. The colors are a hint about what’s allowed.

Custom nodes

The seven nodes above are core nodes — they ship with ComfyUI. The community has built thousands more. ControlNet preprocessors, IP-Adapter, animation, video, upscaling, face restoration. They install as folders inside custom_nodes/.

The most popular installer is ComfyUI-Manager. It adds a “Manager” button to the UI from which you can search and install custom node packs. Most workflow JSON files you’ll find online expect specific custom nodes. ComfyUI-Manager will detect what’s missing and offer to install it.

For a first week with ComfyUI, ignore custom nodes. Build comfort with the core graph first.

Common beginner mistakes

What’s next

You now understand the seven core nodes and how data flows between them. The next step is to actually build a workflow yourself — starting from an empty canvas, dragging in each node, wiring them up, and generating your first image. That walkthrough is the next guide.

#nodes#getting-started#fundamentals#stable-diffusion