Blog/May 18, 2026·7 min read

How to Make a 4-Panel Comic (Yonkoma)

The 4-panel comic is the smallest unit of comic storytelling that still feels like a complete story. Done right, four panels can land a setup, a development, a twist, and a punchline. Done wrong, four panels feel like three panels with one extra. The difference is structural — and the structure most working 4-panel comics use is Japanese in origin. This is what yonkoma is, the four-act formula behind it, and how to write one that lands.

What 4-Panel Comics Are For

Four panels is the format for self-contained beats. A daily newspaper strip (Peanuts, Calvin and Hobbes, Garfield) typically runs 3–4 panels per strip. Japanese yonkoma (literally 'four cells') runs vertically in magazines and reads as one self-contained gag or beat per strip. Web comic strips on Instagram and Twitter often use the 4-panel grid because it's the shape that fits a single mobile screen. The shared trait: each strip should land without prior context, even when serialized.

Kishōtenketsu — The Four-Act Formula

Yonkoma uses a Japanese narrative structure called kishōtenketsu (起承転結), which translates roughly as 'introduction, development, twist, conclusion.' Unlike Western three-act structure, which relies on rising conflict, kishōtenketsu doesn't require conflict at all — the twist in panel 3 is a reframing or a new angle rather than an escalating problem. Slice-of-life comics use this structure naturally; the strip doesn't need a problem to feel like a complete story, just a setup that lands when reframed.

Panel 1 — 起 (Ki, Introduction)

Establish the scene, character, and starting situation. The reader knows who is involved, where they are, and what is happening. No conflict yet — just the world.

Panel 2 — 承 (Shō, Development)

Continue the introduced situation. Add a small piece of information, advance the action one beat, or deepen the context. The reader settles into the rhythm and assumes they know where this is going.

Panel 3 — 転 (Ten, Twist)

Introduce something unexpected — a new element, an off-angle observation, a hidden context the reader hadn't been shown. The reader's mental model of the situation breaks or rotates. This is the panel that does the work; weak yonkoma have weak panel 3s.

Panel 4 — 結 (Ketsu, Conclusion)

Resolve the twist. Often this is the punchline (the comedy lands), but it can also be a reframing (the meaning shifts), a quiet emotional payoff (the slice-of-life beat), or a reveal (the hidden context is named).

Kishōtenketsu vs Three-Act in 4 Panels

Western 4-panel strips sometimes use compressed three-act: setup (panel 1) → complication (panel 2) → escalation (panel 3) → punchline (panel 4). This works for joke strips but feels mechanical for slice-of-life or character beats. Kishōtenketsu fits a wider range of tones because the twist doesn't have to be a conflict — it can be a small observation, a change of perspective, a moment of grace. Most working yonkoma artists prefer kishōtenketsu because it lets the strip carry character and atmosphere, not just gags.

Free Template — Yonkoma Beat Sheet

Copy this and fill it in per strip: ``` STRIP TITLE: <name> CAST: <names, 1–3> SETTING: <one line> Panel 1 (起 Ki, Introduction): Action: <one line> Dialogue: <8–15 words total> Panel 2 (承 Shō, Development): Action: <one line, extends panel 1> Dialogue: <8–15 words> Panel 3 (転 Ten, Twist): Action: <one line, introduces unexpected element> Dialogue: <8–15 words, or silent> Panel 4 (結 Ketsu, Conclusion): Action: <one line, lands the twist> Dialogue: <8–15 words, or silent> ``` Four lines of action, 40–60 words of dialogue total. If the strip can't be written within those constraints, it's a 6-panel or 8-panel strip — not a yonkoma.

Famous 4-Panel Examples (Structure Breakdown)

Three classic strips analyzed beat-by-beat:

Peanuts (Charles Schulz)

Panel 1: Charlie Brown holds the football. Panel 2: Lucy waits. Panel 3: Charlie Brown runs. Panel 4: Lucy pulls the ball; Charlie Brown flies through the air. Western three-act compressed — setup, development, punchline. Reads instantly because the kids learned the structure across decades of repetition.

Yotsubato! (Kiyohiko Azuma)

Panel 1: Yotsuba sees something new in the garden. Panel 2: Yotsuba inspects it (a flower, a frog, a sprinkler). Panel 3: Yotsuba reaches a wrong but charming conclusion about what it is. Panel 4: Yotsuba reacts. Pure kishōtenketsu — no conflict, the twist is the misunderstanding, the conclusion is the kid's reaction. Slice-of-life perfection.

Azumanga Daioh (Kiyohiko Azuma)

Panel 1: Osaka spaces out in class. Panel 2: She has an oblique thought. Panel 3: She shares the thought with classmates. Panel 4: The classmates' reaction lands the joke. Kishōtenketsu with character voice carrying the twist.

Common 4-Panel Mistakes

Four errors that flag amateur yonkoma. One: panel 3 doesn't twist — the development just continues, and panel 4 feels like a fourth development panel. Two: dialogue is too dense — 4-panel strips that need 20+ words per balloon read as too long for the format. Three: the setup is too complex — a yonkoma can't open with three named characters and a backstory; pick one or two characters max. Four: the punchline depends on prior strips — most readers see one yonkoma in isolation; if the joke needs continuity, it's a longer-form story compressed wrong.

Using AI for 4-Panel Strips

Modern AI comic tools handle 4-panel format especially well — the small panel count fits within the pacing the AI naturally writes toward. Input a one-sentence premise, specify yonkoma or 4-panel comic, and the AI outputs a structured strip with kishōtenketsu pacing pre-applied. The strongest workflow: AI generates 5–10 yonkoma drafts from one premise; you pick the one with the strongest twist and polish the dialogue.

Questions

Frequently asked.

Format conventions. A Western comic strip is horizontal (panels side-by-side, read left-to-right) and runs in newspapers or web. A yonkoma is vertical (panels stacked top-to-bottom, read top-down) and runs in manga magazines. The structural difference: yonkoma typically uses kishōtenketsu; Western strips more often use compressed three-act with a punchline.

Aspect ratio varies by venue. Newspaper strips: 4:1 (wide and short). Web/Instagram: 1:1 square divided into 4 panels (2×2 grid), or 1:4 tall (4 panels stacked vertically). Manga yonkoma: 1:4 tall. Pick by where you'll publish.

Yes. Yonkoma is often used for slice-of-life, character study, or quiet emotional beats — not just comedy. Kishōtenketsu structure works for any tone. The expectation that 4-panel = funny is a Western newspaper convention, not a format requirement.

Solo creator hand-drawn: 1–4 hours per strip including writing, thumbnailing, drawing, and lettering. AI-assisted: 10–30 minutes per strip. The writing phase is the longest part of the work; the four-panel constraint forces tight editing.

Yes. Modern AI comic tools handle yonkoma especially well because the structure (kishōtenketsu) is well-represented in training data. Input a one-line premise, choose yonkoma format, and the AI outputs a pre-structured strip. Edit for voice and the twist quality — those are the parts AI does roughly.

Ready when you are.

Start with a paragraph.
Finish with a comic.

Sign up free, draft your first script in seconds, then upgrade to render the comic.