The Midjourney Book — by Rupert Chesman

Front MatterHow to read this book

Midjourney rewards people who learn to describe what they see. This book teaches that skill the way a photography book would — with pictures first, side-by-side comparisons, and the smallest possible change between any two images.

Every figure follows one discipline: change only the variable being taught. Same subject, same seed where it matters, one swapped phrase or parameter. The captions tell you what moved; your eyes do the rest. Every prompt printed beside a figure is copy-pasteable.

Current as of June 2026 — read this box first

Midjourney moves fast. As this book went to press: V7 is the documented platform default (released April 2025); V8.1 is the newest model (April 30, 2026) — 4–5× faster, native 2K “HD” output — but the whole V8 line is officially alpha and changing, sometimes without notice. Several mature features (Omni Reference, multi-prompting, Niji, the Quality parameter, Draft Mode, Turbo) live in V7 and are not yet in V8.1. Every figure in this book is tagged with the model that produced it. Verify anything version-specific against docs.midjourney.com before you rely on it.

You pay in GPU time, not in images

The single most misunderstood thing for beginners. Plans ($10–$120/month) buy Fast GPU hours — a standard image costs roughly one GPU-minute; V8.1 HD ≈ 1.33 minutes; video ≈ 8× an image. Standard plans and above add unlimited Relax mode (queued, slower, free). Explore in Relax, spend Fast hours on finals. Unused Fast hours don't roll over.

1.1The very first prompt — anatomy of a prompt V8.1

Midjourney's prompt grammar is simple: subject → context → lighting → style/camera → parameters. Each layer you add removes a decision from the model and hands it to you.

Start with two words and Midjourney fills every gap with its own taste. Add a setting and it stops guessing where you are. Add light and it stops guessing when. Add a lens and it stops guessing how the scene was seen. The fox below never changes species — only how much of the picture is yours.

Fig 1.1Same animal, escalating control. Subject → context → lighting → style/camera + parameters. a fox a red fox sitting in a forest a red fox sitting in a misty pine forest at dawn, soft golden light a red fox sitting in a misty pine forest at dawn, soft golden backlight, shallow depth of field, photorealistic, shot on 85mm lens --ar 3:2 --style raw

Fig 1.1Same animal, escalating control. Subject → context → lighting → style/camera + parameters. a fox a red fox sitting in a forest a red fox sitting in a misty pine forest at dawn, soft golden light a red fox sitting in a misty pine forest at dawn, soft golden backlight, shallow depth of field, photorealistic, shot on 85mm lens --ar 3:2 --style raw

fox grid in the web app — Fig 1.1bWhat you actually see. Every job returns four candidates in the web app's Create feed. Notice how the variety *within* each grid shrinks as the prompt gets more specific — fewer gaps for the model to fill.

red fox forest grid — Fig 1.1bWhat you actually see. Every job returns four candidates in the web app's Create feed. Notice how the variety *within* each grid shrinks as the prompt gets more specific — fewer gaps for the model to fill.

1.2The grid and the V8.1 Creation Actions V8.1

One prompt, four candidates — then a small row of verbs: Vary (Subtle/Strong), Rerun, Edit, Use (Image/Style/Prompt), Animate.

V8.1's action row is leaner than the classic Discord-era buttons. There is no standalone Upscale — images render at native HD (2K) — and no one-click Zoom Out or Pan; canvas work lives inside Edit. If you've seen U1–U4 / V1–V4 buttons in older tutorials, that's the Discord / V6-era interface.

balloon grid — Fig 1.2One prompt, four candidates — and the verbs that act on them. Hovering a tile reveals Vary Subtle, Vary Strong and Animate; the full action set sits in the panel to the right. a vintage hot air balloon drifting over a patchwork of green farmland, morning haze, gentle clouds --ar 3:2 --v 8.1

Vary Subtle / Vary Strong / Animate — Fig 1.2One prompt, four candidates — and the verbs that act on them. Hovering a tile reveals Vary Subtle, Vary Strong and Animate; the full action set sits in the panel to the right. a vintage hot air balloon drifting over a patchwork of green farmland, morning haze, gentle clouds --ar 3:2 --v 8.1

selected balloon — Fig 1.2bRerun is V8.1's reroll. The same prompt run twice: same brief, different dice. Curate across jobs, not just within one grid.

rerun balloon — Fig 1.2bRerun is V8.1's reroll. The same prompt run twice: same brief, different dice. Curate across jobs, not just within one grid.

Vary Subtle result grid — Fig 1.2cSubtle keeps the painting; Strong repaints it. The same chosen balloon, varied two ways: Subtle nudges clouds and light while the composition holds; Strong renegotiates framing, sky and landscape. Use Subtle to polish a keeper, Strong to escape a near-miss.

Vary Strong result grid — Fig 1.2cSubtle keeps the painting; Strong repaints it. The same chosen balloon, varied two ways: Subtle nudges clouds and light while the composition holds; Strong renegotiates framing, sky and landscape. Use Subtle to polish a keeper, Strong to escape a near-miss.

1.3Expanding the canvas via Edit V8.1 · Edit

On V8.1, the canvas-expansion tools aren't buttons on the result — you open Edit, widen the frame, and generate into the empty space. The original pixels stay put.

Keep the added-region prompt short and literal — you're describing only what the new space should contain, not re-describing the whole picture.

lighthouse base — Fig 1.3The Edit workflow, step by step. A square lighthouse becomes a panorama: open Edit, drag the frame wider, prompt only the new region. a lone lighthouse on a rocky cliff, dramatic stormy sky, crashing waves --ar 1:1 --v 8.1 → Edit · widen canvas · added-region prompt: rocky coastline and ocean

editor open — Fig 1.3The Edit workflow, step by step. A square lighthouse becomes a panorama: open Edit, drag the frame wider, prompt only the new region. a lone lighthouse on a rocky cliff, dramatic stormy sky, crashing waves --ar 1:1 --v 8.1 → Edit · widen canvas · added-region prompt: rocky coastline and ocean

extended lighthouse panorama — Fig 1.3bThe finished extension (29:18). The original square survives untouched in the centre-right; everything else was imagined to order. On Discord / V6.1 this is the one-click *Zoom Out / Pan* — label your figures by interface.

1.4Describe — reverse-prompting workflow

Midjourney can run backwards: feed it an image, and Describe returns four prompts that would plausibly produce it. It is the fastest way to learn how the model "hears" pictures.

Describe panel showing the uploaded photo and four suggested prompts — Fig 1.4Midjourney running backwards. One uploaded photo (right) comes back as four candidate prompts — each a different reading of the same image: the scene, the materials, the lighting, the mood. Reading them teaches you how the model "hears" pictures.

generated result: a young fashion designer sketching in a Parisian studio — Fig 1.4bFrom description back to image. One of the four prompts, lightly adapted and re-run — the loop closed. Adapt, don't just run: the returned prompts are raw material, not finished briefs. a young male fashion designer in a chic Parisian studio, sketching at a worktable, drapes of white fabric, soft window light

↑ All chapters

2.1Word order & front-loading V8.1

Midjourney weights earlier words more heavily. What you put first dominates the composition.

knight-first grid — Fig 2.1Identical words, reordered. Lead with the knight and you get portraits of a knight; lead with the castle and the architecture takes the frame. a knight in golden armor standing before a small grey castle a small grey castle behind a knight in golden armor

castle-first grid — Fig 2.1Identical words, reordered. Lead with the knight and you get portraits of a knight; lead with the castle and the architecture takes the frame. a knight in golden armor standing before a small grey castle a small grey castle behind a knight in golden armor

knight first pick — Fig 2.1bFull resolution, side by side. Knight-first fills the frame with armour; castle-first pushes the knight into the midground and lets the architecture breathe.

castle first pick — Fig 2.1bFull resolution, side by side. Knight-first fills the frame with armour; castle-first pushes the knight into the midground and lets the architecture breathe.

2.2Specific vs vague descriptors V8.1

"Nice" and "good" are decisions you're asking the model to make. Concrete nouns — heather, storm clouds, a distant loch — are decisions you've already made.

Fig 2.2Vague → pleasant but generic. Specific → a place. a nice landscape with good lighting a windswept Scottish highland glen, heather in bloom, low storm clouds breaking into golden shafts of light, distant loch --ar 16:9

Fig 2.2Vague → pleasant but generic. Specific → a place. a nice landscape with good lighting a windswept Scottish highland glen, heather in bloom, low storm clouds breaking into golden shafts of light, distant loch --ar 16:9

2.3The stylize dial — `--s` V8.1 · seed 2200

Stylize controls how much of Midjourney's own lush aesthetic gets imposed on your prompt. 0 = literal and plain; 750 = the house style takes over. Default is 100.

This is the most important dial in the book. Identical prompt, identical seed — only --s changes. Watch the humble jug become an ornament.

stylize 0 — Fig 2.3The stylize sweep. At 0 the jug is earthenware and the light is honest. By 750 every surface is hand-painted majolica and the lemons glow. Neither is "better" — they are different amounts of Midjourney. a still life of lemons and a ceramic jug on a wooden table, natural window light --s 0 --seed 2200 · then --s 100 · 400 · 750

stylize 100 — Fig 2.3The stylize sweep. At 0 the jug is earthenware and the light is honest. By 750 every surface is hand-painted majolica and the lemons glow. Neither is "better" — they are different amounts of Midjourney. a still life of lemons and a ceramic jug on a wooden table, natural window light --s 0 --seed 2200 · then --s 100 · 400 · 750

2.4The chaos dial — `--c` V8.1

Chaos doesn't change one image — it changes how different the four grid candidates are from each other. Show the whole grid, every time.

chaos 0 grid — Fig 2.4Four siblings → four strangers. At 0 the grid agrees on one idea. At 60 you get upside-down islands, dead trees and sunsets you never asked for — great for discovery, risky for briefs. a surreal floating island with a waterfall pouring into clouds --c 0 · then --c 25 · --c 60

chaos 25 grid — Fig 2.4Four siblings → four strangers. At 0 the grid agrees on one idea. At 60 you get upside-down islands, dead trees and sunsets you never asked for — great for discovery, risky for briefs. a surreal floating island with a waterfall pouring into clouds --c 0 · then --c 25 · --c 60

chaos 0 pick — Fig 2.4bOne pick from each grid, full resolution. The chaos-60 candidate barely remembers the brief — and that's the point.

chaos 25 pick — Fig 2.4bOne pick from each grid, full resolution. The chaos-60 candidate barely remembers the brief — and that's the point.

2.5The weird dial — `--w` V8.1

Where chaos varies the grid, weird bends each image itself away from convention. A teapot "designed by nature" is the perfect stress test.

weird 0 grid — Fig 2.5From teapot to driftwood sculpture. At 0 every candidate still pours tea. At 1000 the model keeps the soul of the prompt and abandons the object. a teapot designed by nature, organic forms --w 0 · then --w 250 · --w 1000

weird 250 grid — Fig 2.5From teapot to driftwood sculpture. At 0 every candidate still pours tea. At 1000 the model keeps the soul of the prompt and abandons the object. a teapot designed by nature, organic forms --w 0 · then --w 250 · --w 1000

weird 0 pick — Fig 2.5bThe full sweep at full resolution. At 0 it pours tea; at 250 it's driftwood that remembers being a teapot; at 1000 it's a striped orb in a wire cage that has clearly never met tea.

weird 250 pick — Fig 2.5bThe full sweep at full resolution. At 0 it pours tea; at 250 it's driftwood that remembers being a teapot; at 1000 it's a striped orb in a wire cage that has clearly never met tea.

2.6Multi-prompts & weights — `::` V6.1 · version-dependent

The double colon splits a prompt into separate concepts; numbers set their relative emphasis. space ship is one idea. space:: ship is two.

Officially supported in V6/V6.1 and version-dependent in V7/V8.1 — flag every multi-prompt figure with the model used.

space ship as one concept — Fig 2.6“space ship” — read as one concept. A spaceship, as expected. space ship

space and ship as two concepts — Fig 2.6bThe split and the weights. V6.1 With the double colon, “space” and “ship” become separate ingredients — a vessel sailing the cosmos rather than a spacecraft. Weights then set the mix: `balloon::2 city::1` keeps the balloon the star and the city the supporting act. space:: ship --ar 16:9 --v 6.1 hot air balloon::2 city below::1 --ar 3:2 --v 6.1

weighted balloon over city — Fig 2.6bThe split and the weights. V6.1 With the double colon, “space” and “ship” become separate ingredients — a vessel sailing the cosmos rather than a spacecraft. Weights then set the mix: `balloon::2 city::1` keeps the balloon the star and the city the supporting act. space:: ship --ar 16:9 --v 6.1 hot air balloon::2 city below::1 --ar 3:2 --v 6.1

2.7Negative prompting — `--no` V8.1

Telling Midjourney what to leave out. Imperfect, occasionally stubborn, but the simplest subtraction tool you have.

fruit bowl with bananas — Fig 2.7Subtraction by flag. A multi-word `--no` phrase parses poorly; keep it to simple nouns. a bowl of mixed fruit on a table a bowl of mixed fruit on a table --no bananas

fruit bowl no bananas — Fig 2.7Subtraction by flag. A multi-word `--no` phrase parses poorly; keep it to simple nouns. a bowl of mixed fruit on a table a bowl of mixed fruit on a table --no bananas

2.8Permutation prompts — `{ }` V8.1 · Fast/Turbo only

Curly braces turn one line into a batch: one prompt, three jobs, each consuming GPU time. The cleanest way to run a controlled material study.

ceramic vase — Fig 2.8One line → three jobs. Everything held constant except the material inside the braces. a {ceramic, glass, copper} vase holding white peonies, soft studio light --ar 4:5

glass vase — Fig 2.8One line → three jobs. Everything held constant except the material inside the braces. a {ceramic, glass, copper} vase holding white peonies, soft studio light --ar 4:5

2.9Image weight — `--iw` V7

When a prompt contains a reference image, --iw decides who wins: low = the text; high = the picture.

The reference here is this book's own lighthouse (Fig 1.3). The text asks for a castle on a hill — watch the lighthouse's storm, cliff and mood take over as --iw rises.

image weight 0.5 grid — Fig 2.9Who wins — the words or the picture? At 0.5 the text wins: fairy-tale castles, sunset skies. At 3 the reference wins: dark storms, rocky coast, crashing waves — castles that are lighthouses in everything but name. [lighthouse image] a castle on a hill --iw 0.5 --ar 16:9 --v 7 · then --iw 1.5 · --iw 3

image weight 1.5 grid — Fig 2.9Who wins — the words or the picture? At 0.5 the text wins: fairy-tale castles, sunset skies. At 3 the reference wins: dark storms, rocky coast, crashing waves — castles that are lighthouses in everything but name. [lighthouse image] a castle on a hill --iw 0.5 --ar 16:9 --v 7 · then --iw 1.5 · --iw 3

image weight 0.5 pick — Fig 2.9bOne pick from each weight, full resolution.

image weight 1.5 pick — Fig 2.9bOne pick from each weight, full resolution.

↑ All chapters

3.1Style reference basics — `--sref` V7

The subject stays constant; the look is borrowed from a numeric style code. Use --sref random to discover codes, then lock the keepers.

A style code is a coordinate in Midjourney's space of aesthetics — pure look, no content. The same rainy café below is rendered three times; only the code changes, and with it the entire visual language: warm storybook paint, lantern-lit old-world atmosphere, hard-edged flat colour.

café grid, sref 466919608 — Fig 3.1Same café, three borrowed looks. Each grid keeps its own internal consistency — that's what makes codes reusable across a whole project. (Codes are arbitrary numbers, not rankings; these were locked after exploring with `--sref random`.) a quiet city street café in the rain --sref 466919608 · then --sref 1234567890 · --sref 987654321

café grid, sref 1234567890 — Fig 3.1Same café, three borrowed looks. Each grid keeps its own internal consistency — that's what makes codes reusable across a whole project. (Codes are arbitrary numbers, not rankings; these were locked after exploring with `--sref random`.) a quiet city street café in the rain --sref 466919608 · then --sref 1234567890 · --sref 987654321

café pick, code 466919608 — Fig 3.1bOne pick from each code, full resolution. Subject, weather and time of day survive every restyle — only the aesthetic moves.

café pick, code 1234567890 — Fig 3.1bOne pick from each code, full resolution. Subject, weather and time of day survive every restyle — only the aesthetic moves.

3.2Style weight — `--sw` V7 · sref 1234567890

How hard the borrowed style is pushed. 0–1000, default 100 — at 50 the code whispers; at 400 it shouts.

style weight 50 grid — Fig 3.2The same code at three volumes. One mountain landscape, one style code, three weights — the painterly hand grows steadier and more insistent as `--sw` climbs. a mountain landscape --sref 1234567890 --sw 50 · then --sw 100 · --sw 400

style weight 100 grid — Fig 3.2The same code at three volumes. One mountain landscape, one style code, three weights — the painterly hand grows steadier and more insistent as `--sw` climbs. a mountain landscape --sref 1234567890 --sw 50 · then --sw 100 · --sw 400

style weight 50 pick — Fig 3.2bFull-resolution picks. The sw-100 frame is the regenerated `--no animals` version — the original run smuggled a mountain lion onto its rock (kept in Pics/ as a souvenir of Midjourney filling gaps the prompt left open).

style weight 100 pick — Fig 3.2bFull-resolution picks. The sw-100 frame is the regenerated `--no animals` version — the original run smuggled a mountain lion onto its rock (kept in Pics/ as a souvenir of Midjourney filling gaps the prompt left open).

3.3Blending multiple style references V7

Stack space-separated codes to fuse aesthetics. One code is a borrowed look; two codes are a negotiation.

astronaut grid, single style code — Fig 3.3Solo vs duet. A single code commits hard to one aesthetic (here: neon geometry). Stacking the book's two familiar codes from Fig 3.1 produces something neither would do alone — warm painterly texture with graphic colour. Start with one before stacking; many srefs at high weight create a muddled tug-of-war. a portrait of an astronaut --sref 466502964 --sw 150 --ar 16:9 --v 7 a portrait of an astronaut --sref 1234567890 987654321 --sw 150 --ar 16:9 --v 7

astronaut grid, blended style codes — Fig 3.3Solo vs duet. A single code commits hard to one aesthetic (here: neon geometry). Stacking the book's two familiar codes from Fig 3.1 produces something neither would do alone — warm painterly texture with graphic colour. Start with one before stacking; many srefs at high weight create a muddled tug-of-war. a portrait of an astronaut --sref 466502964 --sw 150 --ar 16:9 --v 7 a portrait of an astronaut --sref 1234567890 987654321 --sw 150 --ar 16:9 --v 7

blended-style astronaut, full resolution — Fig 3.3bThe blend at full resolution. Codes are space-separated, not comma-separated — and the syntax matters: bracketed or doubled `--sref` flags parse differently (we have the failed attempts to prove it).

3.4Omni Reference — the consistency centerpiece V7 only

One reference image, four scenes, one recognisable explorer. This is the technique the identity-drift warnings in Parts 2 and 6 have been pointing at.

Market, ridge, tent, campfire — different light, different weather, different framing, same red scarf and same face. Keep --ow ≤ 400 so the reference doesn't override the scene, and budget 2× GPU per job. Not yet in V8.1.

explorer in a market — Fig 3.4Four postcards from one person. The same reference image used for the 3.5 weight sweep, held at `--ow 200` throughout. a young explorer with a red scarf standing in a bustling market --oref [reference] --ow 200 --ar 3:2 --v 7 · climbing a snowy ridge · reading a map by lantern light in a tent · laughing at a campfire, night

explorer climbing a snowy ridge — Fig 3.4Four postcards from one person. The same reference image used for the 3.5 weight sweep, held at `--ow 200` throughout. a young explorer with a red scarf standing in a bustling market --oref [reference] --ow 200 --ar 3:2 --v 7 · climbing a snowy ridge · reading a map by lantern light in a tent · laughing at a campfire, night

prompt bar with Omni Reference attachment menu — Fig 3.4bWhere the magic attaches. The paperclip menu offers three kinds of reference — Image Prompts (use the elements), Style References (use the look), and **Omni Reference** (use a person's likeness or an object's form). Same image, three very different jobs.

3.5Omni weight — `--ow` V7

With an Omni Reference attached, --ow decides how much the reference image rules. At 50 the character dissolves into the prompt; at 400 he's unmistakably the same man in every frame.

This is the dial behind consistent characters. Low weight treats the reference as a loose suggestion — the grids invent new faces, wardrobes, even whole genres. By 200 the face locks; by 400 the wardrobe and kit lock too. Keep --ow ≤ 400 so the reference doesn't fight the scene, and budget for it: Omni Reference costs 2× GPU.

omni weight 50 grid — Fig 3.5One reference, three loyalties. At 50, four strangers in four art styles. At 200, the reference face appears in every tile. At 400, face, scarf and gear all hold. the explorer in a desert --oref [reference] --ow 50 --ar 16:9 · then --ow 200 · --ow 400

omni weight 200 grid — Fig 3.5One reference, three loyalties. At 50, four strangers in four art styles. At 200, the reference face appears in every tile. At 400, face, scarf and gear all hold. the explorer in a desert --oref [reference] --ow 50 --ar 16:9 · then --ow 200 · --ow 400

omni weight 50 pick — Fig 3.5bFull-resolution picks across the sweep. The two ow-50 frames (from two separate jobs) don't even agree with each other; the ow-200 and ow-400 frames are recognisably one person.

omni weight 50 alt pick — Fig 3.5bFull-resolution picks across the sweep. The two ow-50 frames (from two separate jobs) don't even agree with each other; the ow-200 and ow-400 frames are recognisably one person.

3.6Moodboards & personalization — `--p` V7

A curated 5–10 image moodboard averaged into a reusable house style. Strength is driven by --s, not --sw.

Where a style code borrows one look, a moodboard distils your taste — feed it a handful of images you love and reference it by ID. The cups below carry the board's palette and mood without copying any single source image.

moodboard coffee cup grid — Fig 3.6The board speaks through the grid. All four candidates share a palette and temperament no plain prompt would produce. a brand lifestyle photo of a coffee cup on a linen tablecloth --p [My First Moodboard] --s 300

moodboard coffee cup pick 1 — Fig 3.6bTwo picks, one house style. Different compositions, same brand world — exactly what a moodboard is for.

moodboard coffee cup pick 2 — Fig 3.6bTwo picks, one house style. Different compositions, same brand world — exactly what a moodboard is for.

↑ All chapters

4.1Default vs Raw — the single most useful realism comparison V8.1 · seed 88

Same prompt, same seed; only --style raw added. Default applies Midjourney's signature beautifying hand — Raw strips it for a more literal, documentary result.

Look at what changes and what doesn't. The bakery, the flour, the morning light all survive. What Raw removes is the cinema: the default grid leans into moody shadows and hero lighting, while Raw settles for a man at work who happens to be photographed well. For realism recipes, Raw is the foundation everything else builds on.

baker grid, default style — Fig 4.1Two grids, one seed. The default grid is graded like a film still; the Raw grid is lit like a documentary. a candid portrait of an elderly baker dusted with flour, morning light in a bakery --seed 88 a candid portrait of an elderly baker dusted with flour, morning light in a bakery --style raw --seed 88

baker grid, raw style — Fig 4.1Two grids, one seed. The default grid is graded like a film still; the Raw grid is lit like a documentary. a candid portrait of an elderly baker dusted with flour, morning light in a bakery --seed 88 a candid portrait of an elderly baker dusted with flour, morning light in a bakery --style raw --seed 88

baker default pick — Fig 4.1bFull-resolution picks. Default poses him; Raw catches him.

baker raw pick — Fig 4.1bFull-resolution picks. Default poses him; Raw catches him.

4.2The realism recipe — Raw + low stylize V8.1

Pair --style raw with a modest stylize value (100–250) and documentary language, and Midjourney stops painting and starts reporting.

fisherman mending nets, raw s150 — Fig 4.2The working formula. Overcast light, the word “documentary,” Raw mode, stylize 150 — nothing glamorous, everything believable. a weathered fisherman mending nets on a harbor wall, overcast light, documentary --style raw --s 150 --ar 4:5

fisherman mending nets, raw s150, second pick — Fig 4.2The working formula. Overcast light, the word “documentary,” Raw mode, stylize 150 — nothing glamorous, everything believable. a weathered fisherman mending nets on a harbor wall, overcast light, documentary --style raw --s 150 --ar 4:5

↑ All chapters

5.1The medium gallery — one pear, twelve hands V8.1

One subject, one sentence, twelve mediums. Naming the medium and its technique — “wet-on-wet washes,” “heavy impasto,” “lead lines, glowing backlight” — is the most powerful style lever in plain language.

This is the gallery-wall spread of the book. Read it like a museum room: the fruit never changes, only the hand that made it. Notice how the technique words do the real work — “cross-hatching” builds the graphite, “subsurface scattering” lights the 3D render from inside.

pear, watercolor — Fig 5.1Twelve mediums, one pear. Every prompt begins “a single ripe pear on a wooden table,” then names the medium and two or three technique words. a single ripe pear on a wooden table, watercolor painting, soft wet-on-wet washes, textured paper · oil painting, heavy impasto, palette knife · graphite pencil sketch, cross-hatching · charcoal, smudged shading · gouache, flat matte color · pen and ink line art · 3D render, Octane, subsurface scattering · low-poly 3D · stained glass, lead lines, glowing backlight · pixel art, 32-bit · woodcut print, two-color · photorealistic, 100mm macro --style raw

pear, oil impasto — Fig 5.1Twelve mediums, one pear. Every prompt begins “a single ripe pear on a wooden table,” then names the medium and two or three technique words. a single ripe pear on a wooden table, watercolor painting, soft wet-on-wet washes, textured paper · oil painting, heavy impasto, palette knife · graphite pencil sketch, cross-hatching · charcoal, smudged shading · gouache, flat matte color · pen and ink line art · 3D render, Octane, subsurface scattering · low-poly 3D · stained glass, lead lines, glowing backlight · pixel art, 32-bit · woodcut print, two-color · photorealistic, 100mm macro --style raw

5.2Watercolour — a deeper dive V8.1

One medium, three temperaments: loose and bleeding, light and architectural, precise and scientific. The technique words steer it.

loose watercolor poppies — Fig 5.2Three watercolours that share almost no DNA. “Wet-on-wet bleed” loosens, “minimal linework, light and airy” lightens, “scientific illustration style” tightens. loose watercolor painting of bright orange California poppies in a meadow, wet-on-wet bleed, white paper showing through --ar 3:2 delicate watercolor cityscape of Venice canals, soft washes, minimal linework, light and airy --ar 16:9 botanical watercolor study of fern fronds, precise detail, scientific illustration style --ar 4:5

watercolor Venice canals — Fig 5.2Three watercolours that share almost no DNA. “Wet-on-wet bleed” loosens, “minimal linework, light and airy” lightens, “scientific illustration style” tightens. loose watercolor painting of bright orange California poppies in a meadow, wet-on-wet bleed, white paper showing through --ar 3:2 delicate watercolor cityscape of Venice canals, soft washes, minimal linework, light and airy --ar 16:9 botanical watercolor study of fern fronds, precise detail, scientific illustration style --ar 4:5

5.3Oil painting — a deeper dive V8.1

Impasto, Old Master, Impressionist — three centuries of oil in three prompts. Era language is style language.

Fig 5.3Same pigment, different centuries. expressive oil portrait of a young woman, heavy impasto, visible palette-knife strokes, chiaroscuro --ar 4:5 --s 600 classical oil painting of a storm at sea, dramatic baroque lighting, Old Master style --ar 3:2 impressionist oil painting of a sunlit garden, dappled light, broken color, loose brushwork --ar 3:2

classical oil storm at sea — Fig 5.3Same pigment, different centuries. expressive oil portrait of a young woman, heavy impasto, visible palette-knife strokes, chiaroscuro --ar 4:5 --s 600 classical oil painting of a storm at sea, dramatic baroque lighting, Old Master style --ar 3:2 impressionist oil painting of a sunlit garden, dappled light, broken color, loose brushwork --ar 3:2

5.4Anime & Niji Niji 7

Niji 7 uses “precision prompting” — it renders what you specify and invents little, the opposite of V7's gap-filling. Describe the type of character; never name a copyrighted one.

anime action scene, cyberpunk swordswoman — Fig 5.4Niji's range. Neon action, lantern-lit comfort, storybook whimsy, CRT-glow nostalgia — all from one model, all from explicit description. dynamic anime action scene, a young swordswoman deflecting glowing energy, neon cyberpunk city at night --ar 16:9 --niji 7 · cozy slice-of-life ramen stall · whimsical child and forest spirit · 90s retro cel mecha pilot, CRT glow, film grain --ar 4:3

cozy anime ramen stall — Fig 5.4Niji's range. Neon action, lantern-lit comfort, storybook whimsy, CRT-glow nostalgia — all from one model, all from explicit description. dynamic anime action scene, a young swordswoman deflecting glowing energy, neon cyberpunk city at night --ar 16:9 --niji 7 · cozy slice-of-life ramen stall · whimsical child and forest spirit · 90s retro cel mecha pilot, CRT glow, film grain --ar 4:3

5.5Photorealism & hyperrealism V8.1 · raw

Material realism: skin pores, refracted dew, condensation. Texture language plus --style raw — camera grammar gets all of Part 6.

extreme close-up weathered fisherman — Fig 5.5Realism is a materials problem. “Visible skin pores,” “refracted light,” “water droplets” — name the physics and the model renders it. extreme close-up portrait of a weathered fisherman, visible skin pores, catchlight in the eyes --style raw --s 150 --ar 4:5 · hyperrealistic macro of a dewdrop on a spider's web at dawn --ar 1:1 · photorealistic still life of condensation on a cold glass of iced coffee --ar 4:5

dewdrop on spider web macro — Fig 5.5Realism is a materials problem. “Visible skin pores,” “refracted light,” “water droplets” — name the physics and the model renders it. extreme close-up portrait of a weathered fisherman, visible skin pores, catchlight in the eyes --style raw --s 150 --ar 4:5 · hyperrealistic macro of a dewdrop on a spider's web at dawn --ar 1:1 · photorealistic still life of condensation on a cold glass of iced coffee --ar 4:5

5.6Concept art, matte painting & illustration V8.1

Three professional registers: cinematic environment, industrial design sheet, picture-book warmth.

ruined floating temple concept art — Fig 5.6Job-specific vocabularies. “Orthographic views, annotations” turns a picture into a design document; “soft textured shapes, warm palette” turns it into a bedtime story. epic fantasy environment concept art, a ruined floating temple above misty peaks, matte painting --ar 16:9 --s 600 · sci-fi vehicle concept sheet, orthographic views, annotations · children's book illustration, a friendly whale carrying a lighthouse --ar 3:2

sci-fi rover concept sheet — Fig 5.6Job-specific vocabularies. “Orthographic views, annotations” turns a picture into a design document; “soft textured shapes, warm palette” turns it into a bedtime story. epic fantasy environment concept art, a ruined floating temple above misty peaks, matte painting --ar 16:9 --s 600 · sci-fi vehicle concept sheet, orthographic views, annotations · children's book illustration, a friendly whale carrying a lighthouse --ar 3:2

5.7Graphic, vector & flat styles V8.1

Flat colour, overprint grain, isometric miniatures — graphic-design looks respond to print-trade vocabulary.

flat vector city skyline at sunset — Fig 5.7Speak the printer's language. “Two-color overprint, grainy texture” is what makes a riso a riso. minimalist flat vector illustration of a city skyline at sunset, limited palette --ar 16:9 · bold risograph print poster, two-color overprint, grainy texture, retro --ar 4:5 · isometric 3D illustration of a tiny cozy coffee shop --ar 1:1

Fig 5.7Speak the printer's language. “Two-color overprint, grainy texture” is what makes a riso a riso. minimalist flat vector illustration of a city skyline at sunset, limited palette --ar 16:9 · bold risograph print poster, two-color overprint, grainy texture, retro --ar 4:5 · isometric 3D illustration of a tiny cozy coffee shop --ar 1:1

↑ All chapters

6.1The flagship comparison — Hasselblad vs iPhone vs disposable V8.1

Midjourney was trained on millions of captioned photographs, so a camera's name carries its whole visual culture. Same woman, same kitchen, same morning — three cameras.

“Hasselblad X2D, medium format” summons editorial polish: controlled light, perfect falloff. “Shot on an iPhone” loosens everything — wider, brighter, casual. “Disposable film camera, direct flash” drags the scene twenty years backwards into snapshot nostalgia. None of these cameras exist in the image; only their reputations do.

kitchen portrait, Hasselblad look — Fig 6.1Three cameras that were never there. a young woman laughing at a kitchen table, morning light, shot on a Hasselblad X2D, 80mm, medium format --seed 500 --style raw · shot on an iPhone · shot on a disposable film camera, direct flash

kitchen portrait, iPhone look — Fig 6.1Three cameras that were never there. a young woman laughing at a kitchen table, morning light, shot on a Hasselblad X2D, 80mm, medium format --seed 500 --style raw · shot on an iPhone · shot on a disposable film camera, direct flash

6.2Camera body associations V8.1 · seed 12

One stylish older man, six cameras. Each body name nudges colour, contrast, and grain toward the photographers who famously used it.

street portrait, Leica look — Fig 6.2Six reputations, one face (or so the seed intends). Documentary classicism from the Leica, clinical sharpness from the mirrorless flagships, cinematic grading from the RED, slow-stare formality from large format. Note the model drifts on the man's identity between jobs — camera language steers *look*, not character consistency; that's Omni Reference's job (Part 3). a street portrait of a stylish older man, overcast daylight, shot on a Leica M6, 35mm film --seed 12 --style raw · Canon EOS R5, 50mm · Sony A7R V, 85mm · Nikon Z9 · a RED cinema camera, anamorphic · a 4×5 large format film camera

street portrait, Canon look — Fig 6.2Six reputations, one face (or so the seed intends). Documentary classicism from the Leica, clinical sharpness from the mirrorless flagships, cinematic grading from the RED, slow-stare formality from large format. Note the model drifts on the man's identity between jobs — camera language steers *look*, not character consistency; that's Omni Reference's job (Part 3). a street portrait of a stylish older man, overcast daylight, shot on a Leica M6, 35mm film --seed 12 --style raw · Canon EOS R5, 50mm · Sony A7R V, 85mm · Nikon Z9 · a RED cinema camera, anamorphic · a 4×5 large format film camera

6.3Focal length — the walk backwards V8.1 · seed 77

From 24mm to 200mm, the camera “walks away” while the lens pulls back in. Watch the background compress from sprawling plaza to wall of bokeh.

24mm plaza shot — Fig 6.3The full sweep, 24mm to 200mm. At 24mm the plaza is the subject; by 50mm the figure owns the frame; at 85mm it's a portrait; at 200mm the city is wallpaper. a person standing in a city plaza, 24mm wide angle --seed 77 --ar 16:9 · 35mm · 50mm · 85mm portrait · 200mm telephoto, heavy background compression

35mm plaza shot — Fig 6.3The full sweep, 24mm to 200mm. At 24mm the plaza is the subject; by 50mm the figure owns the frame; at 85mm it's a portrait; at 200mm the city is wallpaper. a person standing in a city plaza, 24mm wide angle --seed 77 --ar 16:9 · 35mm · 50mm · 85mm portrait · 200mm telephoto, heavy background compression

85mm grid — Fig 6.3bThe 85mm stop, grid view. The lens steps fully into portrait territory — chest-up, melting background.

6.4Aperture — f/1.2 vs f/8 V8.1

One number decides whether the world behind your subject exists.

f/1.2 grid — Fig 6.4Creamy bokeh vs everything-in-focus. a coffee cup on a busy café table, 50mm at f/1.2, creamy bokeh, background melts away --ar 16:9 a coffee cup on a busy café table, 50mm at f/8, everything in sharp focus --ar 16:9

f/8 grid — Fig 6.4Creamy bokeh vs everything-in-focus. a coffee cup on a busy café table, 50mm at f/1.2, creamy bokeh, background melts away --ar 16:9 a coffee cup on a busy café table, 50mm at f/8, everything in sharp focus --ar 16:9

coffee cup at f/1.2 — Fig 6.4bFull resolution, side by side. At f/1.2 the café dissolves into discs of light; at f/8 every chair, cup and window holds focus.

coffee cup at f/8 — Fig 6.4bFull resolution, side by side. At f/1.2 the café dissolves into discs of light; at f/8 every chair, cup and window holds focus.

6.5Specialty lenses V8.1

Tilt-shift miniaturises a city; fisheye bends a trick into a bubble; a macro lens turns a wildflower field into a single bee's world.

tilt-shift city grid — Fig 6.5Three lenses that are really three ideas. a city intersection from above, tilt-shift lens, miniature toy-town effect · a skateboarder mid-trick, fisheye lens, extreme distortion · a field of wildflowers, macro lens, single bee in sharp focus --ar 4:5

fisheye skateboarder grid — Fig 6.5Three lenses that are really three ideas. a city intersection from above, tilt-shift lens, miniature toy-town effect · a skateboarder mid-trick, fisheye lens, extreme distortion · a field of wildflowers, macro lens, single bee in sharp focus --ar 4:5

tilt-shift city, full resolution — Fig 6.5bAll three lenses, full resolution. A toy-town intersection, a trick bent into a bubble, and one bee's universe.

fisheye skateboarder, full resolution — Fig 6.5bAll three lenses, full resolution. A toy-town intersection, a trick bent into a bubble, and one bee's universe.

6.6Film stocks V8.1

Four emulsions, one couple, one sunset. Film-stock names are colour-grading presets hiding in plain language.

beach couple, Portra — Fig 6.6Same beach, four chemistries. Portra keeps skin honest and warm; Velvia super-saturates the sky; CineStill cools into tungsten halation; Ilford strips it to silver. a couple walking on a beach at sunset, shot on Kodak Portra 400, warm natural skin tones, film grain · Fujifilm Velvia 50, ultra-saturated · CineStill 800T, tungsten, halation · Ilford HP5, black and white, classic grain

beach couple, Velvia — Fig 6.6Same beach, four chemistries. Portra keeps skin honest and warm; Velvia super-saturates the sky; CineStill cools into tungsten halation; Ilford strips it to silver. a couple walking on a beach at sunset, shot on Kodak Portra 400, warm natural skin tones, film grain · Fujifilm Velvia 50, ultra-saturated · CineStill 800T, tungsten, halation · Ilford HP5, black and white, classic grain

6.7The lighting cheat-sheet V8.1 · seed 4

Six lighting names, one studio portrait. This spread is the page readers will dog-ear.

Rembrandt lighting portrait — Fig 6.7Light is a vocabulary. Each term sets direction, temperature and mood in one word. studio portrait of a woman, Rembrandt lighting, single soft key, dark background --seed 4 --ar 16:9 · golden hour backlight, warm rim light, lens flare · blue hour, cool ambient light, soft shadows · high-key lighting, bright even white · low-key film noir lighting, hard shadows, single hard light · neon lighting, magenta and cyan, night, urban

golden hour portrait — Fig 6.7Light is a vocabulary. Each term sets direction, temperature and mood in one word. studio portrait of a woman, Rembrandt lighting, single soft key, dark background --seed 4 --ar 16:9 · golden hour backlight, warm rim light, lens flare · blue hour, cool ambient light, soft shadows · high-key lighting, bright even white · low-key film noir lighting, hard shadows, single hard light · neon lighting, magenta and cyan, night, urban

6.8Shot type & composition V8.1 · seed 31

One climber, six framings — from the determination in her eyes to a speck on a glacier. Framing words are directing words.

extreme close-up of climber's eyes — Fig 6.8The director's six. Same mountain story, six emotional registers — intimacy, process, solitude, scale, heroism, unease. extreme close-up of a mountain climber's determined eyes --seed 31 --ar 16:9 · medium shot, checking gear on a ridge · full-body wide shot on a vast snowy face · aerial bird's-eye view, a speck on a glacier · low-angle hero shot against the sky · Dutch-angle shot, tense, tilted horizon

medium shot climber checking gear — Fig 6.8The director's six. Same mountain story, six emotional registers — intimacy, process, solitude, scale, heroism, unease. extreme close-up of a mountain climber's determined eyes --seed 31 --ar 16:9 · medium shot, checking gear on a ridge · full-body wide shot on a vast snowy face · aerial bird's-eye view, a speck on a glacier · low-angle hero shot against the sky · Dutch-angle shot, tense, tilted horizon

6.9The fully specified photographic prompt V8.1

Everything in this part, assembled into one sentence: scene, camera, lens, aperture, film stock, light, composition.

Fig 6.9The graduation image. Subject and place · Hasselblad X1D · 85mm at f/1.9 · Kodak Portra 400 · soft directional light · rule of thirds — every clause earns a visible effect. cinematic film still of a woman in a red coat crossing a foggy cobblestone street at dawn, distant warm streetlights, shot on a Hasselblad X1D, 85mm at f/1.9, shallow depth of field, Kodak Portra 400, soft directional light, rule of thirds --ar 16:9 --style raw

↑ All chapters

7.1Vary Region — select and repaint V8.1 · Edit

Mask a region, re-prompt only that region. The rest of the image is never touched. Best on 20–50% of the frame.

woman in plain white t-shirt, base grid — Fig 7.1A wardrobe change, nothing else. Same woman, same wall, same light — only the masked region was re-imagined. a woman in a plain white t-shirt standing against a brick wall --ar 4:5 --v 8.1 → Edit · mask the shirt · region prompt: a denim jacket over a striped shirt

editor with the t-shirt masked — Fig 7.1A wardrobe change, nothing else. Same woman, same wall, same light — only the masked region was re-imagined. a woman in a plain white t-shirt standing against a brick wall --ar 4:5 --v 8.1 → Edit · mask the shirt · region prompt: a denim jacket over a striped shirt

7.2Retexture — same structure, new skin V8.1 · Edit

Retexture regenerates the whole image in a new style while keeping its structure — composition, geometry and pose survive; the surface changes.

bicycle photo in the editor — Fig 7.2One bicycle, three materials. The frame never moves; the lean against the wall never changes — only the world it's rendered in. Compare with `--sref` (Part 3), which restyles at *generation* time; Retexture restyles an image you already have. a simple still life of a bicycle leaning against a wall --ar 3:2 --v 8.1 → Edit · Retexture: watercolor painting · then: neon-lit 3D render at night

bicycle retextured as a watercolor painting — Fig 7.2One bicycle, three materials. The frame never moves; the lean against the wall never changes — only the world it's rendered in. Compare with `--sref` (Part 3), which restyles at *generation* time; Retexture restyles an image you already have. a simple still life of a bicycle leaning against a wall --ar 3:2 --v 8.1 → Edit · Retexture: watercolor painting · then: neon-lit 3D render at night

7.3Canvas-expansion as storytelling V8.1 · Edit

The same Edit workflow as the lighthouse in 1.3 — but used narratively. A tight, anonymous frame; then the canvas widens and the story arrives.

Fig 7.3The reveal. The candle never changes — the world grows around it. Keep the added-region prompt about the *new* space only. a single candle burning in darkness --ar 1:1 --v 8.1 → Edit · widen canvas · added-region prompt: the candle in a vast cathedral hall

the candle revealed in a vast cathedral — Fig 7.3The reveal. The candle never changes — the world grows around it. Keep the added-region prompt about the *new* space only. a single candle burning in darkness --ar 1:1 --v 8.1 → Edit · widen canvas · added-region prompt: the candle in a vast cathedral hall

editor with the candle and a widened empty canvas — Fig 7.3bThe mechanics. The widened frame waits empty around the original square before the cathedral is prompted into existence.

candle base grid — Fig 7.3bThe mechanics. The widened frame waits empty around the original square before the cathedral is prompted into existence.

7.4Tiling — seamless patterns --tile

The --tile flag makes every edge match its opposite edge — one generation becomes infinite wallpaper.

seamless botanical pattern tile — Fig 7.4One tile, infinite fabric. Look for the joins in the right-hand panel — there aren't any. Made for textiles, wrapping paper, game textures and website backgrounds. a seamless pattern of botanical leaves and berries, flat illustration --tile --ar 1:1

the same tile repeated 3 by 3 — Fig 7.4One tile, infinite fabric. Look for the joins in the right-hand panel — there aren't any. Made for textiles, wrapping paper, game textures and website backgrounds. a seamless pattern of botanical leaves and berries, flat illustration --tile --ar 1:1

7.5Image-to-video — pressing Animate V1 video

Any strong still can move. Generate, press Animate, describe the motion — Low motion for ambience, High when everything should travel. Image references aren't compatible with video.

Japanese garden still grid — Fig 7.5From frame to film. The chosen still goes in; a motion prompt — not an image prompt — tells it how to breathe. a lantern-lit Japanese garden at night, koi pond, falling maple leaves --ar 16:9 motion: leaves drift down, koi swim, lantern light flickers, gentle ripples on the water

the Animate interface with motion prompt — Fig 7.5From frame to film. The chosen still goes in; a motion prompt — not an image prompt — tells it how to breathe. a lantern-lit Japanese garden at night, koi pond, falling maple leaves --ar 16:9 motion: leaves drift down, koi swim, lantern light flickers, gentle ripples on the water

Fig 7.5bThe result, playing live. Five seconds of falling maple leaves, drifting koi and flickering lanterns — ≈8× the GPU cost of a still, and worth every minute. (Print edition: this becomes a 3-frame filmstrip.)

↑ All chapters

8.1Conversational vs Midjourney-style V8.1

Midjourney isn't a chatbot. Politeness, filler and full sentences become noise the model must interpret around.

cat from a conversational prompt — Fig 8.1Both work — one is steered. The conversational version lands somewhere pleasant; the descriptive version lands where you aimed: windowsill, warm afternoon, soft background, 4:5. Hello, could you please make me a really nice picture of a cat that is sitting by a window looking thoughtful in the afternoon? a thoughtful cat sitting on a windowsill, warm afternoon light, soft focus background --ar 4:5

cat from a clean descriptive prompt — Fig 8.1Both work — one is steered. The conversational version lands somewhere pleasant; the descriptive version lands where you aimed: windowsill, warm afternoon, soft background, 4:5. Hello, could you please make me a really nice picture of a cat that is sitting by a window looking thoughtful in the afternoon? a thoughtful cat sitting on a windowsill, warm afternoon light, soft focus background --ar 4:5

8.2Over-stuffed vs trimmed V8.1

Superlatives are not instructions. V7 and V8 reward restraint — four concrete nouns beat fourteen adjectives.

over-stuffed prompt result — Fig 8.2The kitchen-sink prompt vs the brief. Both are mountain lakes at sunset — but the trimmed one has a composition, because every word in it was a decision. a beautiful stunning gorgeous highly detailed ultra realistic amazing epic majestic breathtaking landscape with mountains and a lake and trees and a sunset and birds and clouds and reflections, 8k, masterpiece, award winning a mountain lake at sunset, mirror reflection, pine forest, dramatic clouds --ar 16:9

trimmed prompt result — Fig 8.2The kitchen-sink prompt vs the brief. Both are mountain lakes at sunset — but the trimmed one has a composition, because every word in it was a decision. a beautiful stunning gorgeous highly detailed ultra realistic amazing epic majestic breathtaking landscape with mountains and a lake and trees and a sunset and birds and clouds and reflections, 8k, masterpiece, award winning a mountain lake at sunset, mirror reflection, pine forest, dramatic clouds --ar 16:9

8.3Forgetting the aspect ratio V8.1

A panorama needs a panoramic canvas. Set --ar before you generate, not after you're disappointed.

desert highway in the default frame — Fig 8.3Give the road room. Without `--ar` the scene squeezes into whatever frame the model defaults to; at 21:9 the horizon finally behaves like a horizon. a panoramic desert highway disappearing into the horizon a panoramic desert highway disappearing into the horizon --ar 21:9

desert highway at 21:9 — Fig 8.3Give the road room. Without `--ar` the scene squeezes into whatever frame the model defaults to; at 21:9 the horizon finally behaves like a horizon. a panoramic desert highway disappearing into the horizon a panoramic desert highway disappearing into the horizon --ar 21:9

8.4Seed reproducibility V8.1 · “99% identical”

The same seeded prompt, run twice on different occasions. Seeds make experiments fair — and revisions possible.

paper boat, first run — Fig 8.4Two runs, one boat. Near-identical — note V8.1 promises “99% identical,” not pixel-exact. Lock a seed when you want to test one variable at a time; that discipline built every comparison in this book. a paper boat floating on a puddle, reflection of city lights --seed 4242 × 2

paper boat, second run — Fig 8.4Two runs, one boat. Near-identical — note V8.1 promises “99% identical,” not pixel-exact. Lock a seed when you want to test one variable at a time; that discipline built every comparison in this book. a paper boat floating on a puddle, reflection of city lights --seed 4242 × 2

↑ All chapters

AParameter quick reference

Parameter	Range / default	What it does
`--ar`	up to 14:1 · def 1:1	Aspect ratio — set it before you generate
`--s` stylize	0–1000 · def 100	How much Midjourney aesthetic is imposed
`--c` chaos	0–100 · def 0	Variety between the four grid images
`--w` weird	0–3000 · def 0	Unconventionality of each image
`--style raw`	flag	Literal mode — the realism foundation
`--no`	nouns	Negative prompt
`--seed`	0–4.29bn	Reproducibility (V8.1: “99% identical”)
`--tile`	flag	Seamless repeating patterns
`--sref / --sw`	sw 0–1000 · def 100	Style reference and its strength V7
`--oref / --ow`	ow 1–1000 · def 100	Omni Reference — characters & objects V7 only
`--iw`	up to 3	Image-prompt vs text influence V7
`--p`	moodboard ID	Personalization / moodboards (strength via --s)
`--v / --niji`	6–8.1 · niji 7	Model version — tag every figure

BReusable control subjects

Pick one per chapter and reuse it across every variable so readers see apples-to-apples: people — “a young woman laughing at a kitchen table” / “a weathered fisherman”; object — “a single ripe pear on a wooden table”; scene — “a lone lighthouse on a rocky cliff”; action — “a mountain climber on a ridge.”

CRights, rules & the law drafted · verify before print

Ownership: paid subscribers own what they create “to the fullest extent possible under applicable law,” with broad commercial rights that survive cancellation; companies grossing over $1M/year must be on Pro or Mega. Copyrightability: the US Copyright Office holds that purely AI-generated images are not copyrightable — only substantial human authorship is protectable, so refine outputs in an editor to strengthen claims. Moderation: roughly PG-13, enforced everywhere including private and stealth work. Litigation: Disney/Universal (June 2025) and Warner Bros. Discovery (Sept 2025) suits remain active through 2026 — this chapter must be re-verified at press time. Best practice taught throughout this book: lean on medium, technique and era language plus image-based --sref, not living artists' names.

↑ All chapters

Chapters

Fundamentals

Prompting Techniques

Style References & Omni

Raw Mode & Realism

Mediums & Artistic Styles

Camera & Lens Language

Advanced Workflows

Troubleshooting

Appendices

Front MatterHow to read this book

Current as of June 2026 — read this box first

You pay in GPU time, not in images

Fundamentals

1.1The very first prompt — anatomy of a prompt V8.1

1.2The grid and the V8.1 Creation Actions V8.1

1.3Expanding the canvas via Edit V8.1 · Edit

1.4Describe — reverse-prompting workflow

Prompting Techniques

2.1Word order & front-loading V8.1

2.2Specific vs vague descriptors V8.1

2.3The stylize dial — --s V8.1 · seed 2200

2.4The chaos dial — --c V8.1

2.5The weird dial — --w V8.1

2.6Multi-prompts & weights — :: V6.1 · version-dependent

2.7Negative prompting — --no V8.1

2.8Permutation prompts — { } V8.1 · Fast/Turbo only

2.9Image weight — --iw V7

Style References, Moodboards & Omni Reference

3.1Style reference basics — --sref V7

3.2Style weight — --sw V7 · sref 1234567890

3.3Blending multiple style references V7

3.4Omni Reference — the consistency centerpiece V7 only

3.5Omni weight — --ow V7

3.6Moodboards & personalization — --p V7

Raw Mode & Realism Controls

4.1Default vs Raw — the single most useful realism comparison V8.1 · seed 88

4.2The realism recipe — Raw + low stylize V8.1

Mediums & Artistic Styles

5.1The medium gallery — one pear, twelve hands V8.1

5.2Watercolour — a deeper dive V8.1

5.3Oil painting — a deeper dive V8.1

5.4Anime & Niji Niji 7

5.5Photorealism & hyperrealism V8.1 · raw

5.6Concept art, matte painting & illustration V8.1

5.7Graphic, vector & flat styles V8.1

Camera, Lens & Photography Language

6.1The flagship comparison — Hasselblad vs iPhone vs disposable V8.1

6.2Camera body associations V8.1 · seed 12

6.3Focal length — the walk backwards V8.1 · seed 77

6.4Aperture — f/1.2 vs f/8 V8.1

6.5Specialty lenses V8.1

6.6Film stocks V8.1

6.7The lighting cheat-sheet V8.1 · seed 4

6.8Shot type & composition V8.1 · seed 31

6.9The fully specified photographic prompt V8.1

Advanced Workflows

7.1Vary Region — select and repaint V8.1 · Edit

7.2Retexture — same structure, new skin V8.1 · Edit

7.3Canvas-expansion as storytelling V8.1 · Edit

7.4Tiling — seamless patterns --tile

7.5Image-to-video — pressing Animate V1 video

Troubleshooting — “Why Did It Do That?”

8.1Conversational vs Midjourney-style V8.1

8.2Over-stuffed vs trimmed V8.1

8.3Forgetting the aspect ratio V8.1

8.4Seed reproducibility V8.1 · “99% identical”

Parameters, Control Subjects, Rights & Law

AParameter quick reference

BReusable control subjects

CRights, rules & the law drafted · verify before print

2.3The stylize dial — `--s` V8.1 · seed 2200

2.4The chaos dial — `--c` V8.1

2.5The weird dial — `--w` V8.1

2.6Multi-prompts & weights — `::` V6.1 · version-dependent

2.7Negative prompting — `--no` V8.1

2.8Permutation prompts — `{ }` V8.1 · Fast/Turbo only

2.9Image weight — `--iw` V7

3.1Style reference basics — `--sref` V7

3.2Style weight — `--sw` V7 · sref 1234567890

3.5Omni weight — `--ow` V7

3.6Moodboards & personalization — `--p` V7