01 Fundamentals02 Prompting Techniques03 Style References04 Raw Mode05 Mediums06 Camera07 Advanced Workflows08 Troubleshooting09 Appendices
A painter's brushstrokes turning into a real landscape

A Visual Field Guide

The Midjourney

Book

Scroll to open ↓

From your very first prompt to studio-grade control — how to think, see, and create with the world's most opinionated image model. Ten chapters, every figure a controlled experiment, every prompt copy-pasteable.

Chapters

Pick a door. Each chapter opens with its hero and reads top to bottom.

Front MatterHow to read this book

Midjourney rewards people who learn to describe what they see. This book teaches that skill the way a photography book would — with pictures first, side-by-side comparisons, and the smallest possible change between any two images.

Every figure follows one discipline: change only the variable being taught. Same subject, same seed where it matters, one swapped phrase or parameter. The captions tell you what moved; your eyes do the rest. Every prompt printed beside a figure is copy-pasteable.

Current as of June 2026 — read this box first

Midjourney moves fast. As this book went to press: V7 is the documented platform default (released April 2025); V8.1 is the newest model (April 30, 2026) — 4–5× faster, native 2K “HD” output — but the whole V8 line is officially alpha and changing, sometimes without notice. Several mature features (Omni Reference, multi-prompting, Niji, the Quality parameter, Draft Mode, Turbo) live in V7 and are not yet in V8.1. Every figure in this book is tagged with the model that produced it. Verify anything version-specific against docs.midjourney.com before you rely on it.

You pay in GPU time, not in images

The single most misunderstood thing for beginners. Plans ($10–$120/month) buy Fast GPU hours — a standard image costs roughly one GPU-minute; V8.1 HD ≈ 1.33 minutes; video ≈ 8× an image. Standard plans and above add unlimited Relax mode (queued, slower, free). Explore in Relax, spend Fast hours on finals. Unused Fast hours don't roll over.

01

Part One

Fundamentals

A prompt is not a wish — it's a description. In four images, watch a single word grow into a photograph, then meet the buttons that turn one result into many.

1.1The very first prompt — anatomy of a prompt V8.1

Midjourney's prompt grammar is simple: subject → context → lighting → style/camera → parameters. Each layer you add removes a decision from the model and hands it to you.

Start with two words and Midjourney fills every gap with its own taste. Add a setting and it stops guessing where you are. Add light and it stops guessing when. Add a lens and it stops guessing how the scene was seen. The fox below never changes species — only how much of the picture is yours.

1 · subjecta fox
2 · + contexta red fox sitting in a forest
3 · + lightingmisty pine forest at dawn
4 · + camera & params85mm, style raw
Fig 1.1Same animal, escalating control. Subject → context → lighting → style/camera + parameters. a fox a red fox sitting in a forest a red fox sitting in a misty pine forest at dawn, soft golden light a red fox sitting in a misty pine forest at dawn, soft golden backlight, shallow depth of field, photorealistic, shot on 85mm lens --ar 3:2 --style raw
grid · a foxfox grid in the web app
grid · forestred fox forest grid
grid · dawnmisty dawn grid
grid · full promptfull prompt grid
Fig 1.1bWhat you actually see. Every job returns four candidates in the web app's Create feed. Notice how the variety within each grid shrinks as the prompt gets more specific — fewer gaps for the model to fill.

1.2The grid and the V8.1 Creation Actions V8.1

One prompt, four candidates — then a small row of verbs: Vary (Subtle/Strong), Rerun, Edit, Use (Image/Style/Prompt), Animate.

V8.1's action row is leaner than the classic Discord-era buttons. There is no standalone Upscale — images render at native HD (2K) — and no one-click Zoom Out or Pan; canvas work lives inside Edit. If you've seen U1–U4 / V1–V4 buttons in older tutorials, that's the Discord / V6-era interface.

the gridballoon grid
hover · creation actionsVary Subtle / Vary Strong / Animate
Fig 1.2One prompt, four candidates — and the verbs that act on them. Hovering a tile reveals Vary Subtle, Vary Strong and Animate; the full action set sits in the panel to the right. a vintage hot air balloon drifting over a patchwork of green farmland, morning haze, gentle clouds --ar 3:2 --v 8.1
grid pickselected balloon
rerun · new jobrerun balloon
Fig 1.2bRerun is V8.1's reroll. The same prompt run twice: same brief, different dice. Curate across jobs, not just within one grid.
Vary SubtleVary Subtle result grid
Vary StrongVary Strong result grid
Fig 1.2cSubtle keeps the painting; Strong repaints it. The same chosen balloon, varied two ways: Subtle nudges clouds and light while the composition holds; Strong renegotiates framing, sky and landscape. Use Subtle to polish a keeper, Strong to escape a near-miss.

1.3Expanding the canvas via Edit V8.1 · Edit

On V8.1, the canvas-expansion tools aren't buttons on the result — you open Edit, widen the frame, and generate into the empty space. The original pixels stay put.

Keep the added-region prompt short and literal — you're describing only what the new space should contain, not re-describing the whole picture.

base · 1:1lighthouse base
01 · open in Editeditor open
02 · widen canvascanvas expanded
03 · extend againcanvas expanded further
Fig 1.3The Edit workflow, step by step. A square lighthouse becomes a panorama: open Edit, drag the frame wider, prompt only the new region. a lone lighthouse on a rocky cliff, dramatic stormy sky, crashing waves --ar 1:1 --v 8.1 → Edit · widen canvas · added-region prompt: rocky coastline and ocean
extended lighthouse panorama
Fig 1.3bThe finished extension (29:18). The original square survives untouched in the centre-right; everything else was imagined to order. On Discord / V6.1 this is the one-click Zoom Out / Pan — label your figures by interface.

1.4Describe — reverse-prompting workflow

Midjourney can run backwards: feed it an image, and Describe returns four prompts that would plausibly produce it. It is the fastest way to learn how the model "hears" pictures.

Describe · four prompts backDescribe panel showing the uploaded photo and four suggested prompts
Fig 1.4Midjourney running backwards. One uploaded photo (right) comes back as four candidate prompts — each a different reading of the same image: the scene, the materials, the lighting, the mood. Reading them teaches you how the model "hears" pictures.
generated result: a young fashion designer sketching in a Parisian studio
Fig 1.4bFrom description back to image. One of the four prompts, lightly adapted and re-run — the loop closed. Adapt, don't just run: the returned prompts are raw material, not finished briefs. a young male fashion designer in a chic Parisian studio, sketching at a worktable, drapes of white fabric, soft window light
↑ All chapters
02

Part Two

Prompting Techniques

Word order, specificity, and four dials — stylize, chaos, weird, and no — that decide how much of the picture is yours and how much is Midjourney's.

2.1Word order & front-loading V8.1

Midjourney weights earlier words more heavily. What you put first dominates the composition.

knight firstknight-first grid
castle firstcastle-first grid
Fig 2.1Identical words, reordered. Lead with the knight and you get portraits of a knight; lead with the castle and the architecture takes the frame. a knight in golden armor standing before a small grey castle a small grey castle behind a knight in golden armor
knight firstknight first pick
castle firstcastle first pick
Fig 2.1bFull resolution, side by side. Knight-first fills the frame with armour; castle-first pushes the knight into the midground and lets the architecture breathe.

2.2Specific vs vague descriptors V8.1

"Nice" and "good" are decisions you're asking the model to make. Concrete nouns — heather, storm clouds, a distant loch — are decisions you've already made.

vaguea nice landscape with good lighting
specificwindswept Scottish highland glen
Fig 2.2Vague → pleasant but generic. Specific → a place. a nice landscape with good lighting a windswept Scottish highland glen, heather in bloom, low storm clouds breaking into golden shafts of light, distant loch --ar 16:9

2.3The stylize dial — --s V8.1 · seed 2200

Stylize controls how much of Midjourney's own lush aesthetic gets imposed on your prompt. 0 = literal and plain; 750 = the house style takes over. Default is 100.

This is the most important dial in the book. Identical prompt, identical seed — only --s changes. Watch the humble jug become an ornament.

--s 0stylize 0
--s 100 (default)stylize 100
--s 400stylize 400
--s 750stylize 750
Fig 2.3The stylize sweep. At 0 the jug is earthenware and the light is honest. By 750 every surface is hand-painted majolica and the lemons glow. Neither is "better" — they are different amounts of Midjourney. a still life of lemons and a ceramic jug on a wooden table, natural window light --s 0 --seed 2200 · then --s 100 · 400 · 750

2.4The chaos dial — --c V8.1

Chaos doesn't change one image — it changes how different the four grid candidates are from each other. Show the whole grid, every time.

--c 0chaos 0 grid
--c 25chaos 25 grid
--c 60chaos 60 grid
Fig 2.4Four siblings → four strangers. At 0 the grid agrees on one idea. At 60 you get upside-down islands, dead trees and sunsets you never asked for — great for discovery, risky for briefs. a surreal floating island with a waterfall pouring into clouds --c 0 · then --c 25 · --c 60
c 0 pickchaos 0 pick
c 25 pickchaos 25 pick
c 60 pickchaos 60 pick
Fig 2.4bOne pick from each grid, full resolution. The chaos-60 candidate barely remembers the brief — and that's the point.

2.5The weird dial — --w V8.1

Where chaos varies the grid, weird bends each image itself away from convention. A teapot "designed by nature" is the perfect stress test.

--w 0weird 0 grid
--w 250weird 250 grid
--w 1000weird 1000 grid
Fig 2.5From teapot to driftwood sculpture. At 0 every candidate still pours tea. At 1000 the model keeps the soul of the prompt and abandons the object. a teapot designed by nature, organic forms --w 0 · then --w 250 · --w 1000
w 0weird 0 pick
w 250weird 250 pick
w 1000weird 1000 pick
Fig 2.5bThe full sweep at full resolution. At 0 it pours tea; at 250 it's driftwood that remembers being a teapot; at 1000 it's a striped orb in a wire cage that has clearly never met tea.

2.6Multi-prompts & weights — :: V6.1 · version-dependent

The double colon splits a prompt into separate concepts; numbers set their relative emphasis. space ship is one idea. space:: ship is two.

Officially supported in V6/V6.1 and version-dependent in V7/V8.1 — flag every multi-prompt figure with the model used.

space ship as one concept
Fig 2.6“space ship” — read as one concept. A spaceship, as expected. space ship
space:: shipspace and ship as two concepts
balloon::2 city::1weighted balloon over city
Fig 2.6bThe split and the weights. V6.1 With the double colon, “space” and “ship” become separate ingredients — a vessel sailing the cosmos rather than a spacecraft. Weights then set the mix: balloon::2 city::1 keeps the balloon the star and the city the supporting act. space:: ship --ar 16:9 --v 6.1 hot air balloon::2 city below::1 --ar 3:2 --v 6.1

2.7Negative prompting — --no V8.1

Telling Midjourney what to leave out. Imperfect, occasionally stubborn, but the simplest subtraction tool you have.

defaultfruit bowl with bananas
--no bananasfruit bowl no bananas
Fig 2.7Subtraction by flag. A multi-word --no phrase parses poorly; keep it to simple nouns. a bowl of mixed fruit on a table a bowl of mixed fruit on a table --no bananas

2.8Permutation prompts — { } V8.1 · Fast/Turbo only

Curly braces turn one line into a batch: one prompt, three jobs, each consuming GPU time. The cleanest way to run a controlled material study.

ceramicceramic vase
glassglass vase
coppercopper vase
Fig 2.8One line → three jobs. Everything held constant except the material inside the braces. a {ceramic, glass, copper} vase holding white peonies, soft studio light --ar 4:5

2.9Image weight — --iw V7

When a prompt contains a reference image, --iw decides who wins: low = the text; high = the picture.

The reference here is this book's own lighthouse (Fig 1.3). The text asks for a castle on a hill — watch the lighthouse's storm, cliff and mood take over as --iw rises.

--iw 0.5image weight 0.5 grid
--iw 1.5image weight 1.5 grid
--iw 3image weight 3 grid
Fig 2.9Who wins — the words or the picture? At 0.5 the text wins: fairy-tale castles, sunset skies. At 3 the reference wins: dark storms, rocky coast, crashing waves — castles that are lighthouses in everything but name. [lighthouse image] a castle on a hill --iw 0.5 --ar 16:9 --v 7 · then --iw 1.5 · --iw 3
iw 0.5 · text winsimage weight 0.5 pick
iw 1.5 · negotiationimage weight 1.5 pick
iw 3 · image winsimage weight 3 pick
Fig 2.9bOne pick from each weight, full resolution.
↑ All chapters
03

Part Three

Style References, Moodboards & Omni Reference

Midjourney's real differentiation: borrow a look with --sref, average a taste with moodboards, and put the same character in any scene with Omni Reference. The spine of the advanced half of this book.

3.1Style reference basics — --sref V7

The subject stays constant; the look is borrowed from a numeric style code. Use --sref random to discover codes, then lock the keepers.

A style code is a coordinate in Midjourney's space of aesthetics — pure look, no content. The same rainy café below is rendered three times; only the code changes, and with it the entire visual language: warm storybook paint, lantern-lit old-world atmosphere, hard-edged flat colour.

--sref 466919608café grid, sref 466919608
--sref 1234567890café grid, sref 1234567890
--sref 987654321café grid, sref 987654321
Fig 3.1Same café, three borrowed looks. Each grid keeps its own internal consistency — that's what makes codes reusable across a whole project. (Codes are arbitrary numbers, not rankings; these were locked after exploring with --sref random.) a quiet city street café in the rain --sref 466919608 · then --sref 1234567890 · --sref 987654321
466919608café pick, code 466919608
1234567890café pick, code 1234567890
987654321café pick, code 987654321
Fig 3.1bOne pick from each code, full resolution. Subject, weather and time of day survive every restyle — only the aesthetic moves.

3.2Style weight — --sw V7 · sref 1234567890

How hard the borrowed style is pushed. 0–1000, default 100 — at 50 the code whispers; at 400 it shouts.

--sw 50style weight 50 grid
--sw 100style weight 100 grid
--sw 400style weight 400 grid
Fig 3.2The same code at three volumes. One mountain landscape, one style code, three weights — the painterly hand grows steadier and more insistent as --sw climbs. a mountain landscape --sref 1234567890 --sw 50 · then --sw 100 · --sw 400
sw 50style weight 50 pick
sw 100style weight 100 pick
sw 400style weight 400 pick
Fig 3.2bFull-resolution picks. The sw-100 frame is the regenerated --no animals version — the original run smuggled a mountain lion onto its rock (kept in Pics/ as a souvenir of Midjourney filling gaps the prompt left open).

3.3Blending multiple style references V7

Stack space-separated codes to fuse aesthetics. One code is a borrowed look; two codes are a negotiation.

one code · --sref 466502964astronaut grid, single style code
two codes · 1234567890 + 987654321astronaut grid, blended style codes
Fig 3.3Solo vs duet. A single code commits hard to one aesthetic (here: neon geometry). Stacking the book's two familiar codes from Fig 3.1 produces something neither would do alone — warm painterly texture with graphic colour. Start with one before stacking; many srefs at high weight create a muddled tug-of-war. a portrait of an astronaut --sref 466502964 --sw 150 --ar 16:9 --v 7 a portrait of an astronaut --sref 1234567890 987654321 --sw 150 --ar 16:9 --v 7
blended-style astronaut, full resolution
Fig 3.3bThe blend at full resolution. Codes are space-separated, not comma-separated — and the syntax matters: bracketed or doubled --sref flags parse differently (we have the failed attempts to prove it).

3.4Omni Reference — the consistency centerpiece V7 only

One reference image, four scenes, one recognisable explorer. This is the technique the identity-drift warnings in Parts 2 and 6 have been pointing at.

Market, ridge, tent, campfire — different light, different weather, different framing, same red scarf and same face. Keep --ow ≤ 400 so the reference doesn't override the scene, and budget 2× GPU per job. Not yet in V8.1.

bustling marketexplorer in a market
snowy ridgeexplorer climbing a snowy ridge
lantern-lit tentexplorer reading a map in a tent
campfire, nightexplorer laughing at a campfire
Fig 3.4Four postcards from one person. The same reference image used for the 3.5 weight sweep, held at --ow 200 throughout. a young explorer with a red scarf standing in a bustling market --oref [reference] --ow 200 --ar 3:2 --v 7 · climbing a snowy ridge · reading a map by lantern light in a tent · laughing at a campfire, night
attaching the referenceprompt bar with Omni Reference attachment menu
Fig 3.4bWhere the magic attaches. The paperclip menu offers three kinds of reference — Image Prompts (use the elements), Style References (use the look), and Omni Reference (use a person's likeness or an object's form). Same image, three very different jobs.

3.5Omni weight — --ow V7

With an Omni Reference attached, --ow decides how much the reference image rules. At 50 the character dissolves into the prompt; at 400 he's unmistakably the same man in every frame.

This is the dial behind consistent characters. Low weight treats the reference as a loose suggestion — the grids invent new faces, wardrobes, even whole genres. By 200 the face locks; by 400 the wardrobe and kit lock too. Keep --ow ≤ 400 so the reference doesn't fight the scene, and budget for it: Omni Reference costs 2× GPU.

--ow 50omni weight 50 grid
--ow 200omni weight 200 grid
--ow 400omni weight 400 grid
Fig 3.5One reference, three loyalties. At 50, four strangers in four art styles. At 200, the reference face appears in every tile. At 400, face, scarf and gear all hold. the explorer in a desert --oref [reference] --ow 50 --ar 16:9 · then --ow 200 · --ow 400
ow 50 · text winsomni weight 50 pick
ow 50 · second jobomni weight 50 alt pick
ow 200 · face locksomni weight 200 pick
ow 400 · everything locksomni weight 400 pick
Fig 3.5bFull-resolution picks across the sweep. The two ow-50 frames (from two separate jobs) don't even agree with each other; the ow-200 and ow-400 frames are recognisably one person.

3.6Moodboards & personalization — --p V7

A curated 5–10 image moodboard averaged into a reusable house style. Strength is driven by --s, not --sw.

Where a style code borrows one look, a moodboard distils your taste — feed it a handful of images you love and reference it by ID. The cups below carry the board's palette and mood without copying any single source image.

moodboard · “My First Moodboard” · --s 300moodboard coffee cup grid
Fig 3.6The board speaks through the grid. All four candidates share a palette and temperament no plain prompt would produce. a brand lifestyle photo of a coffee cup on a linen tablecloth --p [My First Moodboard] --s 300
moodboard coffee cup pick 1
moodboard coffee cup pick 2
Fig 3.6bTwo picks, one house style. Different compositions, same brand world — exactly what a moodboard is for.
↑ All chapters
04

Part Four

Raw Mode & Realism Controls

Raw strips Midjourney's beautifying hand for a more literal, documentary result — the foundation of every realism recipe in this book.

4.1Default vs Raw — the single most useful realism comparison V8.1 · seed 88

Same prompt, same seed; only --style raw added. Default applies Midjourney's signature beautifying hand — Raw strips it for a more literal, documentary result.

Look at what changes and what doesn't. The bakery, the flour, the morning light all survive. What Raw removes is the cinema: the default grid leans into moody shadows and hero lighting, while Raw settles for a man at work who happens to be photographed well. For realism recipes, Raw is the foundation everything else builds on.

defaultbaker grid, default style
--style rawbaker grid, raw style
Fig 4.1Two grids, one seed. The default grid is graded like a film still; the Raw grid is lit like a documentary. a candid portrait of an elderly baker dusted with flour, morning light in a bakery --seed 88 a candid portrait of an elderly baker dusted with flour, morning light in a bakery --style raw --seed 88
defaultbaker default pick
--style rawbaker raw pick
Fig 4.1bFull-resolution picks. Default poses him; Raw catches him.

4.2The realism recipe — Raw + low stylize V8.1

Pair --style raw with a modest stylize value (100–250) and documentary language, and Midjourney stops painting and starts reporting.

fisherman mending nets, raw s150
fisherman mending nets, raw s150, second pick
Fig 4.2The working formula. Overcast light, the word “documentary,” Raw mode, stylize 150 — nothing glamorous, everything believable. a weathered fisherman mending nets on a harbor wall, overcast light, documentary --style raw --s 150 --ar 4:5
↑ All chapters
05

Part Five

Mediums & Artistic Styles

Name the medium, the technique, and the era — specificity beats adjectives. One recurring control subject (“a single ripe pear on a wooden table”) rendered across a dozen media makes the gallery-wall spread of the book.

5.1The medium gallery — one pear, twelve hands V8.1

One subject, one sentence, twelve mediums. Naming the medium and its technique — “wet-on-wet washes,” “heavy impasto,” “lead lines, glowing backlight” — is the most powerful style lever in plain language.

This is the gallery-wall spread of the book. Read it like a museum room: the fruit never changes, only the hand that made it. Notice how the technique words do the real work — “cross-hatching” builds the graphite, “subsurface scattering” lights the 3D render from inside.

watercolorpear, watercolor
oil · impastopear, oil impasto
graphitepear, graphite sketch
charcoalpear, charcoal
gouachepear, gouache
pen & inkpear, pen and ink
3D render · Octanepear, 3D render
low-poly 3Dpear, low-poly
stained glasspear, stained glass
pixel artpear, pixel art
woodcutpear, woodcut print
photoreal · 100mm macropear, photorealistic macro
Fig 5.1Twelve mediums, one pear. Every prompt begins “a single ripe pear on a wooden table,” then names the medium and two or three technique words. a single ripe pear on a wooden table, watercolor painting, soft wet-on-wet washes, textured paper · oil painting, heavy impasto, palette knife · graphite pencil sketch, cross-hatching · charcoal, smudged shading · gouache, flat matte color · pen and ink line art · 3D render, Octane, subsurface scattering · low-poly 3D · stained glass, lead lines, glowing backlight · pixel art, 32-bit · woodcut print, two-color · photorealistic, 100mm macro --style raw

5.2Watercolour — a deeper dive V8.1

One medium, three temperaments: loose and bleeding, light and architectural, precise and scientific. The technique words steer it.

loose · wet-on-wetloose watercolor poppies
delicate · airy washeswatercolor Venice canals
botanical · precisebotanical fern study
Fig 5.2Three watercolours that share almost no DNA. “Wet-on-wet bleed” loosens, “minimal linework, light and airy” lightens, “scientific illustration style” tightens. loose watercolor painting of bright orange California poppies in a meadow, wet-on-wet bleed, white paper showing through --ar 3:2 delicate watercolor cityscape of Venice canals, soft washes, minimal linework, light and airy --ar 16:9 botanical watercolor study of fern fronds, precise detail, scientific illustration style --ar 4:5

5.3Oil painting — a deeper dive V8.1

Impasto, Old Master, Impressionist — three centuries of oil in three prompts. Era language is style language.

impasto · palette knifeexpressive oil portrait
baroque · Old Masterclassical oil storm at sea
impressionist · broken colorimpressionist sunlit garden
Fig 5.3Same pigment, different centuries. expressive oil portrait of a young woman, heavy impasto, visible palette-knife strokes, chiaroscuro --ar 4:5 --s 600 classical oil painting of a storm at sea, dramatic baroque lighting, Old Master style --ar 3:2 impressionist oil painting of a sunlit garden, dappled light, broken color, loose brushwork --ar 3:2

5.4Anime & Niji Niji 7

Niji 7 uses “precision prompting” — it renders what you specify and invents little, the opposite of V7's gap-filling. Describe the type of character; never name a copyrighted one.

dynamic actionanime action scene, cyberpunk swordswoman
slice of lifecozy anime ramen stall
whimsical · hand-drawnchild and forest spirit on a hill
90s retro cel90s anime mecha pilot
Fig 5.4Niji's range. Neon action, lantern-lit comfort, storybook whimsy, CRT-glow nostalgia — all from one model, all from explicit description. dynamic anime action scene, a young swordswoman deflecting glowing energy, neon cyberpunk city at night --ar 16:9 --niji 7 · cozy slice-of-life ramen stall · whimsical child and forest spirit · 90s retro cel mecha pilot, CRT glow, film grain --ar 4:3

5.5Photorealism & hyperrealism V8.1 · raw

Material realism: skin pores, refracted dew, condensation. Texture language plus --style raw — camera grammar gets all of Part 6.

skinextreme close-up weathered fisherman
water · refractiondewdrop on spider web macro
condensationcondensation on iced coffee glass
Fig 5.5Realism is a materials problem. “Visible skin pores,” “refracted light,” “water droplets” — name the physics and the model renders it. extreme close-up portrait of a weathered fisherman, visible skin pores, catchlight in the eyes --style raw --s 150 --ar 4:5 · hyperrealistic macro of a dewdrop on a spider's web at dawn --ar 1:1 · photorealistic still life of condensation on a cold glass of iced coffee --ar 4:5

5.6Concept art, matte painting & illustration V8.1

Three professional registers: cinematic environment, industrial design sheet, picture-book warmth.

matte paintingruined floating temple concept art
concept sheetsci-fi rover concept sheet
children's bookfriendly whale carrying a lighthouse
Fig 5.6Job-specific vocabularies. “Orthographic views, annotations” turns a picture into a design document; “soft textured shapes, warm palette” turns it into a bedtime story. epic fantasy environment concept art, a ruined floating temple above misty peaks, matte painting --ar 16:9 --s 600 · sci-fi vehicle concept sheet, orthographic views, annotations · children's book illustration, a friendly whale carrying a lighthouse --ar 3:2

5.7Graphic, vector & flat styles V8.1

Flat colour, overprint grain, isometric miniatures — graphic-design looks respond to print-trade vocabulary.

flat vectorflat vector city skyline at sunset
risographrisograph print poster
isometric 3Disometric tiny coffee shop
Fig 5.7Speak the printer's language. “Two-color overprint, grainy texture” is what makes a riso a riso. minimalist flat vector illustration of a city skyline at sunset, limited palette --ar 16:9 · bold risograph print poster, two-color overprint, grainy texture, retro --ar 4:5 · isometric 3D illustration of a tiny cozy coffee shop --ar 1:1
↑ All chapters
06

Part Six

Camera, Lens & Photography Language

Midjourney was trained on millions of captioned photographs — gear names carry looks. The richest set of comparison figures in the book lives here.

6.1The flagship comparison — Hasselblad vs iPhone vs disposable V8.1

Midjourney was trained on millions of captioned photographs, so a camera's name carries its whole visual culture. Same woman, same kitchen, same morning — three cameras.

“Hasselblad X2D, medium format” summons editorial polish: controlled light, perfect falloff. “Shot on an iPhone” loosens everything — wider, brighter, casual. “Disposable film camera, direct flash” drags the scene twenty years backwards into snapshot nostalgia. None of these cameras exist in the image; only their reputations do.

Hasselblad X2Dkitchen portrait, Hasselblad look
iPhonekitchen portrait, iPhone look
disposable + flashkitchen portrait, disposable camera look
Fig 6.1Three cameras that were never there. a young woman laughing at a kitchen table, morning light, shot on a Hasselblad X2D, 80mm, medium format --seed 500 --style raw · shot on an iPhone · shot on a disposable film camera, direct flash

6.2Camera body associations V8.1 · seed 12

One stylish older man, six cameras. Each body name nudges colour, contrast, and grain toward the photographers who famously used it.

Leica M6 · 35mmstreet portrait, Leica look
Canon EOS R5 · 50mmstreet portrait, Canon look
Sony A7R V · 85mmstreet portrait, Sony look
Nikon Z9street portrait, Nikon look
RED · anamorphicstreet portrait, RED cinema look
4×5 large formatstreet portrait, large format look
Fig 6.2Six reputations, one face (or so the seed intends). Documentary classicism from the Leica, clinical sharpness from the mirrorless flagships, cinematic grading from the RED, slow-stare formality from large format. Note the model drifts on the man's identity between jobs — camera language steers look, not character consistency; that's Omni Reference's job (Part 3). a street portrait of a stylish older man, overcast daylight, shot on a Leica M6, 35mm film --seed 12 --style raw · Canon EOS R5, 50mm · Sony A7R V, 85mm · Nikon Z9 · a RED cinema camera, anamorphic · a 4×5 large format film camera

6.3Focal length — the walk backwards V8.1 · seed 77

From 24mm to 200mm, the camera “walks away” while the lens pulls back in. Watch the background compress from sprawling plaza to wall of bokeh.

24mm wide24mm plaza shot
35mm35mm plaza shot
50mm50mm plaza shot
85mm portrait85mm plaza portrait
200mm telephoto200mm compressed shot
Fig 6.3The full sweep, 24mm to 200mm. At 24mm the plaza is the subject; by 50mm the figure owns the frame; at 85mm it's a portrait; at 200mm the city is wallpaper. a person standing in a city plaza, 24mm wide angle --seed 77 --ar 16:9 · 35mm · 50mm · 85mm portrait · 200mm telephoto, heavy background compression
85mm portrait grid85mm grid
Fig 6.3bThe 85mm stop, grid view. The lens steps fully into portrait territory — chest-up, melting background.

6.4Aperture — f/1.2 vs f/8 V8.1

One number decides whether the world behind your subject exists.

50mm at f/1.2f/1.2 grid
50mm at f/8f/8 grid
Fig 6.4Creamy bokeh vs everything-in-focus. a coffee cup on a busy café table, 50mm at f/1.2, creamy bokeh, background melts away --ar 16:9 a coffee cup on a busy café table, 50mm at f/8, everything in sharp focus --ar 16:9
f/1.2coffee cup at f/1.2
f/8coffee cup at f/8
Fig 6.4bFull resolution, side by side. At f/1.2 the café dissolves into discs of light; at f/8 every chair, cup and window holds focus.

6.5Specialty lenses V8.1

Tilt-shift miniaturises a city; fisheye bends a trick into a bubble; a macro lens turns a wildflower field into a single bee's world.

tilt-shifttilt-shift city grid
fisheyefisheye skateboarder grid
macromacro bee strip
Fig 6.5Three lenses that are really three ideas. a city intersection from above, tilt-shift lens, miniature toy-town effect · a skateboarder mid-trick, fisheye lens, extreme distortion · a field of wildflowers, macro lens, single bee in sharp focus --ar 4:5
tilt-shifttilt-shift city, full resolution
fisheyefisheye skateboarder, full resolution
macromacro bee, full resolution
Fig 6.5bAll three lenses, full resolution. A toy-town intersection, a trick bent into a bubble, and one bee's universe.

6.6Film stocks V8.1

Four emulsions, one couple, one sunset. Film-stock names are colour-grading presets hiding in plain language.

Kodak Portra 400beach couple, Portra
Fujifilm Velvia 50beach couple, Velvia
CineStill 800Tbeach couple, CineStill
Ilford HP5beach couple, Ilford black and white
Fig 6.6Same beach, four chemistries. Portra keeps skin honest and warm; Velvia super-saturates the sky; CineStill cools into tungsten halation; Ilford strips it to silver. a couple walking on a beach at sunset, shot on Kodak Portra 400, warm natural skin tones, film grain · Fujifilm Velvia 50, ultra-saturated · CineStill 800T, tungsten, halation · Ilford HP5, black and white, classic grain

6.7The lighting cheat-sheet V8.1 · seed 4

Six lighting names, one studio portrait. This spread is the page readers will dog-ear.

RembrandtRembrandt lighting portrait
golden hour backlightgolden hour portrait
blue hourblue hour portrait
high-keyhigh-key portrait
low-key · film noirfilm noir portrait
neon · magenta & cyanneon lighting portrait
Fig 6.7Light is a vocabulary. Each term sets direction, temperature and mood in one word. studio portrait of a woman, Rembrandt lighting, single soft key, dark background --seed 4 --ar 16:9 · golden hour backlight, warm rim light, lens flare · blue hour, cool ambient light, soft shadows · high-key lighting, bright even white · low-key film noir lighting, hard shadows, single hard light · neon lighting, magenta and cyan, night, urban

6.8Shot type & composition V8.1 · seed 31

One climber, six framings — from the determination in her eyes to a speck on a glacier. Framing words are directing words.

extreme close-upextreme close-up of climber's eyes
medium shotmedium shot climber checking gear
full-body widewide shot of lone climber
aerial bird's-eyeaerial view of climber on glacier
low-angle herolow-angle hero shot of climber
Dutch angleDutch angle climber shot
Fig 6.8The director's six. Same mountain story, six emotional registers — intimacy, process, solitude, scale, heroism, unease. extreme close-up of a mountain climber's determined eyes --seed 31 --ar 16:9 · medium shot, checking gear on a ridge · full-body wide shot on a vast snowy face · aerial bird's-eye view, a speck on a glacier · low-angle hero shot against the sky · Dutch-angle shot, tense, tilted horizon

6.9The fully specified photographic prompt V8.1

Everything in this part, assembled into one sentence: scene, camera, lens, aperture, film stock, light, composition.

woman in a red coat crossing a foggy cobblestone street at dawn
Fig 6.9The graduation image. Subject and place · Hasselblad X1D · 85mm at f/1.9 · Kodak Portra 400 · soft directional light · rule of thirds — every clause earns a visible effect. cinematic film still of a woman in a red coat crossing a foggy cobblestone street at dawn, distant warm streetlights, shot on a Hasselblad X1D, 85mm at f/1.9, shallow depth of field, Kodak Portra 400, soft directional light, rule of thirds --ar 16:9 --style raw
↑ All chapters
07

Part Seven · to generate

Advanced Workflows

Repainting regions, retexturing whole images, growing stories at the edge of the frame, seamless patterns — and pressing Animate.

7.1Vary Region — select and repaint V8.1 · Edit

Mask a region, re-prompt only that region. The rest of the image is never touched. Best on 20–50% of the frame.

the base gridwoman in plain white t-shirt, base grid
mask the shirteditor with the t-shirt masked
re-prompt the regionresult: denim jacket over a striped shirt
Fig 7.1A wardrobe change, nothing else. Same woman, same wall, same light — only the masked region was re-imagined. a woman in a plain white t-shirt standing against a brick wall --ar 4:5 --v 8.1 → Edit · mask the shirt · region prompt: a denim jacket over a striped shirt

7.2Retexture — same structure, new skin V8.1 · Edit

Retexture regenerates the whole image in a new style while keeping its structure — composition, geometry and pose survive; the surface changes.

original in the editorbicycle photo in the editor
retexture · watercolorbicycle retextured as a watercolor painting
retexture · neon 3Dbicycle retextured as a neon-lit 3D render
Fig 7.2One bicycle, three materials. The frame never moves; the lean against the wall never changes — only the world it's rendered in. Compare with --sref (Part 3), which restyles at generation time; Retexture restyles an image you already have. a simple still life of a bicycle leaning against a wall --ar 3:2 --v 8.1 → Edit · Retexture: watercolor painting · then: neon-lit 3D render at night

7.3Canvas-expansion as storytelling V8.1 · Edit

The same Edit workflow as the lighthouse in 1.3 — but used narratively. A tight, anonymous frame; then the canvas widens and the story arrives.

base · 1:1a single candle burning in darkness
extended in Editthe candle revealed in a vast cathedral
Fig 7.3The reveal. The candle never changes — the world grows around it. Keep the added-region prompt about the new space only. a single candle burning in darkness --ar 1:1 --v 8.1 → Edit · widen canvas · added-region prompt: the candle in a vast cathedral hall
canvas widened in Editeditor with the candle and a widened empty canvas
base gridcandle base grid
Fig 7.3bThe mechanics. The widened frame waits empty around the original square before the cathedral is prompted into existence.

7.4Tiling — seamless patterns --tile

The --tile flag makes every edge match its opposite edge — one generation becomes infinite wallpaper.

one tileseamless botanical pattern tile
tiled 3×3the same tile repeated 3 by 3
Fig 7.4One tile, infinite fabric. Look for the joins in the right-hand panel — there aren't any. Made for textiles, wrapping paper, game textures and website backgrounds. a seamless pattern of botanical leaves and berries, flat illustration --tile --ar 1:1

7.5Image-to-video — pressing Animate V1 video

Any strong still can move. Generate, press Animate, describe the motion — Low motion for ambience, High when everything should travel. Image references aren't compatible with video.

the still · gridJapanese garden still grid
Animate · motion promptthe Animate interface with motion prompt
Fig 7.5From frame to film. The chosen still goes in; a motion prompt — not an image prompt — tells it how to breathe. a lantern-lit Japanese garden at night, koi pond, falling maple leaves --ar 16:9 motion: leaves drift down, koi swim, lantern light flickers, gentle ripples on the water
Fig 7.5bThe result, playing live. Five seconds of falling maple leaves, drifting koi and flickering lanterns — ≈8× the GPU cost of a still, and worth every minute. (Print edition: this becomes a 3-frame filmstrip.)
↑ All chapters
08

Part Eight · to generate

Troubleshooting — “Why Did It Do That?”

Small didactic comparisons for the pitfalls chapter: conversational prompts, superlative soup, forgotten aspect ratios, and the seed that brings an image back.

8.1Conversational vs Midjourney-style V8.1

Midjourney isn't a chatbot. Politeness, filler and full sentences become noise the model must interpret around.

conversationalcat from a conversational prompt
Midjourney-stylecat from a clean descriptive prompt
Fig 8.1Both work — one is steered. The conversational version lands somewhere pleasant; the descriptive version lands where you aimed: windowsill, warm afternoon, soft background, 4:5. Hello, could you please make me a really nice picture of a cat that is sitting by a window looking thoughtful in the afternoon? a thoughtful cat sitting on a windowsill, warm afternoon light, soft focus background --ar 4:5

8.2Over-stuffed vs trimmed V8.1

Superlatives are not instructions. V7 and V8 reward restraint — four concrete nouns beat fourteen adjectives.

over-stuffedover-stuffed prompt result
trimmedtrimmed prompt result
Fig 8.2The kitchen-sink prompt vs the brief. Both are mountain lakes at sunset — but the trimmed one has a composition, because every word in it was a decision. a beautiful stunning gorgeous highly detailed ultra realistic amazing epic majestic breathtaking landscape with mountains and a lake and trees and a sunset and birds and clouds and reflections, 8k, masterpiece, award winning a mountain lake at sunset, mirror reflection, pine forest, dramatic clouds --ar 16:9

8.3Forgetting the aspect ratio V8.1

A panorama needs a panoramic canvas. Set --ar before you generate, not after you're disappointed.

no --ar · default framedesert highway in the default frame
--ar 21:9desert highway at 21:9
Fig 8.3Give the road room. Without --ar the scene squeezes into whatever frame the model defaults to; at 21:9 the horizon finally behaves like a horizon. a panoramic desert highway disappearing into the horizon a panoramic desert highway disappearing into the horizon --ar 21:9

8.4Seed reproducibility V8.1 · “99% identical”

The same seeded prompt, run twice on different occasions. Seeds make experiments fair — and revisions possible.

run 1 · seed 4242paper boat, first run
run 2 · seed 4242paper boat, second run
Fig 8.4Two runs, one boat. Near-identical — note V8.1 promises “99% identical,” not pixel-exact. Lock a seed when you want to test one variable at a time; that discipline built every comparison in this book. a paper boat floating on a puddle, reflection of city lights --seed 4242 × 2
↑ All chapters
09

Appendices

Parameters, Control Subjects, Rights & Law

The reference half: every parameter, the reusable control subjects, and what a working creator needs to know about rights and the law.

AParameter quick reference

ParameterRange / defaultWhat it does
--arup to 14:1 · def 1:1Aspect ratio — set it before you generate
--s stylize0–1000 · def 100How much Midjourney aesthetic is imposed
--c chaos0–100 · def 0Variety between the four grid images
--w weird0–3000 · def 0Unconventionality of each image
--style rawflagLiteral mode — the realism foundation
--nonounsNegative prompt
--seed0–4.29bnReproducibility (V8.1: “99% identical”)
--tileflagSeamless repeating patterns
--sref / --swsw 0–1000 · def 100Style reference and its strength V7
--oref / --owow 1–1000 · def 100Omni Reference — characters & objects V7 only
--iwup to 3Image-prompt vs text influence V7
--pmoodboard IDPersonalization / moodboards (strength via --s)
--v / --niji6–8.1 · niji 7Model version — tag every figure

BReusable control subjects

Pick one per chapter and reuse it across every variable so readers see apples-to-apples: people — “a young woman laughing at a kitchen table” / “a weathered fisherman”; object — “a single ripe pear on a wooden table”; scene — “a lone lighthouse on a rocky cliff”; action — “a mountain climber on a ridge.”

CRights, rules & the law drafted · verify before print

Ownership: paid subscribers own what they create “to the fullest extent possible under applicable law,” with broad commercial rights that survive cancellation; companies grossing over $1M/year must be on Pro or Mega. Copyrightability: the US Copyright Office holds that purely AI-generated images are not copyrightable — only substantial human authorship is protectable, so refine outputs in an editor to strengthen claims. Moderation: roughly PG-13, enforced everywhere including private and stealth work. Litigation: Disney/Universal (June 2025) and Warner Bros. Discovery (Sept 2025) suits remain active through 2026 — this chapter must be re-verified at press time. Best practice taught throughout this book: lean on medium, technique and era language plus image-based --sref, not living artists' names.

↑ All chapters
Glowing words condensing into a phoenix

www.rupertchesman.com