Prompt Engineering — talk to AI like a pro — 7. Chaining prompts: multi-step workflows

11 June 2026 Mis à jour le 11 June 2026 20 min read min de lecture

Prompt Engineering — talk to AI like a pro

10 chapters · free

Anatomy of a prompt that works
Few-shot: learning by example
Guiding the reasoning
Role, constraints & format
Templates & debugging
System prompts & advanced personas
Chaining prompts: multi-step workflows
Structured data: extract, classify, normalize
Evaluating and testing your prompts
Capstone: your professional prompt library

Prompt Engineering — talk to AI like a pro › Chapter 7

Chapter 07

Chaining prompts: multi-step workflows

Chapter 7 of 10 · 70%

Chapter objectives

Turn a decomposition into a reusable pipeline with intermediate artifacts
Place human validation at the point of maximum leverage
Use the generator-critic loop and debug a chain step by step

The wall of the single prompt

Every month, Sofia produces the Planiresto newsletter: research across three sources, an editorial angle, a 600-word article, an email subject line that makes people want to open. Armed with chapter 4, she wrote a masterful forty-line prompt asking for everything at once. The result: a text that's correct but oddly flat, research takeaways skimmed over, an angle chosen by default — and no way to know which part of the prompt to fix, since everything comes out as one block.

Chapter 3 already gave the principle: decompose. This chapter turns it into a tooled method: prompt chaining, where each step produces an intermediate artifact — a list, an outline, a draft — that becomes the input of the next step. The difference with the improvised decomposition of chapter 3: here, the chain is designed in advance, documented, and reused identically every month. It's a production process, not a conversation anymore.

Why does a chain systematically beat the mega-prompt on complex jobs? Three reasons. Quality: each step gets the model's full attention, instead of sharing a single effort. Control: you can inspect and fix each artifact before it contaminates what follows. Diagnosis: when the final result disappoints, you know which step to look at — exactly like the debugging tree of chapter 5, but applied to an entire process.

Designing the chain: steps, artifacts, checkpoints

A chain is designed backwards, starting from the deliverable. For the newsletter: you need a finalized article; so before that, a critiqued draft; so before that, a draft; so before that, a validated angle; so before that, research takeaways. Five steps, four intermediate artifacts. For each step, note three things: what it receives, what it produces, and who validates — you or nobody.

flowchart LR
  A["Research sources"] --> B["Step 1: key takeaways"]
  B --> C["Step 2: three proposed angles"]
  C --> V{"Human validation of the angle"}
  V -->|"Angle chosen"| D["Step 3: full draft"]
  D --> E["Step 4: targeted critique"]
  E --> F["Step 5: final version + subject line"]

The newsletter chain: each step produces an artifact the next one consumes; the human validates at the leverage point.

Look at where the validation is placed: after the angle proposals, before the writing. That's the point of maximum leverage: the most structuring decision (what are we talking about, from what angle) is made on a short artifact readable in thirty seconds, before hundreds of words get written in the wrong direction. Validating earlier would control nothing (the raw takeaways are factual); validating later would be expensive (rewriting 600 words because the angle was wrong). Always place your validation where the decision is still small and its consequences still large.

Step 1: extracting the takeaways

PROMPT

You are a research analyst for a company making scheduling software for restaurant owners.

Here are three research pieces, each between delimiters:
--- SOURCE 1 ---
{{article 1}}
--- SOURCE 2 ---
{{article 2}}
--- SOURCE 3 ---
{{study 3}}
--- END ---

Task: extract the 5 most useful takeaways for restaurant managers, citing the source of each.

Format: numbered list; each takeaway = 1 sentence of fact + 1 sentence "why it matters for a restaurant owner" + the mention [Source N]. Only what appears in the sources: if a point is an interpretation, flag it with [interpretation].

The whole course converges in this step prompt: role with audience (chapter 4), delimited sources (chapter 1), strict format with traceability (chapter 4), and a guardrail against invention ("only what appears in the sources"). The artifact produced — the sourced list — is short, verifiable in two minutes, and becomes the input of the next step. That's the key property of a good intermediate artifact: quickly inspectable.

Steps 2 and 3: the artifact as a baton pass

PROMPT

Here are the validated takeaways from this month's research:
{{list from step 1, corrected}}

Task: propose 3 editorial angles for the monthly newsletter, from the safest to the boldest.

Format for each angle: a working title, the main takeaway it exploits, the promise to the reader in 1 sentence, and the risk of this angle in 1 sentence. Do NOT write the article: only the angles.

Two details make this link robust. "Here are the validated takeaways": Sofia pastes the list she has reread and possibly corrected — the chain transports controlled information, not a machine's raw output. And "do not write the article": without this boundary, the model happily spills over into the next step, and you receive an article based on an angle you didn't choose. Each step must do its job, all of its job, and nothing but its job.

Step 3 — the writing — follows the same relay pattern, and it's the one that benefits most from the upstream work: the substance has been validated twice, the model only has to write well. That's the mental inversion of chaining: secure the thinking first, the writing second — whereas the mega-prompt did everything at once, and everything averagely.

PROMPT

Chosen and validated angle: {{angle 2, adjusted after review}}.

Write the full draft of the April newsletter.

Constraints:
- About 600 words. Structure: introduction, 3 titled parts, conclusion with a call to action.
- Direct, concrete tone, address the reader directly, one example or figure per part.
- Rely ONLY on the validated takeaways below, without adding outside facts:
{{list of validated takeaways}}
- Do not write the email subject line or any variants: only the body of the newsletter.

Note the last constraint, another step boundary: the email subject line is deliberately postponed to step 5, after the critique. Why? Because a good subject line is chosen on the final version of the text, not on a draft that will still move — and because the subject line deserves its own prompt regime: at step 5, Sofia asks for "8 subject lines under 50 characters, from the most sober to the most intriguing", an open implicit-temperature phrasing (chapter 4), whereas writing the body demanded a closed temperature. A well-designed chain tunes the regime of each step to its nature: locked-down factual for research, unbridled creative for the subject line.

Store the artifacts in a document rather than leaving them in the conversation thread: a "newsletter-march.md" file with one section per step. You can fix each artifact directly in the document, and the final document tells the whole process — precious for improving the chain the following month.

Step 4: the generator-critic loop

The fourth step applies the self-critique of chapter 3, but specializes it: rather than a vague "reread and improve", the critique receives a precise grid — and that's where the personas of chapter 6 come back on stage.

PROMPT

Here is the draft of our newsletter:
---
{{draft from step 3}}
---
Critique it along 4 axes, without rewriting it:
1. Hook: does an overwhelmed restaurant owner read past the first sentence? Why?
2. Concreteness: does each part contain a usable example or figure?
3. Tone: spot any sentence that sounds corporate or hollow, quote it.
4. Action: does the ending give a clear reason to click?

Format: for each axis, a one-sentence verdict + a quote of the passage concerned + a concrete suggestion. End with the 2 priority fixes.

Keep the critique well separated from the rewrite: a prompt that asks for both at once produces a complacent critique and a timid rewrite. In two stages — critique first, then "apply fixes 1 and 3, leave the rest" — you keep control over what changes. And the "quote the passage concerned" forces the critique to be verifiable: a critique without a quote is an opinion, a critique with a quote is a diagnosis.

One conversation or several? The context question

Should you run the chain in a single conversation or open one per step? Both work, with a trade-off. A single conversation keeps the context (the model remembers past steps), but the history gets heavy and old instructions lose weight — the drift seen in chapter 4. Separate conversations start clean, but you must re-paste the input artifact yourself each time.

Sofia's practical rule: separate conversations for the critical steps (the step 4 critique is more honest when the model didn't write the draft itself in the same thread — it doesn't have to defend its own work), a single conversation for short sequences. And in all cases, each step prompt explicitly re-pastes its input artifact: the chain must never depend on the conversation's memory to work. That's what makes it replayable the following month.

Beware of cascading error: a false takeaway at step 1 travels through all subsequent steps with full confidence and ends up in the published newsletter. Upstream artifacts deserve your most careful proofreading — one minute of verification at step 1 saves a complete rewrite at step 5.

Parallel variants and the documented chain

Chaining has a cousin: parallelization. At step 3, rather than one draft, Sofia sometimes asks for two on the same angle — one classic, one bolder — then submits both to the step 4 critique and keeps the better one. Generating variants costs a few seconds; choosing between two critiqued versions almost always gives a better final than improving a single version. Reserve this luxury for high-stakes content: it's extra review time.

What remains is to capitalize. A chain that has proven itself gets documented like a chapter 5 template, but at process scale: a "Newsletter chain" document listing the steps, each step's prompt with its {{ }} fields, the expected artifact, and the validation point. The next month, Sofia unrolls the document instead of reinventing — and every improvement found along the way is carried back into the document. Her newsletter went from a painful full day to two calm hours, with a quality her old mega-prompt never came close to.

One last reflex to mature your chains: measure them. Note, at every run, the time spent per step and the place where you corrected the most. These two numbers designate the next improvement without debate: if step 4 sends you the same two critiques every month, the step 3 prompt should integrate those requirements upfront; if the angle validation has taken you thirty seconds for three months, maybe it can be lightened. A chain isn't a monument: it's a process that gets refined through use, exactly like a template — and the document that describes it is where that refinement accumulates instead of evaporating.

🛠️ Your turn

Context

Sofia has to produce the April newsletter and wants to make it count: run the complete five-step chain, measure the time spent, and document the process so it can be replayed in May by anyone on the team. She has her three research sources at hand and a clear goal: a 600-word article whose angle was chosen consciously, not by default.

Instructions

Choose a recurring deliverable that currently demands one big single prompt (newsletter, report, brief) and cut it backwards: from the final deliverable to the necessary artifacts.
For each step, note what it receives, what it produces, and whether you validate — then spot YOUR leverage point: the most structuring decision on the shortest artifact.
Write the step 1 prompt with role, delimited sources, artifact format and anti-invention guardrail; run it and correct the resulting artifact.
Chain the steps by explicitly re-pasting each validated artifact; at the writing step, set a boundary ("do not write the next step").
Add a critique step with a grid (3-4 axes, quotes required) in a separate conversation, then apply only the fixes you choose.
Document the chain in a file: steps, fill-in-the-blank prompts, artifacts, validation point — and note the time spent to compare next month.

Hint — Design the chain backwards, starting from the deliverable. And place your validation where the decision is still small and its consequences still large: it's almost always right before the long writing.

In summary

A prompt chain replaces the mega-prompt: each step produces a short, inspectable artifact that feeds the next.
Design backwards from the deliverable, and note for each step: what it receives, what it produces, who validates.
Human validation goes at the leverage point: structuring decision, short artifact, before the costly production.
Each step does all of its job and nothing but its job: set boundaries ("do not write the article") to avoid spillover.
The critique is separated from the rewrite, receives a precise grid and quotes its passages — ideally in a fresh conversation.
An upstream error travels through the whole chain: check the first artifacts with the most care.
Document the chain (steps, fill-in-the-blank prompts, validations): it becomes a replayable process, not a monthly improvisation.

Quiz — check your understanding

1. What is the main advantage of a chain over a single mega-prompt?

It costs less in tokensEach step is inspectable and correctable before contaminating what followsIt avoids writing promptsThe model answers faster

Quality, control and diagnosis: you inspect each artifact, and when the final disappoints, you know which step to look at.

2. Where should human validation go in a chain?

At the very end, on the deliverableAt every step without exceptionAt the leverage point: structuring decision, short artifact, before the long productionNowhere, the chain is automatic

Validating the angle in 30 seconds before writing 600 words: the decision is small, its consequences are large. Too early controls nothing, too late costs a rewrite.

3. Why bound a step with "do not write the article, only the angles"?

To shorten the answer and save reading timeBecause the model happily spills over into the next step, short-circuiting your validationBecause models can’t write articlesTo respect the context limit

Without a boundary, you receive an article based on an angle you didn't choose — the validation at the leverage point gets skipped.

4. Why run the critique in a separate conversation?

It’s technically mandatoryTo pay lessThe model critiques a text it didn’t write in the same thread more honestlyTo keep the history secret

In the same thread, the model tends to defend its own work. A fresh context, plus a grid with required quotes, gives a real diagnosis.

5. What is cascading error?

A bug in chat interfacesAn error in an upstream artifact that travels through all the steps and ends up in the deliverableA repeated spelling mistakeA prompt too long that overflows the context

A false takeaway at step 1 contaminates angles, draft and final version. Hence the priority proofreading of the first artifacts.

6. How do you make a chain replayable the following month?

Keep the conversation open permanentlyMemorize itTrust the tool’s historyDocument it: steps, fill-in-the-blank prompts, expected artifacts, validation points

Each step prompt explicitly re-pastes its input artifact and everything lives in a document: the chain depends neither on your memory nor on the conversation's.

← PreviousSystem prompts & advanced personas Next →Structured data: extract, classify, normalize

Auteur(s)

REHOUMA Haythem

Haythem Rehouma est un ingénieur et architecte IA et cloud, formateur et enseignant technique, avec un profil orienté IA médicale, AWS, MLOps, LLM/RAG et vision par ordinateur.