Prompt Engineering — talk to AI like a pro — 10. Capstone: your professional prompt library

11 June 2026 Mis à jour le 11 June 2026 20 min read min de lecture

Prompt Engineering — talk to AI like a pro

10 chapters · free

Anatomy of a prompt that works
Few-shot: learning by example
Guiding the reasoning
Role, constraints & format
Templates & debugging
System prompts & advanced personas
Chaining prompts: multi-step workflows
Structured data: extract, classify, normalize
Evaluating and testing your prompts
Capstone: your professional prompt library

Prompt Engineering — talk to AI like a pro › Chapter 10

Chapter 10

Capstone: your professional prompt library

Chapter 10 of 10 · 100%

Chapter objectives

Structure a versioned library with complete cards
Set up the life cycle and team governance
Assemble everything from the course into a system that lasts

Time for the great assembly

Nine chapters have filled Sofia's pockets: templates (chapter 5), a Plume assistant (chapter 6), a newsletter chain (chapter 7), an extraction pipeline (chapter 8), test sets and a version log (chapter 9). The problem has changed in nature: it's no longer "how to write a good prompt", it's "how to find, maintain and share forty good prompts". Her "prompts that work" note from chapter 5 is bursting at the seams: duplicates, stale versions, and the intern asking three times a day "which one is the right prompt for reviews?".

This capstone chapter builds the answer: a professional prompt library — organized, versioned, governed. The word may sound pompous for a collection of texts; it isn't. Your prompts now encode weeks of measured iterations: they're a company asset, just like a knowledge base or source code. And an asset without organization depreciates — that's even its natural slope.

The prompt card: the basic unit

A library isn't a pile of prompts: it's a set of cards. The card wraps the prompt with everything needed to use it, judge it and evolve it without its author. Its content brings together the whole course: the template with {{ }} fields (chapter 5), its test cases (chapter 9), its usage context, its owner and its history.

PROMPT

[CARD — negative-review-reply — v4]

What: generates a draft reply to a negative customer review, in the Planiresto tone.
When to use it: any review ≤ 3 stars. NOT for legal threats (→ forward to management).
Owner: Sofia. Last updated: Mar 21.

TEMPLATE:
You are a customer relations officer at Planiresto [...see full prompt...]
Review to handle: {{review}}
Optional context: {{customer history, or "none"}}

VALIDATION: score 80/90 on test set review-tests.md (15 entries). Judge calibrated Mar 13.
EXAMPLE OF A SUCCESSFUL OUTPUT: [paste a representative example]

CHANGELOG:
v4 (Mar 21) — shortened few-shot examples + word-limit reminder. 80/90.
v3 (Mar 18) — REJECTED: length regression.
v2 (Mar 14) — rule "don't dispute the customer's word". 75/90.
v1 (Mar 12) — initial version. 68/90.

Each section answers a question someone will ask one day. "When to use it" prevents out-of-scope uses — the number one cause of "your prompt doesn't work". "Validation" provides the legitimacy: this prompt isn't an opinion, it's 80/90 on a known test set. The changelog tells why v4 is built the way it is — and spares your successor from retesting the rejected v3. A complete card makes the prompt orphan-proof: usable and improvable even if its author leaves tomorrow.

Organizing: by task, not by technique

How do you shelve forty cards? By business task, never by technique. The user searching thinks "I need to reply to a review", not "I need a few-shot with an LLM judge". Sofia's shelves: Social media, Emails & newsletter, Reviews & support, Data & analysis, Internal. Each card carries an action-oriented name ("negative-review-reply", "trade-show-lead-extraction") — the chapter 5 rule, now a team convention.

On the tooling side, the hierarchy of needs is simple. Alone: a folder of well-named notes is enough. As a team: a shared space (Notion, wiki, drive) with the card template as the mandatory model. As a technical team: a Git repository, where every prompt modification goes through review — the prompt/code parallel then becomes literal. Sofia picks the shared space: that's where the team already lives. The best library tool is the one you open without thinking.

Resist over-organization: five shelves and one card template are enough to start. A taxonomy of twenty subcategories and twelve mandatory fields kills contribution — nobody fills in twelve fields on a Friday at 5pm. The perfect but empty library loses to the imperfect but living one.

The life cycle: from idea to archive

A living library needs explicit states, otherwise yesterday's draft passes itself off as today's reference. Four states are enough: draft (being written, use at your own risk), tested (run on a test set, documented score), published (the reference version, the one the team uses), archived (replaced or obsolete — kept for the record, never deleted: an archived prompt documents a dead end or an era).

flowchart LR
  B["Draft: being written"] --> T["Tested: score on test set"]
  T --> P["Published: reference version"]
  P --> A["Improvement proposed"]
  A -->|"Reruns the test set"| T
  P --> X["Archived: replaced, never deleted"]

A prompt's life cycle: nothing becomes the reference without going through the test set — improvements included.

The most important arrow in the diagram is the return "improvement proposed → tested": any modification of a published prompt reruns the test set before becoming the new reference. That's the dike against slow degradation — the well-meaning colleague who "improves" a phrasing one Tuesday and silently breaks the trap cases. With the cycle, their improvement is welcome: it will be measured, and adopted if it wins. Chapter 9's evaluation wasn't an exercise: it's the library's immune system.

Governance: light but real

Three roles, no more. The card owner (its creator, by default): they arbitrate modifications to their card and keep the changelog. The librarian — Sofia —: they keep the overall coherence, merge duplicates, keep the card template alive and run the routine. The contributors — everyone: they use, report failures (a comment on the card is enough) and propose improvements. No meetings, no committee: traffic rules, not an administration.

The routine that holds it all together extends the chapter 5 one to the team: fifteen minutes on Friday. Everyone carries over their good prompts of the week as drafts; the librarian reviews the reports; at most one card moves from tested to published. Fifteen minutes — but every Friday. Sofia learned it the hard way: the library nearly died in April, when two skipped Fridays were enough for three prompts to go back to living in private conversations. Governance is first of all a kept appointment.

And when a disagreement arises — two contributors convinced their version of the newsletter prompt is the best — the library already has its courtroom: the test set. No opinion debate in a meeting: each version runs the fifteen entries, the scores decide, the changelog records. It's perhaps the most underrated benefit of chapter 9's evaluation: it doesn't just improve prompts, it pacifies their governance.

Putting the model to work on the library itself

Your library is text; and you now know how to have a model analyze text. An elegant loop: prompts can maintain prompts. Consistency audit, duplicate detection, card-template compliance — all of it can be delegated, under your supervision:

PROMPT

You are the auditor of a company prompt library. Here are our cards between delimiters:
--- CARDS ---
{{content of the cards, or the list name + What + When to use it}}
--- END ---

Analyze and report, in this order:
1. DUPLICATES: cards covering the same task — cite the names and the difference, if any.
2. CONTRADICTIONS: tone or format rules that are incompatible between cards that should be aligned.
3. INCOMPLETE CARDS: sections missing compared to the card template — card name + missing section.
4. BLIND SPOTS: recurring tasks of the communications team with no card, based on what the existing cards suggest.

Format: 4 sections, concise bullets, verifiable observations only — no rewriting recommendations.

Note the last line: the auditor observes, the human decides — the division of roles of the whole course, right down to maintenance. Sofia runs this audit at the end of every month; the first one found two duplicates and a tone contradiction between the newsletter card and the LinkedIn card that no human had noticed.

The virtual librarian: the library that answers

Last brick, and the most satisfying one: turning the library into a conversational partner. Sofia configures a dedicated assistant — same mechanisms as chapter 6: system prompt, knowledge files — whose attached knowledge is the export of the published cards (never the drafts), and whose mission is to direct, not to create:

PROMPT

You are the prompt librarian for the communications team at Planiresto. Your knowledge: the library's published cards, attached as files.

Mission: when someone describes a task to you, you point to THE right card (its name, its "When to use it") and provide its ready-to-fill template, {{ }} fields included.

Rules:
- You only recommend existing published cards. You never invent a prompt.
- If no card covers the task, say so and offer to report the gap to Sofia — it's precious information, not a failure.
- If two cards could fit, ask ONE question to decide instead of choosing at random.
- Always restate the card's exclusion scope if it has one, like legal threats for negative-review-reply.

The intern stopped asking "which one is the right prompt for reviews?": the library started answering by itself. And note the second rule, which elegantly closes the loop of the system: every "no card covers this task" reported by the virtual librarian is a blind spot detected in real conditions — exactly section 4 of the monthly audit, but fed continuously by usage. The library no longer just gets consulted: it signals by itself where it needs to grow.

Measure your library's value with two simple numbers: the production time of a recurring deliverable (Sofia's newsletter: from a full day to two hours) and the touch-up rate (the share of outputs used as is). It's these numbers — not the number of cards — that justify the Friday routine to management.

The final word: ten chapters, one skill

Look at the road traveled since the failed LinkedIn post of chapter 1. Sofia learned to structure a request (context, task, format), to show by example, to demand verifiable reasoning, to frame roles and machine outputs, to capitalize in templates. Then she changed scale: assistants that carry the rules, chains that produce entire deliverables, pipelines that turn text into data, an evaluation that turns opinion into measurement, and a library that turns it all into a team asset.

The progression fits in three words: from prompt to system. A successful prompt is worth once; a template is worth every week; a tested, governed library is worth for a whole team, durably. Models will keep changing — your cards will mention other names, your test sets will give other scores. But the skill you've just built — phrasing precisely, verifying honestly, capitalizing methodically — is exactly the one that will gain value as these tools become more powerful. The library is open. It's your turn to write the next cards.

🛠️ Your turn

Context

Friday, 2pm: Sofia blocks two hours to found the communications team's official library. The capstone's goal: structure the space, migrate the five best prompts of the course into complete cards, walk at least one card through the life cycle, run the first automated audit, and install the Friday routine. On Monday, the intern must be able to find, understand and use a card without asking a single question.

Instructions

Create your library space (notes, shared space or repository) with 4-5 shelves by business task and the card template as the model.
Migrate your 5 best prompts from the course into complete cards: What, When to use it, owner, template, validation, output example, changelog.
Assign a state to each card (draft, tested, published) — and promote your best card to published by rerunning its test set.
Run the audit prompt on your cards: handle at least one duplicate, contradiction or missing section it detects.
Do the orphan test: ask a colleague to use a published card without any verbal explanation, and note what blocked them.
Install the routine: 15 recurring minutes on the team calendar, and appoint the librarian — probably you.

Hint — Start by migrating the prompt you use MOST often, not the most impressive one: it's the one that will prove the library's value from the first week — and the Friday routine is worth more than the perfection of the cards.

In summary

Forty good prompts pose a new problem: finding, maintaining, sharing — the answer is a library, not a pile.
The card is the basic unit: what, when to use it, owner, template, numbers-based validation, output example, changelog.
Shelve by business task, never by technique: the user thinks "reply to a review", not "few-shot with a judge".
The life cycle (draft → tested → published → archived) imposes the test set on every modification: it’s the library’s immune system.
Light governance: one owner per card, a librarian, contributors — and above all a weekly routine that is kept.
The model maintains the library under supervision: duplicate and contradiction audits, a welcome assistant backed by the published cards.
Measure value in time saved and touch-up rate, not in number of cards.
The thread of the ten chapters: from prompt to system — the skill that remains when models change.

Quiz — check your understanding

1. What does a complete prompt card contain, beyond the template?

Only its author’s nameThe usage context, the owner, the numbers-based validation, an output example and the changelogThe price of the model usedA screenshot of the conversation

The card makes the prompt "orphan-proof": usable, judgeable and improvable even without its author. Each section answers a question someone will ask one day.

2. How should the library shelves be organized?

By technique (few-shot, chain, LLM judge)In strict alphabetical orderBy business task, with action-oriented card namesBy creation date

The user searches thinking about their task ("reply to a review"), not the technique used. The shelving follows the searcher's thinking.

3. What must every proposed improvement to a published prompt do?

Be applied directly if it seems betterWait for the monthly meetingBe voted on by the teamRerun the test set before becoming the new reference

That's the dike against slow degradation: the well-meaning improvement that silently breaks the trap cases gets measured before being adopted.

4. Why archive rather than delete a replaced prompt?

To inflate the number of cardsAn archived prompt documents a dead end or an era: it avoids retesting the same ideasBecause deletion is technically impossibleFor legal reasons only

Like the rejected v3 of chapter 9: the documented failure feeds the following successes. The archive is the library's long memory.

5. What is the main danger looming over a team library?

Too many published cardsSilent abandonment: skipped routine, prompts going back to private conversationsCard names that are too shortAn excess of tests

Two skipped Fridays were enough at Sofia's. Governance is first of all a kept appointment — the routine is worth more than the perfection of the cards.

6. How do you measure the library’s value to management?

By the number of cards createdBy the average length of the promptsBy the production time of recurring deliverables and the rate of outputs used as isBy the number of open conversations

A full day become two hours for the newsletter, outputs publishable without touch-ups: those are the numbers that justify the routine — not the volume of cards.

← PreviousEvaluating and testing your prompts Finished 🎉See all courses

Auteur(s)

REHOUMA Haythem

Haythem Rehouma est un ingénieur et architecte IA et cloud, formateur et enseignant technique, avec un profil orienté IA médicale, AWS, MLOps, LLM/RAG et vision par ordinateur.