AGENCYSCRIPT
CoursesEnterpriseBlog
đź‘‘FoundersSign inJoin Waitlist
AGENCYSCRIPT

Governed Certification Framework

The operating system for AI-enabled agency building. Certify judgment under constraint. Standards over scale. Governance over shortcuts.

Stay informed

Governance updates, certification insights, and industry standards.

Products

  • Platform
  • Certification
  • Launch Program
  • Vault
  • The Book

Certification

  • Foundation (AS-F)
  • Operator (AS-O)
  • Architect (AS-A)
  • Principal (AS-P)

Resources

  • Blog
  • Verify Credential
  • Enterprise
  • Partners
  • Pricing

Company

  • About
  • Contact
  • Careers
  • Press
© 2026 Agency Script, Inc.·
Privacy PolicyTerms of ServiceCertification AgreementSecurity

Standards over scale. Judgment over volume. Governance over shortcuts.

On This Page

Before You GenerateHave You Described the Outcome?Have You Listed the Hard Constraints?While Reviewing the DraftDid You Read Every Line?Did You Flag Any Invented Constraints?Does It Still Cover Your Requirements?During TestingDid You Run Three to Five Real Cases?Did You Compare Against a Baseline?Before Locking It InDid Quality Plateau?Did You Record Its Boundaries?Did You Store It With Context?When to Skip Items SafelyFor One-Off TasksFor Already-Vetted PromptsHow to Use the List in PracticeKeep It VisibleTrim It to Your FailuresTurn Repeated Catches Into HabitsThe Checklist at a GlanceBefore GeneratingWhile ReviewingDuring TestingBefore LockingFrequently Asked QuestionsDo I need to run the whole list every time?Which single item catches the most problems?Can I add my own items?Is the baseline comparison really necessary?Key Takeaways
Home/Blog/Run This List Before You Ship a Prompt-Writing Prompt
General

Run This List Before You Ship a Prompt-Writing Prompt

A

Agency Script Editorial

Editorial Team

·November 3, 2022·6 min read
meta-promptingmeta-prompting checklistmeta-prompting guideprompt engineering

A checklist is only useful if you actually run it, and you will only run it if it is short enough to be painless and justified enough to feel worth it. This one is built to those constraints. It covers the moments in a meta-prompting workflow where things quietly go wrong, and it gives you a reason for each item so you can drop the ones that do not apply to your situation.

Treat it as a pre-flight scan rather than a tutorial. It assumes you already understand the basic loop of generating, testing, and refining a prompt; if you do not, start with Build Prompts That Generate Better Prompts, Step by Step and come back. The value here is catching the specific mistakes that slip past even experienced practitioners.

Run the list whenever you are about to promote a generated prompt from experiment to regular use. That promotion is the riskiest moment, because a flawed prompt that becomes a template multiplies its flaw across every future run.

Before You Generate

The setup determines the ceiling on quality, so check it first.

Have You Described the Outcome?

You should be able to point at a sentence describing the ideal result. If you only have instructions and no target, the model has nothing concrete to design toward, and the generated prompt will drift generic.

Have You Listed the Hard Constraints?

Length limits, banned words, required facts, and tone should be written down before generation. Constraints you forget to state are constraints the model will either ignore or invent on your behalf.

While Reviewing the Draft

This is the highest-leverage section of the entire list.

Did You Read Every Line?

Skimming a generated prompt defeats the purpose. The whole point of producing the prompt as a readable artifact is to inspect it. Read it slowly, looking for surprises.

Did You Flag Any Invented Constraints?

Search specifically for rules you never asked for, especially confident factual claims like "industry standard is X." These are the silent failures detailed in Seven Ways Self-Writing Prompts Quietly Go Wrong, and they are invisible once the prompt runs.

Does It Still Cover Your Requirements?

Confirm the prompt did not quietly drop a constraint you cared about while adding ones you did not. Generation sometimes trades your priorities for its own defaults.

During Testing

A prompt is unproven until it survives real inputs.

Did You Run Three to Five Real Cases?

One success is luck. A handful of cases reveals consistency, which is the property that actually matters for a reusable prompt. Skipping this is the most common shortcut and the most costly.

Did You Compare Against a Baseline?

Run your old rough request on the same cases. If the generated prompt is not clearly better, it is not ready, and you have learned that before committing to it.

Before Locking It In

The promotion step deserves its own gate.

Did Quality Plateau?

You should have hit two consecutive refinement rounds with roughly equal quality. If outputs are still improving fast, keep going; if they have stalled, stop before you over-engineer, a balance covered in Habits That Separate Sloppy From Sharp Prompt Generation.

Did You Record Its Boundaries?

Note where the prompt works and where it does not. A prompt tuned for short posts will fail on long reports, and writing that down prevents a future misapplication.

Did You Store It With Context?

Save the final prompt with a note on its purpose. An unsaved prompt is a one-time benefit; a stored one compounds, as the team in How an Agency Cut Prompt Drafting Time by Half discovered.

When to Skip Items Safely

A checklist you run mindlessly becomes friction, so it helps to know which items are situational.

For One-Off Tasks

If you are meta-prompting a task you will genuinely never repeat, the storage and boundary items add no value, and the full refinement gate is overkill. The inspection items still matter, because an invented constraint can ruin even a single output. The rule of thumb: keep the cheap safety checks, drop the asset-management ones.

For Already-Vetted Prompts

When you rerun a prompt you previously locked in, you do not re-review or re-baseline it. You only revisit the testing items if the inputs have changed meaningfully. A vetted prompt earns the right to skip most of the list until something about the task shifts.

How to Use the List in Practice

The list works best as a lightweight ritual rather than a bureaucratic form.

Keep It Visible

Pin the list somewhere you will actually see it at the moment of promotion, when a prompt graduates from experiment to regular use. A checklist filed away and forgotten protects nothing. The point is to catch yourself precisely when you are tempted to ship an unread draft.

Trim It to Your Failures

Over time, you will notice which items consistently catch problems for you and which never fire. Keep the ones that earn their place and quietly retire the rest. A short, trusted list you run every time beats an exhaustive one you skip. The discipline of matching the tool to your real failure modes mirrors the broader judgment in Habits That Separate Sloppy From Sharp Prompt Generation.

Turn Repeated Catches Into Habits

If the same item catches the same mistake repeatedly, that is a signal to build the check into how you work rather than relying on the list to remind you. The goal is for the most important items, inspecting the draft and testing on real cases, to become reflexes you no longer need prompting to perform.

The Checklist at a Glance

For quick reference, here is the full sequence condensed into a single scan you can run in under a minute.

Before Generating

  • Can you point at a one-sentence description of the ideal outcome?
  • Have you written down the hard constraints: length, tone, banned words, required facts?

While Reviewing

  • Did you read the draft line by line rather than skimming it?
  • Did you flag any constraint or factual claim you never requested?
  • Does the prompt still cover every requirement you cared about?

During Testing

  • Did you run it on three to five real cases, not just one?
  • Did you compare the results against your old rough baseline?

Before Locking

  • Has quality plateaued across two consecutive refinement rounds?
  • Did you note where the prompt works and where it does not?
  • Did you store it with a short description of its purpose?

Running this condensed version at the moment of promotion catches the overwhelming majority of avoidable failures, and it costs almost nothing in time once the items are familiar.

Frequently Asked Questions

Do I need to run the whole list every time?

Run it fully the first time you promote a prompt to regular use. For routine reruns of an already-vetted prompt, only the testing section matters when inputs change.

Which single item catches the most problems?

Reading every line of the draft. The majority of silent meta-prompting failures are visible in the prompt and invisible in the output, so inspection is where they get caught.

Can I add my own items?

Yes, and you should. The list is a starting default. If your work has a recurring failure mode not covered here, add a check for it and keep the list short by dropping items that never trigger.

Is the baseline comparison really necessary?

During design, yes. It is the only way to know whether the generated prompt actually helped rather than just sounding more elaborate. Once locked in, the baseline has done its job.

Key Takeaways

  • Before generating, write down the outcome and the hard constraints.
  • Read every line of the draft and flag any constraint you did not request.
  • Test on three to five real cases and compare against your rough baseline.
  • Lock the prompt only after quality plateaus across two refinement rounds.
  • Store the final prompt with its boundaries so the benefit compounds.

Search Articles

Categories

OperationsSalesDeliveryGovernance

Popular Tags

prompt engineeringai fundamentalsai toolsthe difference between AIMLagency operationsagency growthenterprise sales

Share Article

A

Agency Script Editorial

Editorial Team

The Agency Script editorial team delivers operational insights on AI delivery, certification, and governance for modern agency operators.

Related Articles

General

Prompt Quality Decides Whether AI Earns Its Keep

Prompt quality is the single biggest variable in whether AI delivers real work or expensive noise. The model matters, the platform matters — but the prompt you write determines whether you get a first

A
Agency Script Editorial
June 1, 2026·10 min read
General

Counting the Real Cost of Every Token You Send

Tokens and context windows sit at the intersection of AI capability and operational cost—yet most business cases treat them as technical footnotes. That's a mistake that costs real money. Every time y

A
Agency Script Editorial
June 1, 2026·10 min read
General

Rolling Out AI Hallucinations Across a Team

Most teams discover AI hallucinations the hard way — a confident-sounding wrong answer makes it into a client deliverable, a legal brief, or a published report. The damage isn't just to the output; it

A
Agency Script Editorial
June 1, 2026·11 min read

Ready to certify your AI capability?

Join the professionals building governed, repeatable AI delivery systems.

Explore Certification