Opinionated Habits That Make AI Write Better Code

Best-practice lists for prompting tend to collapse into the same vague advice: be clear, be specific, give context. True, but useless without the reasoning that tells you when and how to apply it. A practice you do not understand is a ritual, and rituals break the moment your situation deviates from the example.

This article takes the opposite approach. Each practice below comes with the reasoning that justifies it, so you can adapt it rather than copy it. These are opinionated positions, drawn from the reality that generated code has to survive review and run in production, not just look right in a chat window. Where reasonable people disagree, this guide picks a side and explains why.

If a practice here contradicts something you have read elsewhere, the explanation should let you judge for yourself. That is the point. You should leave able to defend each habit, not just perform it.

Treat the Prompt as a Specification

The most useful mental shift is to stop thinking of the prompt as a question and start thinking of it as a specification.

Why Specifications Beat Questions

A question invites the model to interpret. A specification tells it what to build. When you write "how do I validate an email?" you get an explanation. When you write "write a function that validates an email against these rules, returning a boolean," you get the artifact you actually wanted. The specification framing forces you to make the decisions a spec requires—inputs, outputs, behavior on failure—which are exactly the decisions vague prompts leave dangerously open.

Apply It at the Right Altitude

A specification does not mean dictating implementation. Specify what the code does and the standards it meets; leave how to the model unless you have a reason not to. Over-specifying the implementation is a known failure mode covered in 7 Common Mistakes.

Front-Load Context, Ruthlessly Trimmed

Context is the highest-leverage input, but more context is not always better.

Show Real Code, Not Descriptions

A paragraph describing your conventions is weaker than one example function that demonstrates them. The model pattern-matches on examples far more effectively than it parses prose rules. One representative function teaches naming, structure, and error handling all at once. This is why showing beats telling, every time.

Trim to the Signal

At the same time, pasting an entire file when one function matters buries the signal in noise. Include the functions the new code will touch, the data shapes involved, and the conventions you care about—and nothing else. The discipline is curation, not volume. The framework article provides a structure for deciding what makes the cut.

Constrain the Output Format Deliberately

Unconstrained, models wrap code in explanation, scatter it across prose, or return more than you asked for.

Ask for Exactly What You Want Back

State the form of the response: a single function, a full file, a diff, code only, or code with a brief rationale. This is not pedantry—it determines how much you have to clean up before the output is usable. A prompt that ends with "return only the function, no explanation" saves a scroll-and-delete on every response.

Match Format to the Task

For a quick utility, code only is best. For something you are learning from or reviewing carefully, code plus a short explanation of the key decisions is worth the extra text. Choose deliberately rather than accepting whatever the model defaults to.

Make Verification Part of the Prompt

The best practitioners build verification into the request rather than bolting it on afterward.

Ask for Tests Alongside Code

Requesting tests with the implementation does two things: it gives you a verification harness, and it makes the model's assumptions explicit. The tests reveal what the model thought "correct" meant, which is often more informative than the code itself. Then read the tests with the same skepticism as the code—a test asserting wrong behavior is a trap.

Read Before You Run, Always

This is the non-negotiable. Read every line before executing it. Generated code carries subtle errors—nonexistent calls, security gaps, mishandled edge cases—that polish hides. Reading is the discipline that catches them, and it keeps you the author of your own codebase. The beginner's guide establishes this habit from day one for a reason.

Iterate Like a Conversation, Not a Lottery

When the first output misses, how you respond determines whether you converge or thrash.

Feed Back in the Model's Currency

Error messages are high-quality feedback because they pinpoint where the model's assumptions broke. Paste them verbatim. "It's broken" is low-information; a stack trace is precise. Describe the gap between expected and actual behavior in concrete terms, and the model corrects efficiently.

Restart When the Thread Sours

A thread that has anchored on a wrong assumption keeps reproducing it. The practice is to recognize the same error appearing twice and start fresh with a prompt that folds in what you learned. Persisting in a confused thread out of sunk cost is a documented mistake, not a virtue.

Build a Personal Prompt Library

The compounding practice that separates fast practitioners from slow ones is reuse.

Save What Works

When a prompt produces excellent results for a recurring task, save it as a template. Generating an endpoint, scaffolding tests, writing a migration—these recur, and a tuned template turns a fresh problem into a fill-in-the-blanks exercise. Over months this library becomes a serious productivity asset.

Encode Your Hard-Won Fixes

When you discover that adding a specific instruction reliably fixes a recurring error, bake it into the template. These accumulated corrections are the difference between a prompt that works the first time and one that needs three rounds. The case study traces how a team built exactly this kind of library.

Calibrate Effort to the Stakes

The final practice ties the others together: apply them in proportion to what the code is for.

Don't Over-Engineer Trivial Requests

A throwaway script does not need a specification, a context dump, or a generated test suite. Running the full discipline on a one-off command is wasted effort that trains you to resent the process. Drop to a clear one-line ask and a quick read of the result. The skill is not in always doing everything—it is in knowing what to skip.

Bring Full Rigor Where It Counts

Conversely, code that touches money, security, user data, or anything that will be maintained for years deserves every practice in this article and a second pass besides. The closer the code sits to real consequences, the more the front-loaded effort pays off in errors that never happen. Calibration is what keeps the discipline sustainable: heavy where it matters, light where it does not.

Frequently Asked Questions

Are these practices model-specific?

No. They are about process and communication, which hold across models and tools. Better models lower the error rate but do not make vague prompts, missing context, or skipped review acceptable. That is why these habits persist as tools improve.

How much time does all this add per task?

Less than the iterations it prevents. Front-loading context and constraints takes seconds and removes rounds of correction that take minutes. For trivial tasks you scale down; for production code the rigor pays for itself immediately.

What if my team disagrees on conventions to show the model?

That is a team problem worth solving regardless of AI. In the meantime, show the conventions for the specific area you are working in, drawn from the nearby code. Consistency within a module beats global consistency you do not have.

Is asking for tests always worth it?

For anything that has to be correct, yes—the tests double as a record of intended behavior. For throwaway scripts, no. The practice scales with stakes, like everything else here.

Key Takeaways

Frame the prompt as a specification, not a question, to force the decisions vague prompts leave open.
Show real example code rather than describing conventions, but trim context to the signal.
Constrain the output format deliberately so you spend no time cleaning up the response.
Build verification into the prompt by asking for tests, and always read every line before running it.
Iterate with verbatim error messages and restart cleanly when a thread anchors on a wrong assumption.
Save working prompts as templates and encode your hard-won fixes so they work the first time.

Treat the Prompt as a Specification

The most useful mental shift is to stop thinking of the prompt as a question and start thinking of it as a specification.

Why Specifications Beat Questions

Apply It at the Right Altitude

Front-Load Context, Ruthlessly Trimmed

Context is the highest-leverage input, but more context is not always better.

Show Real Code, Not Descriptions

Trim to the Signal

Constrain the Output Format Deliberately

Unconstrained, models wrap code in explanation, scatter it across prose, or return more than you asked for.

Ask for Exactly What You Want Back

Match Format to the Task

Make Verification Part of the Prompt

The best practitioners build verification into the request rather than bolting it on afterward.

Ask for Tests Alongside Code

Read Before You Run, Always

Iterate Like a Conversation, Not a Lottery

When the first output misses, how you respond determines whether you converge or thrash.

Feed Back in the Model's Currency

Restart When the Thread Sours

Build a Personal Prompt Library

The compounding practice that separates fast practitioners from slow ones is reuse.

Save What Works

Encode Your Hard-Won Fixes

Calibrate Effort to the Stakes

The final practice ties the others together: apply them in proportion to what the code is for.

Don't Over-Engineer Trivial Requests

Bring Full Rigor Where It Counts

Frequently Asked Questions

Are these practices model-specific?

How much time does all this add per task?

What if my team disagrees on conventions to show the model?

Is asking for tests always worth it?

For anything that has to be correct, yes—the tests double as a record of intended behavior. For throwaway scripts, no. The practice scales with stakes, like everything else here.

Key Takeaways

Frame the prompt as a specification, not a question, to force the decisions vague prompts leave open.
Show real example code rather than describing conventions, but trim context to the signal.
Constrain the output format deliberately so you spend no time cleaning up the response.
Build verification into the prompt by asking for tests, and always read every line before running it.
Iterate with verbatim error messages and restart cleanly when a thread anchors on a wrong assumption.
Save working prompts as templates and encode your hard-won fixes so they work the first time.

Opinionated Habits That Make AI Write Better Code

Treat the Prompt as a Specification

Why Specifications Beat Questions

Apply It at the Right Altitude

Front-Load Context, Ruthlessly Trimmed

Show Real Code, Not Descriptions

Trim to the Signal

Constrain the Output Format Deliberately

Ask for Exactly What You Want Back

Match Format to the Task

Make Verification Part of the Prompt

Ask for Tests Alongside Code

Read Before You Run, Always

Iterate Like a Conversation, Not a Lottery

Feed Back in the Model's Currency

Restart When the Thread Sours

Build a Personal Prompt Library

Save What Works

Encode Your Hard-Won Fixes

Calibrate Effort to the Stakes

Don't Over-Engineer Trivial Requests

Bring Full Rigor Where It Counts

Frequently Asked Questions

Are these practices model-specific?

How much time does all this add per task?

What if my team disagrees on conventions to show the model?

Is asking for tests always worth it?

Key Takeaways

Agency Script Editorial

Related Articles

Prompt Quality Decides Whether AI Earns Its Keep

Counting the Real Cost of Every Token You Send

Rolling Out AI Hallucinations Across a Team

Ready to certify your AI capability?

Opinionated Habits That Make AI Write Better Code

Treat the Prompt as a Specification

Why Specifications Beat Questions

Apply It at the Right Altitude

Front-Load Context, Ruthlessly Trimmed

Show Real Code, Not Descriptions

Trim to the Signal

Constrain the Output Format Deliberately

Ask for Exactly What You Want Back

Match Format to the Task

Make Verification Part of the Prompt

Ask for Tests Alongside Code

Read Before You Run, Always

Iterate Like a Conversation, Not a Lottery

Feed Back in the Model's Currency

Restart When the Thread Sours

Build a Personal Prompt Library

Save What Works

Encode Your Hard-Won Fixes

Calibrate Effort to the Stakes

Don't Over-Engineer Trivial Requests

Bring Full Rigor Where It Counts

Frequently Asked Questions

Are these practices model-specific?

How much time does all this add per task?

What if my team disagrees on conventions to show the model?

Is asking for tests always worth it?

Key Takeaways

Agency Script Editorial

Related Articles

Prompt Quality Decides Whether AI Earns Its Keep

Counting the Real Cost of Every Token You Send

Rolling Out AI Hallucinations Across a Team

Ready to certify your AI capability?