AGENCYSCRIPT
CoursesEnterpriseBlog
đź‘‘FoundersSign inJoin Waitlist
AGENCYSCRIPT

Governed Certification Framework

The operating system for AI-enabled agency building. Certify judgment under constraint. Standards over scale. Governance over shortcuts.

Stay informed

Governance updates, certification insights, and industry standards.

Products

  • Platform
  • Certification
  • Launch Program
  • Vault
  • The Book

Certification

  • Foundation (AS-F)
  • Operator (AS-O)
  • Architect (AS-A)
  • Principal (AS-P)

Resources

  • Blog
  • Verify Credential
  • Enterprise
  • Partners
  • Pricing

Company

  • About
  • Contact
  • Careers
  • Press
© 2026 Agency Script, Inc.·
Privacy PolicyTerms of ServiceCertification AgreementSecurity

Standards over scale. Judgment over volume. Governance over shortcuts.

On This Page

Step One: Write the Problem DownCapture these specificsPressure-test the paragraph before moving onStep Two: Decide Hosted or Self-HostedHow to decideWhat pushes you toward self-hostingStep Three: Determine Your Data NeedsTwo pathsStep Four: Choose the GlueStart minimalStep Five: Add Observability Before You Need ItWhat to instrumentObservability is cheapest to add before launchStep Six: Validate the Whole SystemHow to validateFrequently Asked QuestionsWhy write the problem down before choosing anything?Can I do these steps out of order?How long should this process take?What if I am not sure about my data needs?Do small projects really need observability?What if validation fails at the last step?Key Takeaways
Home/Blog/Step by Step Through an AI Tech Stack Decision
General

Step by Step Through an AI Tech Stack Decision

A

Agency Script Editorial

Editorial Team

·October 15, 2017·8 min read
choosing an AI tech stackchoosing an AI tech stack how tochoosing an AI tech stack guideai tools

Most advice about choosing an AI tech stack lists considerations without telling you what to do first, second, and third. That leaves you staring at a dozen valid concerns with no idea where to start. The decisions in an AI stack depend on each other in a specific order, and following that order is what keeps you from making a choice you have to unwind later.

This is a sequential process you can run today. Each step produces an output that feeds the next, so by the time you reach the end you have not just a list of components but a justified, coherent stack. Work through it in order, resist the urge to jump ahead to the fun part of picking tools, and you will end up with a stack you can defend rather than one you assembled by vibes.

Step One: Write the Problem Down

Before any tool enters the conversation, write a single paragraph describing the problem. This sounds trivial and is the step people skip most, which is why so many stacks solve the wrong problem well.

Capture these specifics

  • The exact task and what a correct output looks like.
  • The tolerance for error and what a wrong answer costs.
  • The latency and volume the use case demands.
  • The realistic budget per request.

This paragraph is the specification everything else gets measured against. If you cannot write it, you are not ready to pick tools, and any choice you make is a guess dressed up as a decision.

Pressure-test the paragraph before moving on

Once written, read the paragraph back and ask whether it would settle a disagreement. If two teammates read it and could still reasonably build different things, it is not specific enough yet. The most common vague spot is the error tolerance. "It should be accurate" is not a specification; "a wrong field here flows into a payment, so accuracy must exceed the current manual rate" is. The sharper the paragraph, the more decisions it makes for you automatically downstream. Spending an extra twenty minutes here saves days later, because every fuzzy phrase you leave in the paragraph becomes a fuzzy decision you have to make again under pressure.

Step Two: Decide Hosted or Self-Hosted

With the problem written, make the one decision that shapes the most: do you call a hosted model API or run a model yourself? This early fork determines your cost structure, your operational burden, and your data flow.

How to decide

  • Default to hosted unless a specific constraint rules it out.
  • Choose self-hosted only if data residency, scale economics, or customization genuinely force it.

Make this decision deliberately because it is expensive to reverse. The rest of the steps assume you have settled it. Choosing it carelessly is one of the failure modes detailed in 7 Common Mistakes with Choosing an AI Tech Stack.

What pushes you toward self-hosting

Most teams should hear the case for self-hosting and then realize it does not apply to them, which is fine; the point is to check rather than assume. Three constraints genuinely justify it. The first is data residency: if your data legally or contractually cannot leave your environment, a hosted API is off the table. The second is scale economics: at extreme volume, the per-call cost of a hosted model can exceed what running your own would cost, flipping the math. The third is deep customization that a hosted model cannot accommodate. If none of these three describe your situation, default to hosted and move on, because self-hosting brings an operational burden you will otherwise be carrying for no reason.

Step Three: Determine Your Data Needs

Now decide whether the model needs information beyond its general knowledge. Ask your problem paragraph: does answering correctly require facts the model could not already know?

Two paths

  • No external data: the model answers from general knowledge, and you skip the retrieval layer entirely for now.
  • External data: you need retrieval, which means a way to store your information and fetch the relevant pieces for each request.

Do not build retrieval you do not need. Many first stacks are over-engineered with vector stores for problems that never required them. Add the data layer when the problem demands it, not preemptively.

Step Four: Choose the Glue

With model and data decided, choose how they connect. This is the orchestration step, and the right answer for a first build is usually less than you think.

Start minimal

  • Write explicit code that takes a request, fetches data if needed, calls the model, and returns the result.
  • Keep prompt templates in one place you can version.
  • Add a framework only when explicit code becomes genuinely unwieldy.

Minimal glue is easier to debug and easier to reason about. You can always add structure later; removing a framework you outgrew is much harder.

Step Five: Add Observability Before You Need It

Build in the ability to see what your system is doing before you go live, not after something breaks. This step is short but easy to defer into a future that never arrives.

What to instrument

  • Log each request, the data retrieved, the prompt sent, and the response received.
  • Track latency and cost per request from day one.
  • Capture a sample of outputs for quality review.

Without this, your first production problem becomes a guessing game. With it, you can diagnose issues from evidence rather than intuition.

Observability is cheapest to add before launch

The reason this step lands at five rather than later is timing. Adding logging to a system that is already live, already failing, and already under scrutiny is stressful and rushed. Adding it before launch is calm and cheap, often just a few lines that record each request and response. The payoff arrives the first time something behaves strangely in production and you can pull up exactly what the system saw and did, instead of trying to reproduce a problem you cannot see. Teams that defer this step almost universally regret it, because the regret arrives precisely when they have the least time to act on it.

Step Six: Validate the Whole System

The final step is to test the assembled stack against the problem paragraph from step one. Not each component in isolation, but the whole thing doing the actual job.

How to validate

  • Run real examples end to end and check the outputs against your definition of correct.
  • Confirm latency and cost meet the targets you wrote down.
  • Deliberately feed bad inputs and confirm the system fails gracefully.

If the assembled stack meets the specification, you are done and you can defend every choice. If it does not, the step that broke tells you where to revisit. The whole sequence holds together the same way a good Everything That Goes Into an AI Tech Stack Decision overview describes, with each layer making the next coherent.

Frequently Asked Questions

Why write the problem down before choosing anything?

Because the problem statement is the specification every later choice gets measured against. Skipping it means you pick tools against an unstated, fuzzy target and often solve the wrong problem.

Can I do these steps out of order?

You can, but the later steps depend on the earlier ones. Choosing glue before deciding hosted versus self-hosted, for instance, means you may rebuild the glue once that decision changes everything around it.

How long should this process take?

For a first project, an afternoon for the early steps and a few days to validate. The point is not speed but doing each step deliberately so you are not unwinding decisions later.

What if I am not sure about my data needs?

Default to no retrieval and build the simplest version. If the outputs are wrong because the model lacks specific information, that is your signal to add the data layer. Let the problem reveal the need.

Do small projects really need observability?

Yes, even if it is just thorough logging. The cost is small and the payoff is huge the first time something behaves unexpectedly in production and you need to see why.

What if validation fails at the last step?

The component that failed tells you where to go back. Failed validation is the process working, catching a mismatch before it reaches users rather than after.

Key Takeaways

  • Write a one-paragraph problem statement first; it is the specification everything else is measured against.
  • Decide hosted versus self-hosted early, because it shapes cost, operations, and data flow.
  • Add a data layer only when the problem genuinely requires facts the model cannot already know.
  • Start with minimal, explicit glue and adopt a framework only when complexity demands it.
  • Build observability before launch so your first production issue is diagnosable from evidence.
  • Validate the assembled stack against your original problem statement, not just each part alone.

Search Articles

Categories

OperationsSalesDeliveryGovernance

Popular Tags

prompt engineeringai fundamentalsai toolsthe difference between AIMLagency operationsagency growthenterprise sales

Share Article

A

Agency Script Editorial

Editorial Team

The Agency Script editorial team delivers operational insights on AI delivery, certification, and governance for modern agency operators.

Related Articles

General

Prompt Quality Decides Whether AI Earns Its Keep

Prompt quality is the single biggest variable in whether AI delivers real work or expensive noise. The model matters, the platform matters — but the prompt you write determines whether you get a first

A
Agency Script Editorial
June 1, 2026·10 min read
General

Counting the Real Cost of Every Token You Send

Tokens and context windows sit at the intersection of AI capability and operational cost—yet most business cases treat them as technical footnotes. That's a mistake that costs real money. Every time y

A
Agency Script Editorial
June 1, 2026·10 min read
General

Rolling Out AI Hallucinations Across a Team

Most teams discover AI hallucinations the hard way — a confident-sounding wrong answer makes it into a client deliverable, a legal brief, or a published report. The damage isn't just to the output; it

A
Agency Script Editorial
June 1, 2026·11 min read

Ready to certify your AI capability?

Join the professionals building governed, repeatable AI delivery systems.

Explore Certification