AGENCYSCRIPT
CoursesEnterpriseBlog
đź‘‘FoundersSign inJoin Waitlist
AGENCYSCRIPT

Governed Certification Framework

The operating system for AI-enabled agency building. Certify judgment under constraint. Standards over scale. Governance over shortcuts.

Stay informed

Governance updates, certification insights, and industry standards.

Products

  • Platform
  • Certification
  • Launch Program
  • Vault
  • The Book

Certification

  • Foundation (AS-F)
  • Operator (AS-O)
  • Architect (AS-A)
  • Principal (AS-P)

Resources

  • Blog
  • Verify Credential
  • Enterprise
  • Partners
  • Pricing

Company

  • About
  • Contact
  • Careers
  • Press
© 2026 Agency Script, Inc.·
Privacy PolicyTerms of ServiceCertification AgreementSecurity

Standards over scale. Judgment over volume. Governance over shortcuts.

On This Page

What Open and Closed Source Actually MeanThe Three Axes of OpennessCapability: The Gap Is Narrowing but Not GoneCost: API Convenience Versus Infrastructure OwnershipA Rough Decision HeuristicControl, Privacy, and Data ResidencyCustomization DepthOperational Reality and Hidden CostsPutting It Together: A Portfolio ApproachVendor Risk and Strategic IndependenceFrequently Asked QuestionsAre open source AI models actually free?Can closed models match open models on privacy?Which is better for a startup just getting started?Do open-weight licenses really restrict commercial use?How often do I need to migrate models?Key Takeaways
Home/Blog/Who Holds the Weights, the Cost Curve, and the Risk
General

Who Holds the Weights, the Cost Curve, and the Risk

A

Agency Script Editorial

Editorial Team

·January 2, 2026·7 min read
open vs closed source AI modelsopen vs closed source AI models guideopen vs closed source AI models guideai fundamentals

Every team building with AI eventually hits the same fork in the road: do you call a closed model through an API, or do you run an open-weight model you can host yourself? The marketing around this choice makes it sound like a values question. It is not. It is an engineering and business decision about who holds the model weights, who controls the cost curve, and who absorbs the risk when something breaks.

This guide gives you the full picture without the tribalism. We will define what "open" and "closed" actually mean, since the terms are slippery, then work through capability, cost, control, compliance, and the operational reality of running models in production. By the end you should be able to look at a specific use case and say with confidence which side of the line it belongs on.

What Open and Closed Source Actually Mean

The labels hide more than they reveal. "Closed source" models like the GPT and Claude families are served exclusively through an API. You send tokens, you get tokens back, and you never touch the weights. The provider controls versioning, safety filtering, rate limits, and pricing.

"Open source" is the messier term. Most models people call open are really open-weight: the trained parameters are downloadable, but the training data and full training code often are not. Llama, Mistral, Qwen, and Gemma fall here. A smaller set, like some OLMo releases, are genuinely open across data and code. The distinction matters because an open-weight license can still restrict commercial use, redistribution, or scale.

The Three Axes of Openness

  • Weights: Can you download and run the parameters yourself?
  • License: Does the license permit your specific commercial use and scale?
  • Reproducibility: Are the data and training recipe public enough to rebuild or audit?

Most decisions hinge on the first two. Reproducibility matters mainly for research and high-assurance regulated work.

Capability: The Gap Is Narrowing but Not Gone

For years the assumption was that closed frontier models won every benchmark and open models trailed by a generation. That gap has compressed sharply. Top open-weight models now match closed models on many reasoning, coding, and summarization tasks, especially at mid-range sizes.

Where closed models still tend to lead is the absolute frontier: the hardest multi-step reasoning, long-context coherence, and the newest multimodal capabilities. If your work depends on the single best model available this quarter, closed is usually where it lives first. If your work needs "very good and consistent," open-weight models cover an enormous amount of ground.

For a deeper walk through specific deployments, see our real-world examples and use cases.

Cost: API Convenience Versus Infrastructure Ownership

Closed models charge per token. There is no upfront cost, no GPUs to provision, and you scale to zero when idle. This is unbeatable for low or spiky volume. The catch is that per-token pricing becomes brutal at high, steady volume, and you have no leverage to negotiate the unit economics down except by switching providers.

Open-weight models flip the equation. You pay for GPUs—rented or owned—plus the engineering time to serve, monitor, and update them. At low volume this is wildly inefficient. At high, predictable volume it can be dramatically cheaper per token, and the cost is fixed rather than variable.

A Rough Decision Heuristic

  • Under ~1M tokens/day, bursty: Closed API almost always wins on total cost.
  • Tens of millions of tokens/day, steady: Self-hosted open-weight often wins, sometimes by 5-10x per token.
  • In between: Model it both ways; the answer depends on your latency and uptime requirements.

Control, Privacy, and Data Residency

This is where open-weight models have a structural advantage that no amount of closed-model engineering can fully match. When you host the weights, data never leaves your environment. For healthcare, defense, finance, and anyone bound by strict data residency rules, that property is sometimes non-negotiable.

Closed providers have responded with enterprise tiers, zero-retention API options, and regional hosting. These are real and often sufficient. But they are contractual guarantees, not architectural ones. You are trusting a third party rather than controlling the boundary yourself.

Customization Depth

Closed models let you fine-tune within the limits the provider exposes. Open-weight models let you do anything: full fine-tuning, quantization, LoRA adapters, distillation into smaller models, and surgical modification of behavior. If your edge depends on a model that behaves differently from what competitors can rent, open is the path.

Operational Reality and Hidden Costs

The brochure comparison ignores the part that sinks teams: running an open-weight model in production is an ongoing engineering commitment. You own inference optimization, GPU availability, autoscaling, observability, security patching, and the painful work of evaluating and migrating to new model versions yourself.

Closed providers absorb all of that. You trade control for the fact that a team of specialists keeps the model fast, available, and current. Many organizations underestimate this and discover that their "cheaper" self-hosted setup costs more once they price in the senior engineers babysitting it. The common mistakes article covers this trap in detail in our 7 common mistakes guide.

Putting It Together: A Portfolio Approach

The mature answer is rarely all-or-nothing. Sophisticated teams run a portfolio: a closed frontier model for the hardest tasks and prototyping, and open-weight models for high-volume, latency-sensitive, or privacy-bound workloads. Routing logic sends each request to the cheapest model that can do the job.

This hybrid stance hedges vendor risk, controls cost, and keeps you close to the frontier. To structure that decision repeatably, our framework article gives you a named model you can reuse.

Vendor Risk and Strategic Independence

Beyond cost and capability, there is a strategic dimension that quietly shapes many decisions: dependence. Building a business entirely on one closed provider means your product's economics, capabilities, and availability are subject to another company's roadmap, pricing, and policy changes. A price increase, a deprecated model, or a usage policy shift can ripple straight into your product overnight.

Open-weight models are the natural hedge. Even if you never self-host today, the existence of capable open models that you could run gives you negotiating leverage and an exit path. Many teams keep an open model warm in their stack precisely as insurance, not because they use it heavily. The optionality has value even when unexercised.

This does not mean closed providers are dangerous; they are excellent and most teams should use them. It means that betting your entire business on a single external dependency, of any kind, deserves a deliberate look. A modest investment in portability—the thin model interface, a tested open-model fallback—buys real strategic resilience for very little ongoing cost.

Frequently Asked Questions

Are open source AI models actually free?

The weights are free to download under permissive licenses, but running them is not free. You pay for GPU compute, engineering time, and operational overhead. For most workloads the dominant cost is infrastructure and people, not licensing.

Can closed models match open models on privacy?

They can get close through zero-retention APIs and regional hosting, which satisfy many compliance regimes. But these are contractual promises rather than architectural guarantees. If your requirement is that data physically never leaves your environment, only self-hosted open-weight models deliver that.

Which is better for a startup just getting started?

Almost always a closed API. The speed to a working product, zero infrastructure burden, and access to frontier capability outweigh cost concerns at low early volume. Revisit the decision once usage grows large and predictable.

Do open-weight licenses really restrict commercial use?

Some do. Certain licenses cap usage at a scale threshold or restrict specific applications. Always read the actual license for the specific model version before building a business on it, because terms vary between models and even between releases of the same model family.

How often do I need to migrate models?

With closed APIs, the provider may deprecate versions on their schedule, forcing periodic migration. With self-hosted open models, you migrate on your own timeline but own all the evaluation work. Either way, budget for re-testing your prompts and pipelines a few times a year.

Key Takeaways

  • "Open" usually means open-weight, not fully open data and code; read the license for your specific use and scale.
  • Capability gaps have narrowed; open models cover most needs, while closed models still lead at the absolute frontier.
  • Closed APIs win on cost at low or bursty volume; self-hosted open models win at high, steady volume.
  • Self-hosting gives architectural privacy and unlimited customization but adds a real, ongoing engineering burden.
  • The strongest strategy is usually a hybrid portfolio with routing, not a dogmatic commitment to one side.

Search Articles

Categories

OperationsSalesDeliveryGovernance

Popular Tags

prompt engineeringai fundamentalsai toolsthe difference between AIMLagency operationsagency growthenterprise sales

Share Article

A

Agency Script Editorial

Editorial Team

The Agency Script editorial team delivers operational insights on AI delivery, certification, and governance for modern agency operators.

Related Articles

General

Prompt Quality Decides Whether AI Earns Its Keep

Prompt quality is the single biggest variable in whether AI delivers real work or expensive noise. The model matters, the platform matters — but the prompt you write determines whether you get a first

A
Agency Script Editorial
June 1, 2026·10 min read
General

Counting the Real Cost of Every Token You Send

Tokens and context windows sit at the intersection of AI capability and operational cost—yet most business cases treat them as technical footnotes. That's a mistake that costs real money. Every time y

A
Agency Script Editorial
June 1, 2026·10 min read
General

Rolling Out AI Hallucinations Across a Team

Most teams discover AI hallucinations the hard way — a confident-sounding wrong answer makes it into a client deliverable, a legal brief, or a published report. The damage isn't just to the output; it

A
Agency Script Editorial
June 1, 2026·11 min read

Ready to certify your AI capability?

Join the professionals building governed, repeatable AI delivery systems.

Explore Certification