Itential MCP

Designing MCP Servers That Don’t Become a New Mess

William Collins

Director of Technical Evangelism ‐ Itential

Designing MCP Servers That Don’t Become a New Mess

Designing MCP Servers That Don’t Become a New Mess

April 10, 2026
William Collins

Director of Technical Evangelism ‐ Itential

Designing MCP Servers That Don’t Become a New Mess

Quick Summary

MCP server design best practices go far beyond wrapping APIs – the teams who win with Model Context Protocol will be the ones who design curated, intention-level capabilities, not the ones who expose the most tools. This post breaks down six principles for building MCP servers that scale safely in production infrastructure operations: outcome-first tool design, aggressive surface area curation, bounded context resources, prompt discipline, deterministic guardrails, and production-grade failure handling.

AutoCon 4 in Austin was one of those conferences that felt equal parts exciting and unnerving.

Exciting because AI tooling has crossed the threshold from “cool demo” into “teams are shipping this.”

Unnerving because you could hear the same question behind a lot of hallway conversations:

Is this about to become another integration mess?

In my session, MCP Beyond the API Wrapper, I framed MCP (Model Context Protocol) as a way to reduce the N × M integration explosion we are walking into as multiple models collide with multiple tools and data systems.

But after the talk, one question kept coming up:

Okay, but how do we design MCP servers so they actually help, instead of becoming a standardized version of the same old sprawl?

That’s what this post is about. (You can check out my first post here, where I unpacked why MCP isn’t “just an API wrapper” and why it changes the integration math for model-to-tool connectivity.)

The Thesis: MCP Is a Standard, Your Capability Design Is the Differentiator

MCP gives us a consistent interface layer between models (clients) and tools and data systems (servers). That alone is valuable. It makes tool discovery and interaction more structured and portable across AI clients.

But the protocol itself does not guarantee a good outcome.

If you treat MCP server design as an API wrapper exercise, you get:

  • hundreds of granular “tools”
  • ambiguous selection behavior
  • bloated context windows
  • higher failure rates
  • more operational risk

If you treat MCP server design as a product problem you get something different, like the Itential MCP Server:

  • curated, intention-level capabilities
  • predictable tool selection
  • bounded context
  • deterministic execution around probabilistic reasoning
  • a path toward governance

So let’s talk about how to build MCP servers that scale.

1. Start With Outcomes, Not Endpoints

The easiest way to build an MCP server is to map endpoints to tools. That is also the fastest way to make MCP feel like “just another abstraction layer.”

Because now your model has to behave like an integration engineer: call tool A, then tool B, then tool C, interpret tool D’s output, then decide which tool to use next.

That’s not what you want a model doing in production operations.

Instead, start with the outcomes you want the model to achieve safely:

  • “check compliance for a device group against a policy”
  • “summarize drift and risk against golden intent”
  • “stage a change and generate a rollout plan”
  • “propose remediation within constraints”

When you expose outcomes, you shift complexity out of the model and back into deterministic automation logic, where it belongs.

Rule of thumb: If a tool requires the model to call 5 other tools to produce a meaningful operational result, you exposed the wrong level of abstraction.

2. Curate the Tool Surface Area Aggressively

MCP is built for discovery. That is one of its strengths. It’s also one of its sharpest edges.

If your MCP server exposes “everything,” you are creating a choice architecture that models struggle to navigate consistently. High tool count tends to create:

  • tool confusion
  • increased token consumption
  • higher failure rates
  • inconsistent paths to the same outcome

Instead, curate for:

  • clarity: tools should be distinct, not overlapping
  • minimalism: fewer tools, better descriptions
  • predictability: small action space yields better selection behavior
  • safety: separate read tools from write tools, and make writes explicit

This is where MCP server design becomes a discipline, not an implementation detail. The number of tools you don’t expose matters as much as the ones you do.

3. Resources Are Context Contracts, Not Data Dumps

MCP supports resources so that models can retrieve context without invoking actions. This is powerful, but it’s easy to misuse.

A bad pattern is exposing giant resources: full inventories, full configs, broad logs, entire databases. That doesn’t “help the model.” It overwhelms it.

A better pattern is treating resources as bounded context contracts that align to decision-making:

  • scoped inventory for a target group
  • relevant config fragments
  • the minimum policy text needed for compliance reasoning
  • just enough context to make a safe decision

Keep resources small and purposeful. Large context increases hallucination risk because the model is forced to reason over noise.

4. Prompts Are Your Standard Operating Procedures for Models

In MCP, prompts are not just convenience templates. They are one of the most practical mechanisms you have to standardize operational behavior across clients and use cases.

Think of prompts as SOPs for models. This is where you encode:

  • how to interpret results
  • what validation steps to run
  • how to handle uncertainty
  • when to stop and escalate
  • what “done” actually means

If you want reliability, prompt discipline matters as much as tool design.

5. Guardrails: Deterministic Control Around Probabilistic Reasoning

Here’s the tension that defines AI operations:

Models reason probabilistically. Infrastructure must execute deterministically.

MCP does not eliminate this. But it gives you a structured interface where controls can be consistently applied.

At minimum, production MCP servers need:

  • least privilege access
  • scoped credentials and token discipline
  • separation of read and write tools
  • auditability of tool execution
  • explicit side-effect disclosure for any state-changing action

OWASP’s MCP security guidance highlights risks like over-privileged tool exposure, secret leakage, and unsafe tool invocation – and it reinforces why strong access control and context isolation are foundational.

A practical pattern that works well: make reads easy, make writes expensive, require validation gates before execution, and use orchestration for deterministic workflows rather than agent improvisation.

6. Production Reliability Comes Down to Behavior Under Failure

The difference between a demo and production is always the same: what happens when it fails?

In operational environments, MCP tools need:

  • structured error reporting
  • clear failure modes
  • progress updates for long-running operations
  • consistent logging and observability
  • predictable responses that clients can recover from

Without this, models will compensate. They will guess. They will retry. They will loop. That’s how you turn a minor tool failure into an agent incident.

What “Good” Looks Like

A well-designed MCP server usually has:

  • a small set of intent-level tools
  • resources that provide scoped context
  • prompts that encode operational discipline
  • explicit side effects and constraints
  • least privilege tool exposure and auditability
  • production-grade failure behavior

Most importantly: the model is not integrating your systems. Your automation and orchestration platform is. 

The model decides the intent. The platform executes deterministically with guardrails. That division of labor is what scales.

MCP Standardizes Access. Capability Design Standardizes Outcomes.

MCP is rapidly becoming a default way for models to discover and use tools. The protocol’s value is portability and consistency across the AI ecosystem.

But the teams who win with MCP will not be the ones who expose the most tools. They will be the ones who design the best capabilities – purposeful, curated, safe, observable, and aligned to operational outcomes.

Because infrastructure doesn’t care that the operator is a model. It still demands control, predictability, and accountability.

Want to see the AutoCon 4 session?

If you want the full talk that sparked this post, you can watch the on-demand session here or below.

And if you’re exploring how MCP fits into real orchestration workflows, the AutoCon 4 workshop repo is a great starting point for understanding the practical patterns teams are already using.

William Collins

Director of Technical Evangelism ‐ Itential

William Collins is a strategic thinker and a catalyst for innovation, adept at navigating the complexities of both startups and large enterprises. With a career centered on scalable infrastructure design, he serves as Itential’s Director of Technical Evangelism. Here, he leads the charge in network automation, leveraging his deep roots in cloud architecture and network engineering. William hosts The Cloud Gambit Podcast, diving into cloud computing strategy, markets, and emerging trends with industry experts. Outside of transforming networks, you can find him enjoying time with family, playing ice hockey, and strumming guitar.

More from William Collins