Skip to content

Topical cluster · for IT directors, principal architects, and senior engineers shaping AI systems

AI Architecture

AI architecture for Microsoft business apps means three things at once: the model layer (multi-provider, with cost and quality tradeoffs), the integration layer (Logic Apps, MCP, APIM, Foundry agents), and the data layer (Dataverse, AI Search, vector backends).

Articles

17

This cluster covers all three with a practitioner bias: real diagrams, real failure modes, real cost numbers. Logic Apps as MCP servers in production. Foundry agent patterns that scale beyond hello-world. Agentic development setups that ship.

If you are designing the AI fabric for a Microsoft-heavy enterprise, this is your starting point.

What this cluster covers

Subtopics in ai architecture

  • Azure AI landing zone reference architectures
  • Microsoft Foundry vs Azure OpenAI decision logic
  • MCP server patterns for Logic Apps and APIM
  • RAG over enterprise data (Dataverse, SharePoint, AI Search)
  • Multi-provider routing (Azure OpenAI, Claude, Foundry partner models)
  • Agentic development setups that survive production
  • Architecture diagrams that pass review

More in this cluster (16)

Azure AI Foundry New vs Classic: 2026 Migration Map

Azure AI Foundry New vs Classic: 2026 Migration Map

Azure AI Foundry new vs classic, decoded: the feature-parity matrix, what transfers in a hub migration, the two hard 2026 dates, and when to stay on classic.

Read
Foundry Hosted vs In-Process vs Copilot Studio Agents (2026 Decision)

Foundry Hosted vs In-Process vs Copilot Studio Agents (2026 Decision)

Foundry Hosted vs in-process vs Copilot Studio agents: a 2026 four-gate decision framework that picks the right path by who builds it and who runs it.

Read
AgentOps on Microsoft Foundry: A Practitioner Decode of the New CI/CD Reference Architecture (2026)

AgentOps on Microsoft Foundry: A Practitioner Decode of the New CI/CD Reference Architecture (2026)

Practitioner read of Microsoft's new Foundry CI/CD reference architecture: the 5-layer pipeline, evaluation-driven release gates, and where the architecture understates the operational work.

Read
From Assist to Execute: The Reference Architecture Implications Microsoft's Playbook Doesn't Draw (2026)

From Assist to Execute: The Reference Architecture Implications Microsoft's Playbook Doesn't Draw (2026)

The Assist-to-Execute shift in Microsoft's Agentic Patterns Playbook is the right conceptual move. This is the reference architecture implications the playbook stops short of drawing.

Read
The Six Agentic Adoption Patterns: A Practitioner Decode of Microsoft's New Playbook (2026)

The Six Agentic Adoption Patterns: A Practitioner Decode of Microsoft's New Playbook (2026)

A practitioner read of Microsoft's Agentic Transformation Patterns Playbook: six patterns, the 5x5 maturity model, CoE structures, what it understates.

Read
AI Orchestration for Legacy Systems: The Operational Front Door Pattern (2026)

AI Orchestration for Legacy Systems: The Operational Front Door Pattern (2026)

AI orchestration over 5-7 legacy systems with the systems of record unchanged. The Operational Front Door pattern, two-layer RAG, vendor-portable. CIO reference architecture.

Read
Enterprise AI Is More Than RAG: The Three Context Layers (2026)

Enterprise AI Is More Than RAG: The Three Context Layers (2026)

Enterprise AI is a context orchestration problem, not a retrieval problem. Three knowledge layers, three architecture levels, the operational truth distinction.

Read
Dataverse MCP, Business Skills, and Coding Agents: The 2026 Decode

Dataverse MCP, Business Skills, and Coding Agents: The 2026 Decode

Dataverse MCP server, Business Skills, Coding Agents plugin shipped May 5, 2026. Adopt-now-or-defer decision frame, five pilot gotchas, procurement surface.

Read
AI Proposal Writing on Foundry: Multi-Model Patterns That Ship

AI Proposal Writing on Foundry: Multi-Model Patterns That Ship

A four-phase proposal pipeline running on four different LLMs. Prompts, evaluator metrics, and the eval-suite that lets you swap models cleanly.

Read
Six Rules for LLM-Agnostic AI Agents on Microsoft Foundry

Six Rules for LLM-Agnostic AI Agents on Microsoft Foundry

Foundry Model Router routes 18 models across providers. Six rules for AI agents that survive the next vendor rotation, with a proposal-writing example.

Read
Azure AI Landing Zone: The 2026 Reference Architecture for IT Directors

Azure AI Landing Zone: The 2026 Reference Architecture for IT Directors

Microsoft says you do not need a separate AI landing zone. You need an application landing zone with networking, identity, and data wired right. Here is the 2026 reference architecture.

Read
Architecture Diagrams with Draw.io MCP Server and Claude Code

Architecture Diagrams with Draw.io MCP Server and Claude Code

Generate swimlanes, ERDs, and integration maps from text using Claude Code and the Draw.io MCP server. Free, git-friendly, no Visio needed.

Read
Logic Apps as MCP Servers - The Architecture That Actually Works

Logic Apps as MCP Servers - The Architecture That Actually Works

Turn Azure Logic Apps into MCP servers for AI agents. Two approaches, auth gotchas, cost math, and the architecture diagram Microsoft didn't draw.

Read
Agentic Development with Claude Code: The Setup That Actually Works

Agentic Development with Claude Code: The Setup That Actually Works

Build a multi-agent development environment with persistent memory, quality gates, and automated pipelines. The setup that turned one CLI tool into a content factory.

Read
Generating 20+ Architecture Diagrams in Minutes: A Batch-Generation Pattern

Generating 20+ Architecture Diagrams in Minutes: A Batch-Generation Pattern

Generate ERDs, network topologies, security models, CI/CD pipelines, and integration maps from code. The batch-generation pattern that replaces weeks of Visio work.

Read
Building AI Solutions on Azure: The Architecture That Actually Works

Building AI Solutions on Azure: The Architecture That Actually Works

Real Azure AI architecture with cost math, RAG patterns, and pricing traps that Microsoft's diagrams leave out.

Read

Common questions

AI Architecture FAQ

The questions that come up most often in ai architecture engagements. Answers grounded in Microsoft documentation and field experience.

Do I need a separate Azure AI landing zone?

No. Microsoft's Cloud Adoption Framework explicitly recommends deploying AI workloads into existing application landing zones, not a new top-level AI hierarchy. What changes inside the application landing zone is the networking (six private DNS zones, /22 spoke address space), identity (system-assigned managed identity, disable local key auth), and data plane (Purview, Defender CSPM, Sentinel hooks).

Should I migrate from Azure OpenAI to Microsoft Foundry?

Only if you need a Foundry-only feature: agents, non-OpenAI models, multi-agent workflows, Foundry IQ knowledge layer, or built-in tracing. The upgrade is opt-in and reversible. The Assistants API is the only hard deadline (retires August 26, 2026). For stable single-model GPT chat workloads with PTU reservations, defer the upgrade.

What is the right pattern for connecting Logic Apps to AI agents?

Logic Apps as MCP servers, fronted by Azure API Management for token gateway and rate limiting. The agent calls APIM, APIM authenticates and routes to the Logic App MCP endpoint, the Logic App orchestrates the underlying systems (Dataverse, ServiceNow, Salesforce). This isolates the agent from system-of-record idiosyncrasies.

When should I use a vector database vs Azure AI Search for RAG?

Use Azure AI Search for document-grounded RAG with enterprise Microsoft data (SharePoint, Dataverse, OneLake). Use a dedicated vector database (Pinecone, Qdrant, Cosmos DB for MongoDB vCore) when you need cross-region replication, sub-50ms latency, or vectors generated from non-text data. For most Microsoft-shop AI architectures, Azure AI Search is the default.