How many rows does the Fabric Data Agent return per response?

Responses are capped at 25 rows and 25 columns. The Data Agent is designed for conversational insights, not for returning full datasets. If you need complete data exports, use direct SQL queries or notebooks instead.

What is the difference between Fabric Data Agent and Operations Agent?

The Data Agent is reactive — users ask it questions in natural language and it queries Fabric data sources to answer. The Operations Agent is proactive — it monitors KQL databases on a 5-minute polling cycle, detects conditions, and sends recommendations to Teams for human approval. They serve different purposes and can be used together.

What capacity does the Fabric Data Agent require?

You need an F2 or higher Fabric capacity, or Power BI Premium P1 or higher with Fabric enabled. Trial capacities are not supported. The Data Agent uses Capacity Units (CUs) for both AI processing and data queries.

Can the Fabric Data Agent integrate with Copilot Studio and Microsoft Foundry?

Yes. The Fabric Data Agent publishes an API endpoint that can be called from Copilot Studio as a plugin or from Microsoft Foundry as a tool. The Foundry agent handles conversation orchestration while the Fabric Data Agent answers data questions from governed Fabric sources.

In-Depth Guide · Microsoft Fabric AI

Microsoft Fabric Data Agent — Complete Guide (2026)

Q: Does the Fabric Data Agent support service principal authentication?

Service principal support for Fabric Data Agent was released as a preview feature in June 2026. Previously, only user identity authentication (delegated token) was supported. The preview enables applications to authenticate using their own app identity, which is required for backend services and production deployments.

How to set up, configure, and deploy the Fabric Data Agent — covering supported data sources, response limits, service principal authentication (preview), Copilot Studio and Microsoft Foundry integration, governance, and confirmed limitations. Verified against official Microsoft documentation through June 2026.

Quick Answer

The Microsoft Fabric Data Agent is a conversational AI interface that lets users ask natural language questions against Fabric data sources — Lakehouse, Warehouse, KQL databases, and Power BI semantic models. You connect up to 5 data sources per agent. Responses are capped at 25 rows and 25 columns. The agent publishes an API endpoint callable from Copilot Studio, Microsoft Foundry, and custom apps. Requires F2 or higher Fabric capacity. Currently in public preview with service principal authentication added in June 2026.

📅 Last verified: June 2026 ⏱ ~14 min read ✍️ A.J., Data Engineering Researcher 🔗 Source: Microsoft Learn

🔶

Public Preview

The Microsoft Fabric Data Agent is currently in public preview. Response results are capped at 25 rows and 25 columns. Service principal authentication was added as a preview feature in June 2026. Features and limitations will change as the product moves toward general availability. Verify current status at Microsoft Learn before any production deployment.

Section 01

What Is the Microsoft Fabric Data Agent?

The Fabric Data Agent is a conversational AI layer built on top of your Fabric data sources. You configure it with data source connections and instructions, publish it, and users interact with it through a chat interface — asking questions in plain English and receiving answers drawn directly from your governed Fabric data.

Under the hood, the agent uses Azure OpenAI Assistant APIs to parse each question, check user permissions, generate the appropriate SQL or KQL query, execute it against the connected sources, and format the result. Every step respects your existing row-level security, Purview governance policies, and workspace boundaries.

The agent is distinct from a dashboard or a report. It does not visualise data — it answers questions about it. Think of it as a governed, permission-aware query interface that anyone on your team can use without SQL knowledge.

ℹ️

How It Sits Inside the Fabric AI Ecosystem

Fabric has two AI agent types that are easy to conflate. The Data Agent is reactive — users ask questions, it queries data sources. The Operations Agent is proactive — it monitors KQL databases on a 5-minute cycle and pushes recommendations to Teams. They solve different problems and can be used together in the same architecture.

Field note — A.J., UIG Data Lab

The 25-row response cap is the most common source of confusion when teams first deploy the Data Agent. It is not a bug — it is by design. The Data Agent is built for conversational insights, not data exports. If your use case requires returning full datasets, the right tool is a notebook, a pipeline, or a direct SQL endpoint — not the Data Agent.

Section 02

Fabric Data Agent — Supported Data Sources

The Data Agent connects to data sources within your Fabric workspace. You can connect up to 5 sources per agent. Each source type has specific permissions and configuration requirements.

Data Source	What the Agent Queries	Permission Required	Supported
Lakehouse	Delta tables via SQL analytics endpoint	Read access on the Lakehouse	✅ Yes
Data Warehouse	Tables and views via T-SQL	Read access on the Warehouse	✅ Yes
KQL Database	Tables via KQL queries	Read access on the KQL database	✅ Yes
Power BI Semantic Model	Measures and tables via DAX	Read (Write not required)	✅ Yes
External databases	Not directly accessible	Mirror to Fabric first	❌ No
Shortcut tables	Depends on shortcut type	Varies	⚠ Partial
SQL Database in Fabric	Tables via T-SQL	Read access	✅ Yes

🚨

5-Source Limit Per Agent

Each Data Agent can connect to a maximum of 5 data sources. If your use case spans more than 5 sources, create multiple specialised agents — one per domain — rather than a single agent trying to do everything. Specialised agents also produce more accurate answers because the context is narrower.

Schema Quality Directly Impacts Answer Accuracy

For each connected source, the agent reads column names, table names, and any descriptions you provide. Ambiguous column names like col_a or val1 produce poor query generation. Descriptive names like net_revenue_usd or order_status_code produce reliable results. Time spent on schema documentation pays back in agent accuracy.

You can provide up to 100 example queries per data source. These are the highest-impact investment in agent quality — real questions paired with correct expected answers teach the model how your business actually talks about its data.

Section 03

How to Set Up the Fabric Data Agent

⚙️

Prerequisites

F2 or higher Fabric capacity (or P1 Premium with Fabric enabled). Trial capacities are not supported.
Admin or Member role in the Fabric workspace where the agent will live.
At least one Lakehouse, Warehouse, KQL database, or Power BI semantic model in the same workspace.
Copilot and AI features enabled at the tenant level by a Fabric admin.
Read access to all data sources you intend to connect.

Open your Fabric workspaceNavigate to your workspace in the Fabric portal. Confirm your capacity is F2 or higher — the Data Agent item will not appear on Trial capacities.
Create a new Data Agent itemSelect + New item → Data Agent (under AI + Machine Learning). Give it a name that reflects the domain it covers, such as “Sales Analytics Agent” or “Finance Q&A Agent”.
Connect data sourcesAdd up to 5 data sources. For each source, select specific tables rather than connecting entire databases — this keeps the agent’s context tight and improves query accuracy.
Add table and column descriptionsFor every table you connect, add a plain-English description of what it contains. For ambiguous columns, add inline descriptions. This is the single most impactful step for answer quality.
Write agent instructionsDefine how the agent should interpret questions — which source to prefer for which type of query, what business terms map to which columns, and any compliance constraints (e.g. never surface PII columns). See Section 04 for detail.
Add example queries (up to 100 per source)Provide real business questions with the correct expected SQL or result format. Examples are the highest-signal training input — use actual questions your team asks, not generic demos.
Test in the chat interfaceAsk the agent your real questions. Check that table joins, filters, and metric calculations are correct. Refine instructions and examples based on failures before going further.
Verify governance settingsConfirm that RLS filters on your data sources are working correctly. Test with accounts from different roles to confirm each user sees only their permitted data. Purview DLP policies are automatically enforced — verify none will block expected queries.
Publish the agentPublishing generates the API endpoint. This endpoint is what Copilot Studio, Microsoft Foundry, and custom applications call. Copy the workspace_id and artifact_id from the URL — you’ll need them for integrations.
Share accessGrant users Read permission on the agent item and on its underlying data sources. Both are required — access to the agent alone is not sufficient.

Section 04

Writing Effective Data Agent Instructions

Agent instructions are plain text that you provide during setup. They define how the agent interprets ambiguous questions, which source to prefer for which query type, and any business rules or compliance constraints the agent must follow.

What to Cover in Your Instructions

📐

Define Your Key Metrics

Revenue = net_revenue_usd (after returns and cancellations)
Margin = (net_revenue – cogs) / net_revenue × 100
Active customer = purchased within last 90 days

🗺️

Map Source Priority

For current-month data: use the Lakehouse (refreshed hourly)
For historical trends: use the Warehouse (optimised for aggregations)
For pre-calculated KPIs: use the Power BI semantic model

🏷️

Define Synonyms

“Revenue” = net_revenue_usd
“Orders” = transactions table
“Customer” = unique_account_id
“SKU” = product_code

🔒

Set Compliance Rules

Never surface columns: email, phone, ssn, dob
Flag results where row count is below 10 (anonymisation risk)
Always round financial figures to two decimal places

⚠️

Instructions Are Not a Security Boundary

Agent instructions shape how the model interprets and responds to questions — but they do not enforce data access. Real security comes from RLS on source tables, Purview DLP policies, and workspace permissions. Never rely on instructions alone to hide sensitive data. Enforce access at the data layer.

Field note — A.J., UIG Data Lab

The most common instruction mistake is writing them too broadly — something like “answer questions about sales data.” That tells the model nothing useful. The instructions that actually improve answer quality are specific: column-level definitions, source-priority rules for edge cases, and explicit handling of known ambiguities in your schema. Treat instructions the same way you would treat a data dictionary — specific, precise, and reviewed by someone who knows the data.

Section 05

Authentication, API Access & Service Principal Support

Authentication Model

The Data Agent enforces the calling user’s identity for data access. When a user asks a question, the agent queries the underlying sources using that user’s permissions — so RLS and object-level security apply automatically based on who is asking, not who configured the agent.

Authentication Type	Status (June 2026)	Use Case
Delegated user token (interactive)	✅ GA	In-product chat, direct user sessions
Service principal authentication	🔶 Preview	Backend apps, automated workflows, Microsoft Foundry production deployments
Managed identity	❌ Not supported	—

✅

Service Principal Support — June 2026 Preview

Service principal authentication for the Data Agent was released as a preview feature in June 2026. Applications can now authenticate to the Data Agent using their own app identity — no user sign-in required per request. This unlocks backend services, shared-backend multi-user apps, and production Microsoft Foundry deployments where requiring a delegated user token was a blocker.

Calling the Data Agent API

Once published, the Data Agent exposes an API endpoint. The URL contains two identifiers you copy from the agent’s URL in Fabric:

workspace_id — the GUID of your Fabric workspace
artifact_id — the GUID of the Data Agent item

The agent is callable from Copilot Studio as a plugin, from Microsoft Foundry as a tool using MicrosoftFabricAgentTool, and from custom applications using the Azure AI Projects SDK. SDKs are available for Python, TypeScript, and JavaScript. The Fabric Data Agent and the Foundry project must be in the same tenant.

Section 06

Integrations — Copilot Studio, Microsoft Foundry & Teams

Microsoft Foundry Integration

The most common production pattern uses the Fabric Data Agent as a tool inside a Microsoft Foundry agent. The Foundry agent handles multi-turn conversation, orchestration logic, and the user-facing experience. The Fabric Data Agent handles grounded data queries — returning accurate, permission-aware answers from your enterprise Fabric sources.

During development, interactive browser authentication works. For production deployment to a web app or custom application, service principal authentication (preview, June 2026) is required — the application authenticates with its own identity rather than routing every request through a user sign-in flow.

Copilot Studio Integration

You can register the published Data Agent endpoint as a plugin inside Copilot Studio. Users interact through Teams or a web interface — Copilot Studio handles the conversation layer, and the Data Agent answers data-specific questions. This pattern works well for broad enterprise rollouts where the Data Agent covers one domain inside a larger company-wide assistant.

Teams Direct Access

For simpler deployments, the Data Agent can be accessed directly from within Fabric’s chat interface, shared as a workspace item, or embedded in Teams through Copilot Studio without building a custom application. This is the fastest path to getting the agent in front of users.

📌

Improved Query Routing — June 2026

The June 2026 Fabric update improved the Data Agent orchestrator’s ability to route queries to the correct data source, particularly in multi-source configurations. If you previously noticed the agent consistently querying the wrong source for certain question types, updating your instructions to reflect source priority is worth revisiting after this update.

Section 07

Fabric Data Agent vs Operations Agent

These two agent types are often confused because both carry the Fabric “agent” label. They solve completely different problems.

🧠 Data Agent

ModeReactive — user asks a question, agent answers

SourcesLakehouse, Warehouse, KQL, Power BI semantic models (up to 5)

TriggerUser query (on demand)

OutputText answer with data (max 25 rows, 25 columns)

InterfaceChat (Fabric, Copilot Studio, Foundry, Teams)

Primary userBusiness users, analysts, anyone with a question

ActionsNone — answers only, no automated execution

⚡ Operations Agent

ModeProactive — monitors data, pushes alerts without being asked

SourcesKQL databases in Eventhouse only

Trigger5-minute polling cycle (continuous)

OutputTeams notification with recommended action

InterfaceMicrosoft Teams (Fabric Operations Agent app)

Primary userDevOps, SRE, Ops teams who need monitoring

ActionsPower Automate flows triggered on human approval

Use both together when your use case includes both reactive analytics and proactive monitoring. The Operations Agent fires an alert when an anomaly is detected; the Data Agent helps the team investigate by answering follow-up questions about the underlying data.

Section 08

Fabric Data Agent — Confirmed Limitations (June 2026)

These are confirmed, current constraints from official Microsoft documentation. Plan around them before deployment.

25 row / 25 column capEvery response is capped at 25 rows and 25 columns. This is by design — the agent is for conversational insights, not full dataset export.

5 data sources per agentMaximum 5 connected sources per agent item. For broader coverage, create multiple specialised agents.

100 example queries per sourceYou can provide up to 100 example Q&A pairs per data source. These are the most impactful quality input available.

Service principal: preview onlyService principal authentication was added as a preview feature in June 2026. Not yet generally available — validate thoroughly before production use.

No external database accessOnly Fabric-native sources are supported. External databases must be mirrored into Fabric first before the agent can query them.

Chat history affects resultsPrevious messages in the same session influence subsequent responses. If results are unexpectedly limited (e.g. from an earlier 25-row cap), start a new chat session.

Purview DLP can block queriesIf Purview Data Loss Prevention or access restriction policies apply to a source, the agent may return truncated or blocked results. Test DLP interaction before rollout.

Same tenant requirementFor Foundry integration, the Fabric Data Agent and the Foundry project must be in the same Microsoft tenant. Cross-tenant setups are not supported.

F2 minimum capacityTrial capacities are not supported. Requires F2 or higher, or Power BI Premium P1 or higher with Fabric enabled.

Conversational, not analyticalThe agent is not a replacement for notebooks, SQL endpoints, or BI tools for heavy analytical work. It is designed for conversational question-and-answer interactions.

⚠️

Still in Public Preview

The Data Agent is not generally available. Feature behaviour, response limits, and authentication support will change as the product evolves. Check the official concept page and the setup guide before any production deployment.

Section 09

Fabric Data Agent FAQ

What data sources does the Fabric Data Agent support?

The Fabric Data Agent supports Lakehouse tables, Data Warehouse tables, KQL databases, Power BI semantic models, and SQL Databases in Fabric — all within the same Fabric workspace. You can connect up to 5 sources per agent. External databases must be mirrored into Fabric first. Shortcut tables have partial support depending on the shortcut type.

How many rows does the Fabric Data Agent return?

Responses are capped at 25 rows and 25 columns. This is by design — the agent is built for conversational insights, not full dataset retrieval. If you need complete data, use a notebook, a SQL endpoint, or a pipeline. Previous chat history can also affect what rows are returned in follow-up questions, so starting a new session is recommended when switching topics.

Does the Fabric Data Agent support service principal authentication?

Service principal support was added as a preview feature in June 2026. Previously, only delegated user token (interactive sign-in) was supported. The preview enables backend applications to authenticate using their own app identity — which is required for production Microsoft Foundry deployments, backend services, and shared-backend multi-user applications.

What is the difference between the Data Agent and Operations Agent?

The Data Agent is reactive — users ask it questions and it queries your data sources to answer. It supports Lakehouse, Warehouse, KQL, and Power BI sources. The Operations Agent is proactive — it monitors KQL databases every 5 minutes, detects conditions you define, and sends recommendations to Microsoft Teams for human approval. They serve different purposes and work well alongside each other.

What Fabric capacity does the Data Agent require?

You need F2 or higher Fabric capacity, or Power BI Premium P1 or higher with Fabric enabled on that capacity. Trial capacities are explicitly not supported. The Data Agent consumes Capacity Units (CUs) for both AI processing and data queries — track usage in the Microsoft Fabric Capacity Metrics app.

Does the Data Agent enforce row-level security?

Yes. The Data Agent enforces RLS defined on the underlying source tables, object-level security (users can only query tables they have permission to see), and Purview DLP policies. All data access runs under the calling user’s identity — not the agent creator’s. This means each user automatically sees only their permitted data, and no special configuration is needed to apply existing security models.

Can the Data Agent work with Microsoft Foundry?

Yes. The Fabric Data Agent can be used as a tool inside a Microsoft Foundry agent using MicrosoftFabricAgentTool. The Foundry agent handles conversation orchestration while the Fabric Data Agent provides grounded, permission-aware answers from your Fabric data. The Data Agent and the Foundry project must be in the same tenant. For production deployments, service principal authentication (preview, June 2026) is required.

Section 10

Official Resources — Microsoft Learn Documentation

All links go directly to official Microsoft documentation. No third-party summaries.

📄

Fabric Data Agent Overview — Concept & Architecturelearn.microsoft.com · Architecture, supported sources, response limits, governance

↗ ⚙️

Create a Fabric Data Agent — Step-by-Step Setuplearn.microsoft.com · Prerequisites, configuration, permissions, publishing

↗ 🔗

Use the Fabric Data Agent with Microsoft Foundrylearn.microsoft.com · MicrosoftFabricAgentTool, Python + TypeScript SDK examples, authentication

↗ 🔐

Service Principal Support for Data Agents — June 2026 Previewcommunity.fabric.microsoft.com · What service principal unlocks, deployment flow, use cases

↗ ⚡

Microsoft Fabric Real-Time Intelligence Overviewlearn.microsoft.com · Operations Agent, Eventstream, Eventhouse, Activator

↗ 🤖

Operations Agent in Microsoft Fabric — Complete Guideultimateinfoguide.com · Setup, KQL sources, governance, limitations

↗ 🧠

Microsoft Fabric RAG Tutorial — Build Trustworthy AI on Your Dataultimateinfoguide.com · RAG architecture, indexing, grounding AI outputs in Fabric

↗ 🔧

Agentic Data Engineering Tutorialultimateinfoguide.com · How agentic workflows change pipeline design on Fabric

↗

⚠ Accuracy Disclaimer

This guide is verified against Microsoft Learn documentation and the Microsoft Fabric Community blog as of June 2026. The Fabric Data Agent is in public preview — features, limits, and authentication support will change. Always check the official documentation before any production deployment. UIG Data Lab is an independent publication, not affiliated with or endorsed by Microsoft Corporation.

A.J. Data Engineering Researcher & Technical Writer · UIG Data Lab All articles →

A.J. researches and writes about data engineering, analytics architecture, Microsoft Fabric, and modern cloud data platforms. Coverage spans Microsoft Fabric, Power BI, Azure Data Engineering, Databricks, Snowflake, Apache Spark, dbt, Apache Airflow, and modern cloud data infrastructure. The focus is practitioner-level content that helps data professionals understand platform capabilities, evaluate technology decisions, optimize costs, and implement practical solutions using official documentation, product updates, community insights, and industry best practices.

Microsoft Fabric AI Agents Real-Time Intelligence KQL Databricks Apache Spark dbt Azure Power BI Data Architecture