In-Depth Guide ยท Microsoft Fabric AI

Microsoft Fabric Data Agent โ€” Complete Guide (2026)

How to set up, configure, and deploy the Fabric Data Agent โ€” covering supported data sources, response limits, service principal authentication (preview), Copilot Studio and Microsoft Foundry integration, governance, and confirmed limitations. Verified against official Microsoft documentation through June 2026.

Quick Answer

The Microsoft Fabric Data Agent is a conversational AI interface that lets users ask natural language questions against Fabric data sources โ€” Lakehouse, Warehouse, KQL databases, and Power BI semantic models. You connect up to 5 data sources per agent. Responses are capped at 25 rows and 25 columns. The agent publishes an API endpoint callable from Copilot Studio, Microsoft Foundry, and custom apps. Requires F2 or higher Fabric capacity. Currently in public preview with service principal authentication added in June 2026.

๐Ÿ“… Last verified: June 2026 โฑ ~14 min read โœ๏ธ A.J., Data Engineering Researcher ๐Ÿ”— Source: Microsoft Learn
๐Ÿ”ถ
Public Preview

The Microsoft Fabric Data Agent is currently in public preview. Response results are capped at 25 rows and 25 columns. Service principal authentication was added as a preview feature in June 2026. Features and limitations will change as the product moves toward general availability. Verify current status at Microsoft Learn before any production deployment.

What Is the Microsoft Fabric Data Agent?

The Fabric Data Agent is a conversational AI layer built on top of your Fabric data sources. You configure it with data source connections and instructions, publish it, and users interact with it through a chat interface โ€” asking questions in plain English and receiving answers drawn directly from your governed Fabric data.

Under the hood, the agent uses Azure OpenAI Assistant APIs to parse each question, check user permissions, generate the appropriate SQL or KQL query, execute it against the connected sources, and format the result. Every step respects your existing row-level security, Purview governance policies, and workspace boundaries.

The agent is distinct from a dashboard or a report. It does not visualise data โ€” it answers questions about it. Think of it as a governed, permission-aware query interface that anyone on your team can use without SQL knowledge.

โ„น๏ธ
How It Sits Inside the Fabric AI Ecosystem

Fabric has two AI agent types that are easy to conflate. The Data Agent is reactive โ€” users ask questions, it queries data sources. The Operations Agent is proactive โ€” it monitors KQL databases on a 5-minute cycle and pushes recommendations to Teams. They solve different problems and can be used together in the same architecture.

Field note โ€” A.J., UIG Data Lab

The 25-row response cap is the most common source of confusion when teams first deploy the Data Agent. It is not a bug โ€” it is by design. The Data Agent is built for conversational insights, not data exports. If your use case requires returning full datasets, the right tool is a notebook, a pipeline, or a direct SQL endpoint โ€” not the Data Agent.

Fabric Data Agent โ€” Supported Data Sources

The Data Agent connects to data sources within your Fabric workspace. You can connect up to 5 sources per agent. Each source type has specific permissions and configuration requirements.

Data SourceWhat the Agent QueriesPermission RequiredSupported
LakehouseDelta tables via SQL analytics endpointRead access on the Lakehouseโœ… Yes
Data WarehouseTables and views via T-SQLRead access on the Warehouseโœ… Yes
KQL DatabaseTables via KQL queriesRead access on the KQL databaseโœ… Yes
Power BI Semantic ModelMeasures and tables via DAXRead (Write not required)โœ… Yes
External databasesNot directly accessibleMirror to Fabric firstโŒ No
Shortcut tablesDepends on shortcut typeVariesโš  Partial
SQL Database in FabricTables via T-SQLRead accessโœ… Yes
๐Ÿšจ
5-Source Limit Per Agent

Each Data Agent can connect to a maximum of 5 data sources. If your use case spans more than 5 sources, create multiple specialised agents โ€” one per domain โ€” rather than a single agent trying to do everything. Specialised agents also produce more accurate answers because the context is narrower.

Schema Quality Directly Impacts Answer Accuracy

For each connected source, the agent reads column names, table names, and any descriptions you provide. Ambiguous column names like col_a or val1 produce poor query generation. Descriptive names like net_revenue_usd or order_status_code produce reliable results. Time spent on schema documentation pays back in agent accuracy.

You can provide up to 100 example queries per data source. These are the highest-impact investment in agent quality โ€” real questions paired with correct expected answers teach the model how your business actually talks about its data.

How to Set Up the Fabric Data Agent

โš™๏ธ
Prerequisites
  • F2 or higher Fabric capacity (or P1 Premium with Fabric enabled). Trial capacities are not supported.
  • Admin or Member role in the Fabric workspace where the agent will live.
  • At least one Lakehouse, Warehouse, KQL database, or Power BI semantic model in the same workspace.
  • Copilot and AI features enabled at the tenant level by a Fabric admin.
  • Read access to all data sources you intend to connect.
  1. Open your Fabric workspaceNavigate to your workspace in the Fabric portal. Confirm your capacity is F2 or higher โ€” the Data Agent item will not appear on Trial capacities.
  2. Create a new Data Agent itemSelect + New item โ†’ Data Agent (under AI + Machine Learning). Give it a name that reflects the domain it covers, such as “Sales Analytics Agent” or “Finance Q&A Agent”.
  3. Connect data sourcesAdd up to 5 data sources. For each source, select specific tables rather than connecting entire databases โ€” this keeps the agent’s context tight and improves query accuracy.
  4. Add table and column descriptionsFor every table you connect, add a plain-English description of what it contains. For ambiguous columns, add inline descriptions. This is the single most impactful step for answer quality.
  5. Write agent instructionsDefine how the agent should interpret questions โ€” which source to prefer for which type of query, what business terms map to which columns, and any compliance constraints (e.g. never surface PII columns). See Section 04 for detail.
  6. Add example queries (up to 100 per source)Provide real business questions with the correct expected SQL or result format. Examples are the highest-signal training input โ€” use actual questions your team asks, not generic demos.
  7. Test in the chat interfaceAsk the agent your real questions. Check that table joins, filters, and metric calculations are correct. Refine instructions and examples based on failures before going further.
  8. Verify governance settingsConfirm that RLS filters on your data sources are working correctly. Test with accounts from different roles to confirm each user sees only their permitted data. Purview DLP policies are automatically enforced โ€” verify none will block expected queries.
  9. Publish the agentPublishing generates the API endpoint. This endpoint is what Copilot Studio, Microsoft Foundry, and custom applications call. Copy the workspace_id and artifact_id from the URL โ€” you’ll need them for integrations.
  10. Share accessGrant users Read permission on the agent item and on its underlying data sources. Both are required โ€” access to the agent alone is not sufficient.

Writing Effective Data Agent Instructions

Agent instructions are plain text that you provide during setup. They define how the agent interprets ambiguous questions, which source to prefer for which query type, and any business rules or compliance constraints the agent must follow.

What to Cover in Your Instructions

๐Ÿ“

Define Your Key Metrics

  • Revenue = net_revenue_usd (after returns and cancellations)
  • Margin = (net_revenue – cogs) / net_revenue ร— 100
  • Active customer = purchased within last 90 days
๐Ÿ—บ๏ธ

Map Source Priority

  • For current-month data: use the Lakehouse (refreshed hourly)
  • For historical trends: use the Warehouse (optimised for aggregations)
  • For pre-calculated KPIs: use the Power BI semantic model
๐Ÿท๏ธ

Define Synonyms

  • “Revenue” = net_revenue_usd
  • “Orders” = transactions table
  • “Customer” = unique_account_id
  • “SKU” = product_code
๐Ÿ”’

Set Compliance Rules

  • Never surface columns: email, phone, ssn, dob
  • Flag results where row count is below 10 (anonymisation risk)
  • Always round financial figures to two decimal places
โš ๏ธ
Instructions Are Not a Security Boundary

Agent instructions shape how the model interprets and responds to questions โ€” but they do not enforce data access. Real security comes from RLS on source tables, Purview DLP policies, and workspace permissions. Never rely on instructions alone to hide sensitive data. Enforce access at the data layer.

Field note โ€” A.J., UIG Data Lab

The most common instruction mistake is writing them too broadly โ€” something like “answer questions about sales data.” That tells the model nothing useful. The instructions that actually improve answer quality are specific: column-level definitions, source-priority rules for edge cases, and explicit handling of known ambiguities in your schema. Treat instructions the same way you would treat a data dictionary โ€” specific, precise, and reviewed by someone who knows the data.

Authentication, API Access & Service Principal Support

Authentication Model

The Data Agent enforces the calling user’s identity for data access. When a user asks a question, the agent queries the underlying sources using that user’s permissions โ€” so RLS and object-level security apply automatically based on who is asking, not who configured the agent.

Authentication TypeStatus (June 2026)Use Case
Delegated user token (interactive)โœ… GAIn-product chat, direct user sessions
Service principal authentication๐Ÿ”ถ PreviewBackend apps, automated workflows, Microsoft Foundry production deployments
Managed identityโŒ Not supportedโ€”
โœ…
Service Principal Support โ€” June 2026 Preview

Service principal authentication for the Data Agent was released as a preview feature in June 2026. Applications can now authenticate to the Data Agent using their own app identity โ€” no user sign-in required per request. This unlocks backend services, shared-backend multi-user apps, and production Microsoft Foundry deployments where requiring a delegated user token was a blocker.

Calling the Data Agent API

Once published, the Data Agent exposes an API endpoint. The URL contains two identifiers you copy from the agent’s URL in Fabric:

  • workspace_id โ€” the GUID of your Fabric workspace
  • artifact_id โ€” the GUID of the Data Agent item

The agent is callable from Copilot Studio as a plugin, from Microsoft Foundry as a tool using MicrosoftFabricAgentTool, and from custom applications using the Azure AI Projects SDK. SDKs are available for Python, TypeScript, and JavaScript. The Fabric Data Agent and the Foundry project must be in the same tenant.

Integrations โ€” Copilot Studio, Microsoft Foundry & Teams

Microsoft Foundry Integration

The most common production pattern uses the Fabric Data Agent as a tool inside a Microsoft Foundry agent. The Foundry agent handles multi-turn conversation, orchestration logic, and the user-facing experience. The Fabric Data Agent handles grounded data queries โ€” returning accurate, permission-aware answers from your enterprise Fabric sources.

During development, interactive browser authentication works. For production deployment to a web app or custom application, service principal authentication (preview, June 2026) is required โ€” the application authenticates with its own identity rather than routing every request through a user sign-in flow.

Copilot Studio Integration

You can register the published Data Agent endpoint as a plugin inside Copilot Studio. Users interact through Teams or a web interface โ€” Copilot Studio handles the conversation layer, and the Data Agent answers data-specific questions. This pattern works well for broad enterprise rollouts where the Data Agent covers one domain inside a larger company-wide assistant.

Teams Direct Access

For simpler deployments, the Data Agent can be accessed directly from within Fabric’s chat interface, shared as a workspace item, or embedded in Teams through Copilot Studio without building a custom application. This is the fastest path to getting the agent in front of users.

๐Ÿ“Œ
Improved Query Routing โ€” June 2026

The June 2026 Fabric update improved the Data Agent orchestrator’s ability to route queries to the correct data source, particularly in multi-source configurations. If you previously noticed the agent consistently querying the wrong source for certain question types, updating your instructions to reflect source priority is worth revisiting after this update.

Fabric Data Agent vs Operations Agent

These two agent types are often confused because both carry the Fabric “agent” label. They solve completely different problems.

๐Ÿง  Data Agent
ModeReactive โ€” user asks a question, agent answers
SourcesLakehouse, Warehouse, KQL, Power BI semantic models (up to 5)
TriggerUser query (on demand)
OutputText answer with data (max 25 rows, 25 columns)
InterfaceChat (Fabric, Copilot Studio, Foundry, Teams)
Primary userBusiness users, analysts, anyone with a question
ActionsNone โ€” answers only, no automated execution
โšก Operations Agent
ModeProactive โ€” monitors data, pushes alerts without being asked
SourcesKQL databases in Eventhouse only
Trigger5-minute polling cycle (continuous)
OutputTeams notification with recommended action
InterfaceMicrosoft Teams (Fabric Operations Agent app)
Primary userDevOps, SRE, Ops teams who need monitoring
ActionsPower Automate flows triggered on human approval

Use both together when your use case includes both reactive analytics and proactive monitoring. The Operations Agent fires an alert when an anomaly is detected; the Data Agent helps the team investigate by answering follow-up questions about the underlying data.

Fabric Data Agent โ€” Confirmed Limitations (June 2026)

These are confirmed, current constraints from official Microsoft documentation. Plan around them before deployment.

25 row / 25 column capEvery response is capped at 25 rows and 25 columns. This is by design โ€” the agent is for conversational insights, not full dataset export.
5 data sources per agentMaximum 5 connected sources per agent item. For broader coverage, create multiple specialised agents.
100 example queries per sourceYou can provide up to 100 example Q&A pairs per data source. These are the most impactful quality input available.
Service principal: preview onlyService principal authentication was added as a preview feature in June 2026. Not yet generally available โ€” validate thoroughly before production use.
No external database accessOnly Fabric-native sources are supported. External databases must be mirrored into Fabric first before the agent can query them.
Chat history affects resultsPrevious messages in the same session influence subsequent responses. If results are unexpectedly limited (e.g. from an earlier 25-row cap), start a new chat session.
Purview DLP can block queriesIf Purview Data Loss Prevention or access restriction policies apply to a source, the agent may return truncated or blocked results. Test DLP interaction before rollout.
Same tenant requirementFor Foundry integration, the Fabric Data Agent and the Foundry project must be in the same Microsoft tenant. Cross-tenant setups are not supported.
F2 minimum capacityTrial capacities are not supported. Requires F2 or higher, or Power BI Premium P1 or higher with Fabric enabled.
Conversational, not analyticalThe agent is not a replacement for notebooks, SQL endpoints, or BI tools for heavy analytical work. It is designed for conversational question-and-answer interactions.
โš ๏ธ
Still in Public Preview

The Data Agent is not generally available. Feature behaviour, response limits, and authentication support will change as the product evolves. Check the official concept page and the setup guide before any production deployment.

Fabric Data Agent FAQ

The Fabric Data Agent supports Lakehouse tables, Data Warehouse tables, KQL databases, Power BI semantic models, and SQL Databases in Fabric โ€” all within the same Fabric workspace. You can connect up to 5 sources per agent. External databases must be mirrored into Fabric first. Shortcut tables have partial support depending on the shortcut type.
Responses are capped at 25 rows and 25 columns. This is by design โ€” the agent is built for conversational insights, not full dataset retrieval. If you need complete data, use a notebook, a SQL endpoint, or a pipeline. Previous chat history can also affect what rows are returned in follow-up questions, so starting a new session is recommended when switching topics.
Service principal support was added as a preview feature in June 2026. Previously, only delegated user token (interactive sign-in) was supported. The preview enables backend applications to authenticate using their own app identity โ€” which is required for production Microsoft Foundry deployments, backend services, and shared-backend multi-user applications.
The Data Agent is reactive โ€” users ask it questions and it queries your data sources to answer. It supports Lakehouse, Warehouse, KQL, and Power BI sources. The Operations Agent is proactive โ€” it monitors KQL databases every 5 minutes, detects conditions you define, and sends recommendations to Microsoft Teams for human approval. They serve different purposes and work well alongside each other.
You need F2 or higher Fabric capacity, or Power BI Premium P1 or higher with Fabric enabled on that capacity. Trial capacities are explicitly not supported. The Data Agent consumes Capacity Units (CUs) for both AI processing and data queries โ€” track usage in the Microsoft Fabric Capacity Metrics app.
Yes. The Data Agent enforces RLS defined on the underlying source tables, object-level security (users can only query tables they have permission to see), and Purview DLP policies. All data access runs under the calling user’s identity โ€” not the agent creator’s. This means each user automatically sees only their permitted data, and no special configuration is needed to apply existing security models.
Yes. The Fabric Data Agent can be used as a tool inside a Microsoft Foundry agent using MicrosoftFabricAgentTool. The Foundry agent handles conversation orchestration while the Fabric Data Agent provides grounded, permission-aware answers from your Fabric data. The Data Agent and the Foundry project must be in the same tenant. For production deployments, service principal authentication (preview, June 2026) is required.
โš  Accuracy Disclaimer

This guide is verified against Microsoft Learn documentation and the Microsoft Fabric Community blog as of June 2026. The Fabric Data Agent is in public preview โ€” features, limits, and authentication support will change. Always check the official documentation before any production deployment. UIG Data Lab is an independent publication, not affiliated with or endorsed by Microsoft Corporation.

A.J. Data Engineering Researcher & Technical Writer ยท UIG Data Lab All articles โ†’
A.J. researches and writes about data engineering, analytics architecture, Microsoft Fabric, and modern cloud data platforms. Coverage spans Microsoft Fabric, Power BI, Azure Data Engineering, Databricks, Snowflake, Apache Spark, dbt, Apache Airflow, and modern cloud data infrastructure. The focus is practitioner-level content that helps data professionals understand platform capabilities, evaluate technology decisions, optimize costs, and implement practical solutions using official documentation, product updates, community insights, and industry best practices.
Microsoft Fabric AI Agents Real-Time Intelligence KQL Databricks Apache Spark dbt Azure Power BI Data Architecture

Scroll to Top