Platform Overview · June 2026

Microsoft Fabric Overview — Complete 2026 Guide

Architecture, OneLake, all eight workloads, Fabric IQ GA, GPU-accelerated Warehouse, storage tiers, Fabric Apps, capacity model, and everything that changed through June 2026 — all verified against Microsoft Learn and official build announcements.

Quick Answer

Microsoft Fabric is a unified, SaaS-based analytics platform that combines Data Engineering, Data Factory, Data Warehouse, Real-Time Intelligence, Data Science, Power BI, Fabric IQ, and Fabric Apps under one capacity bill and one storage layer — OneLake. All data persists in open Delta Parquet format, enabling Power BI’s Direct Lake mode to query data at import speed with zero data movement. As of June 2026, Fabric has 35,000 paid customers with 60% YoY revenue growth, Fabric IQ reached GA at Build 2026, GPU-accelerated Data Warehouse is available (up to 7× faster, SIGMOD 2026 Best Industry Paper), and OneLake storage tiers (Hot/Cool/Cold) with lifecycle management are in Preview.

📅 Last verified: June 2026 ⏱ ~16 min read ✍️ A.J., Data Engineering Researcher & Technical Writer 🔗 Source: Microsoft Learn

Microsoft Fabric Overview – The Eight Fabric Workloads

Every Fabric workload shares the same OneLake storage, the same capacity billing pool, the same Entra ID identity, and the same Purview governance surface. No data movement between workloads is required.

🔀 Data Factory Pipelines (ADF-successor), Dataflow Gen2, Copy Job, dbt Job, Apache Airflow Jobs, and the ADF Migration Tool — the complete data integration surface. Read guide 🏞️ Data Engineering (Lakehouse) Delta tables in OneLake, Spark Runtime 1.3, Materialized Lake Views, Starter Pools, High Concurrency mode, and the SQL analytics endpoint. Start tutorial 🏛️ Data Warehouse Serverless distributed T-SQL with full ACID transactions. GPU-accelerated as of Build 2026 — up to 7× faster, SIGMOD 2026 Best Industry Paper. Deep dive 🔄 Dataflow Gen2 Visual Power Query over 200+ connectors. Writes Delta tables to OneLake via staged compute. Mapping Data Flow transformations now available inside Dataflow Gen2 (June 2026). Learn dataflows 🔁 Data Mirroring Near-real-time, zero-ETL replication from Azure SQL, Cosmos DB, Snowflake, and Databricks Unity Catalog into OneLake Delta tables. CDC for SQL estates now GA. Setup mirroring Real-Time Intelligence Eventstream, Eventhouse (KQL), Real-Time Hub, Data Activator, and Business Events. Sub-second streaming analytics without Spark cluster management. Stream data 🔬 Data Science PySpark and Scala notebooks, MLflow (cross-workspace experiment tracking GA), Spark ML, and integration with Azure Machine Learning and Foundry. Read guide 📊 Power BI Semantic models, Direct Lake mode, Copilot for report authoring, and default datasets auto-created from every Lakehouse and Warehouse. Optimize BI 🧠 Fabric IQ GA Operations Agents GA, Graph GA, Planning GA, Data Agent GA, and Ontology (Preview) — the shared context layer for AI agents and real-time intelligence. Full guide

The Paradigm Shift — Why Fabric

For the previous decade, data teams assembled “modern data stacks” by connecting separate PaaS services: Azure Data Factory for orchestration, Databricks for engineering, Synapse Analytics for warehousing, and Power BI for reporting. Each service had its own storage, its own security model, and its own billing. Data moved between them constantly — each hop creating copies, latency, and cost.

Fabric’s core architectural premise is that this fragmentation is the problem, not the individual tools. The platform is SaaS-delivered — Microsoft manages compute scaling, runtime upgrades, and infrastructure. Teams provision a Capacity (a pool of Compute Units) shared across all workloads, and all data lands in OneLake — once, in open Delta Parquet format, accessible to every engine without copying.

The OneCopy Principle

In a legacy stack, the same dataset physically exists multiple times: raw in ADLS, cleaned in Databricks, loaded into Synapse, imported into Power BI. That is four storage bills, four synchronisation jobs, four points of failure, and up to 24 hours of data latency by the time Power BI reports on it.

In Fabric: data lands in OneLake once. The Spark notebook cleans it in place. The SQL analyst queries the same file via T-SQL. Power BI reads that same file via Direct Lake. Zero movement. Zero duplication. The Fabric vs. Azure Synapse comparison covers the full technical and cost implications of this shift.

Legacy PaaS StackMicrosoft Fabric (SaaS)
Separate storage per service (ADLS, Databricks, Synapse, Power BI)Single OneLake — all engines read the same Delta Parquet files
Multiple billing meters — each service invoiced separatelyOne Capacity Unit (CU) pool shared by all workloads
Separate security models per service — permissions duplicatedUnified Entra ID identity and Purview governance across all workloads
Data movement between every layer — cost and latencyNo movement — compute comes to the data
CI/CD and Git per service — fragmented deploymentUnified Git integration (Azure Repos, GitHub) across all Fabric items
Field note — A.J., Data Engineering Researcher

The SaaS vs. PaaS framing matters most when it comes to runtime upgrades and platform incidents. In a PaaS stack, a Databricks runtime update is something the team controls and schedules. In Fabric, Microsoft manages it — which is faster for most teams and a governance concern for regulated industries that pin production workloads to specific runtime versions. For the latter, Databricks’ LTS runtime track is still the more appropriate choice. The honest answer is that Fabric’s SaaS model is a net advantage for 80% of enterprise teams and a genuine trade-off for the 20% with strict runtime stability requirements.

OneLake — The Storage Foundation

OneLake is the single, logical data lake for every Fabric tenant. It is built on top of ADLS Gen2 and supports all ADLS Gen2 APIs and SDKs — Azure Databricks, custom ADLS applications, and existing tooling connect without modification. The hierarchy is Tenant → Domain → Workspace → Item. Every workspace appears as a container; every Lakehouse, Warehouse, and Eventhouse appears as a folder within that container.

All Fabric items store tabular data automatically in Delta Parquet format. A Spark engineer writes to a Lakehouse. A SQL developer writes to a Warehouse. Both are writing Delta Parquet files to the same OneLake — contributing to the same unified data lake without any explicit integration between them.

Shortcuts — Virtualise External Data

Shortcuts link external storage (AWS S3, Google Cloud Storage, other ADLS Gen2 accounts, or other Fabric Lakehouses) into the OneLake namespace. To all Fabric compute engines, shortcut data appears local. There are no egress fees for reads within the Fabric compute surface, and no ETL job required to make external data queryable. The OneLake shortcut authorization fix guide covers the most common permission issues when setting up cross-workspace shortcuts.

OneLake Storage Tiers — June 2026 Preview

OneLake now supports three storage tiers, matching the Azure Storage hot/cool/cold model. As data volumes grow — particularly Bronze compliance archives — the ability to tier historical data automatically cuts storage costs without changing pipeline or query logic.

TierOptimised ForWhen to UseCost Trade-Off
Hot DefaultFrequently accessed dataActive Delta tables — Gold layer, Silver layer, any data queried by Spark, SQL endpoint, or Direct LakeStandard storage, lowest transaction cost
Cool PreviewInfrequently accessed dataHistorical partitions accessed monthly or less — older Bronze partitionsLower storage cost, higher transaction and retrieval cost than Hot
Cold PreviewLong-term compliance archivesData accessed less than once per year — regulatory retention archivesLowest storage cost, highest transaction and retrieval cost
Lifecycle management policies move files between tiers automatically based on creation date, last modified date, or last access date. Rules run in the background — no manual intervention after configuration. Source: Microsoft Learn — OneLake storage tiers. Critical: exclude _delta_log/ directories from any cooling rule — Delta transaction log files are read on every table open.

Delta and Iceberg — Open Formats

Fabric defaults to Delta Lake with V-Order write-time optimisation — a sort within Parquet files that dramatically improves read performance across all Fabric engines. As of 2026, OneLake also supports Apache Iceberg natively, enabling bidirectional access with Snowflake-managed Iceberg tables and interoperability with other Iceberg-native platforms. The Delta Lake vs. Iceberg comparison explains the trade-offs for teams evaluating both.

Compute Architecture

All Fabric compute is serverless — engines start on demand, process data in OneLake, and release resources when done. There are no VMs to manage, no cluster sizes to configure, no distribution keys to design. The three compute engines serve different workload types.

Polaris — Spark Runtime

  • Fabric Runtime 1.3 — Spark 3.5, Delta Lake 3.x
  • Starter Pools: 5–15 second cold start
  • High Concurrency mode (January 2026): up to 5 Lakehouse jobs share one Spark session
  • V-Order write-time optimisation enabled by default
  • Used by Notebooks, Spark Job Definitions, and Dataflow Gen2
🏛️

Polaris — SQL Engine

  • Distributed, serverless T-SQL — no distribution keys
  • Full ACID transactions and multi-table writes
  • GPU-accelerated as of Build 2026 — NVIDIA compute, up to 7× faster
  • SIGMOD 2026 Best Industry Paper for the underlying CoddSpeed research
  • Used by Data Warehouse and Lakehouse SQL analytics endpoint
⚙️

VertiPaq — Power BI Engine

  • Column-store in-memory engine behind Power BI semantic models
  • Direct Lake mode: reads Delta Parquet from OneLake directly without import
  • Shares the same F-SKU capacity pool as ETL and warehouse workloads
  • Sub-second interactive query performance on billions of rows when framing limits are not exceeded

For the Spark vs. SQL architectural decision — when to use a Lakehouse and when to use a Warehouse — the Lakehouse vs. Data Warehouse best practices guide covers the full decision framework. For Spark performance optimisation — shuffle tuning, partition strategy, broadcast joins — see the Spark shuffle and partitions guide.

Direct Lake — The Power BI Revolution

Direct Lake is the connection mode that makes Power BI in Fabric different from every prior approach. It allows the VertiPaq engine to load Delta Parquet columns from OneLake directly into memory on demand — without an import job, without a scheduled refresh, without a data copy. Reports reflect data as soon as the upstream ETL or Materialized Lake View refresh writes new files.

This closes the historical trade-off between Import mode (fast but stale) and DirectQuery (live but slow). Direct Lake delivers import-level performance on datasets of any size, including billions of rows, provided the model stays within the F-SKU’s memory framing limit.

🚨
Direct Lake Fallback — Silent Performance Trap

Direct Lake silently falls back to DirectQuery when: unsupported DAX functions are used, the semantic model exceeds the SKU framing limit, OneSecurity (OneLake data access roles preview) is enabled on the Lakehouse, or relationship cardinality is unsupported. When fallback occurs, report users see no error — only a performance degradation that looks like a slow report. Diagnose via Power BI Desktop Performance Analyzer — look for DirectQuery storage engine events in the log. See the Direct Lake fallback fix guide for full diagnostics.

For teams migrating from Power BI Premium P-SKUs, the Power BI Premium to Fabric migration guide covers SKU mapping, deadlines, and step-by-step migration methods. For Direct Lake performance optimisation aligned to the DP-600 exam, see the DP-600 Direct Lake optimisation guide.

Fabric IQ — The Shared Context Layer GA — Build 2026

Fabric IQ reached General Availability at Microsoft Build 2026 (June 2, 2026). It is the shared context layer that gives AI agents, analysts, and automated systems a consistent understanding of what the data means — not just where it is stored.

Fabric IQ ComponentStatusWhat It Does
Operations AgentsGAMonitor live data, apply Ontology business context, execute automated actions autonomously
Graph in FabricGARelationship-first modelling engine mapping connections between business entities and systems
Planning in FabricGA — June 2026Agent reasoning and multi-step task decomposition layer
Fabric Data AgentGAConversational NL → SQL/DAX/KQL over governed OneLake sources (up to five per agent)
OntologyPreviewBusiness entity, relationship, rule, and action definitions mapped to live OneLake data
Source: Microsoft Build 2026 Azure blog · Fabric IQ Build 2026 blog

Fabric IQ is also part of Microsoft IQ — the unified intelligence layer launched at Build 2026, combining Work IQ (M365 workplace context), Foundry IQ (enterprise knowledge retrieval), Fabric IQ Ontology (business semantics), and Web IQ (live web grounding). The full deep-dive is in the Microsoft Fabric IQ complete guide.

For AI and RAG patterns on top of OneLake data — building conversational interfaces grounded in enterprise data using Azure OpenAI, LangChain, or Semantic Kernel — see the Fabric RAG tutorial.

June 2026 — What Changed at Build & in the June Release

Build 2026 (June 2–3, 2026) and the June 2026 Feature Summary (June 2, 2026) together represent the largest single Fabric release in 2026. Key changes by workload:

June 2026 · Build 2026 GA Fabric IQ — Operations Agents, Graph, Planning reach GA

The shared context layer for AI agents is now production-ready. Operations Agents GA, Graph GA, Planning GA (later in June). Ontology remains in Preview, expected GA in coming months.

June 2026 · Build 2026 GPU-Accelerated Data Warehouse

NVIDIA GPU compute built into Fabric Data Warehouse. Up to 7× faster for reporting and application workloads vs. comparable vendors. SIGMOD 2026 Best Industry Paper for the underlying CoddSpeed research. UNC Health reports 5× query speed improvement in early customer testing.

June 2026 · Build 2026 Fabric Apps with Rayfin SDK — Preview

New workload: build and run full application backends directly on Fabric using the Rayfin SDK (built in partnership with Replit). App data lands in OneLake automatically, available to the full Fabric analytics stack. Static hosting from OneLake included. Perpetual free tier announced.

June 2026 · June Feature Summary OneLake Storage Tiers — Preview

Hot/Cool/Cold tiers with lifecycle management policies. Automatically moves historical Bronze data to lower-cost tiers based on creation date, last modified, or last access. Excludes Delta transaction logs from cooling rules — critical to avoid performance regressions.

June 2026 · June Feature Summary Data Agent Updates — Service Principals, Preview Runtime (NL2SQL v2), Code Interpreter

Service principal authentication for Data Agents (Preview) enables backend and automated workflow integrations. Preview Runtime with improved NL2SQL and higher query limits. Code Interpreter adds Python execution to agent responses for forecasting and statistical analysis. GPT 5.X model upgrades — ~20% accuracy improvement.

June 2026 · Build 2026 Data Factory — Mapping Data Flows in Dataflow Gen2, dbt Job API

ADF Mapping Data Flow transformations now available inside Dataflow Gen2 (week of June 8). dbt Job API for programmatic CI/CD control. ADF Migration Tool now includes MDF asset migration support. Variable Library connection and item references GA.

June 2026 · June Feature Summary Pipeline — Approval Activity, Conditional Retries, Copy Job Activity GA

Approval Activity (Preview) adds human-in-the-loop governance gates to pipelines. Conditional Retries (Preview) enable targeted failure recovery. Copy Job Activity reached GA in April 2026. Outbound access protection for all Data Factory items reached GA.

June 2026 · June Feature Summary Real-Time Intelligence — Business Events, Workspace Monitoring

Business Events introduces a unified pub-sub model for routing raw signals into business outcomes across the platform. Workspace Monitoring for Eventstream items via Eventhouse. Eventhouse SQL endpoint CI/CD support, time travel via SQL endpoint (both Preview).

June 2026 · June Feature Summary GitHub Enterprise Cloud with Data Residency — Git Integration GA

Fabric Git integration now supports ghe.com Enterprise Cloud instances with data residency requirements. Fabric items version-controlled within specific geographic boundaries for regulatory compliance.

June 2026 · June Feature Summary MLflow Cross-Workspace Logging — Data Science GA

Cross-workspace MLflow logging for experiments and models enables end-to-end MLOps from development through test to production using standard MLflow APIs. Supports importing ML assets from Azure Databricks, Azure Machine Learning, and any MLflow-compatible platform.

Capacity, Licensing & Economics

Fabric billing is based on a single metric: Capacity Units (CUs). You purchase a Fabric Capacity (F2 through F2048) through an Azure subscription, and all workloads — Spark notebooks, SQL warehouse queries, Power BI report rendering, pipeline runs, Real-Time Intelligence — draw from the same CU pool. This is the fundamental billing difference from the legacy stack, where each service had its own meter.

Smoothing and Bursting

Fabric allows bursting — consuming more CUs than the purchased capacity during short spikes — provided there is idle headroom available to “pay back” the burst within a 24-hour smoothing window. Interactive operations (loading a report, running a query) receive higher priority than background operations (pipeline refresh, Spark job) during CU contention. If the smoothing window is consistently exceeded, interactive operations are throttled. The Fabric capacity optimisation guide covers burst management, monitoring, and pause/resume scheduling in detail.

P-SKU Retirement — Non-EA Customers Passed End-of-Life

🔴
P-SKU End-of-Life — Action Required

Non-EA customers reached end-of-life on January 1, 2026. EA customers must migrate before their EA term ends, or by January 1, 2028 — whichever is earlier. After subscription ends: 30 days free equivalent capacity, then 90 days of throttled access, then data suspension. F64 is the functional equivalent of P1 — purchased through the Azure portal, not the Power BI admin portal. See the Power BI Premium to Fabric migration guide for SKU mapping, cost analysis, and step-by-step migration paths.

For cost estimation and workload-specific modelling, the Microsoft Fabric pricing calculator provides scenario-based estimates. For production stability and uptime strategy, see the production stability review.

Governance, Security & Domains

Fabric governance is built on Microsoft Entra ID for identity and Microsoft Purview for cataloguing, lineage, and sensitivity labels. The same Purview sensitivity labels applied in Microsoft 365 inherit automatically to Fabric items — no additional configuration.

Security Layers

LayerMechanismScope
Workspace RBACAdmin, Member, Contributor, Viewer roles via Entra IDControls who can edit Fabric items, notebooks, and pipelines in a workspace
Item permissionsReadAll, ReadData, Build permissions per itemGrant access to a specific Lakehouse or Report without exposing the full workspace
OneLake folder securityFolder-level access control within OneLake pathsSub-folder access for teams with different Bronze/Silver/Gold ownership boundaries
SQL Row-Level SecurityRLS defined in SQL analytics endpoint or semantic modelFilter rows based on user identity — regional data access, tenant isolation
Column maskingDynamic data masking in SQL analytics endpointMask PII columns from non-admin roles without changing ETL output
Customer-managed keysGA — OneLake encrypted with customer-owned Azure Key Vault keysRegulated industries requiring control over encryption key lifecycle
Outbound access protectionGA (June 2026) for all Data Factory itemsWorkspace-level control over which external endpoints pipelines can reach

Domains

Fabric Domains (Sales, Finance, HR, Operations) federate governance — business units manage their own data products while IT maintains central oversight. Data can be certified to signal trustworthiness to downstream consumers. The Fabric data governance tutorial covers the full domain, lineage, and sensitivity label configuration.

Migrating from Azure Synapse to Fabric

Fabric is the designated successor to Azure Synapse Analytics. Synapse remains supported, but all new capabilities — Fabric IQ, GPU warehouse, Direct Lake, Mirroring, Real-Time Intelligence, Fabric Apps — are Fabric-only. The migration decision for most teams is not whether to migrate, but when and in what order.

Synapse ComponentFabric EquivalentMigration Complexity
Serverless SQL PoolsLakehouse SQL analytics endpointLow — T-SQL compatible; Delta Parquet replaces Parquet-on-ADLS
Dedicated SQL PoolsFabric Data WarehouseMedium — T-SQL is compatible; no distribution keys to manage in Fabric; review partition strategies
Spark PoolsFabric Notebooks + Spark Job DefinitionsLow-Medium — PySpark code largely compatible; Fabric Runtime 1.3 (Spark 3.5)
ADF / Synapse PipelinesFabric Data Factory PipelinesLow — ADF Migration Tool (Preview, March 2026) migrates pipelines with no rewrites; MDF asset migration included (June 2026)
Power BI Premium datasetsDirect Lake semantic models on Fabric LakehouseMedium — Import mode replaced by Direct Lake; RLS moves to semantic model level
The full technical and cost comparison is in the Fabric vs. Azure Synapse full comparison.

Frequently Asked Questions

Yes. Fabric is the strategic successor to Azure Synapse Analytics. Azure Synapse remains supported but receives no new feature development — all innovation is in Fabric. Fabric provides faster Spark startup, unified OneLake storage, Fabric IQ, GPU-accelerated Warehouse, Git integration, and a single capacity billing model that Synapse does not offer. The ADF Migration Tool (Preview, March 2026) migrates ADF and Synapse pipelines to Fabric Data Factory with no pipeline rewrites required. Mapping Data Flow asset migration is included from the week of June 8, 2026. The full technical comparison is in the Fabric vs. Synapse guide.
Not necessarily. OneLake Shortcuts virtualise data from AWS S3, Google Cloud Storage, or existing ADLS Gen2 accounts into the Fabric namespace — no data movement, no egress charges. Data Mirroring replicates operational databases (Azure SQL DB, Cosmos DB, Snowflake, Databricks Unity Catalog) as live Delta tables in OneLake without custom ETL — CDC for SQL estates is now GA. To access Power BI Direct Lake mode — the primary performance advantage of Fabric for BI — data must be in Delta Parquet format within OneLake, either written by Fabric workloads or landed via Mirroring or Shortcuts.
F-SKUs (Fabric Capacity) are Azure pay-as-you-go capacity reservations purchased through the Azure portal. They support all eight Fabric workloads — Data Engineering, Data Factory, Data Warehouse, Real-Time Intelligence, Data Science, Power BI, Fabric IQ, and Fabric Apps. P-SKUs (Power BI Premium per capacity) are legacy Office 365 licences that only support Power BI workloads. Non-EA P-SKU customers reached end-of-life on January 1, 2026. EA customers must migrate by their EA term end or by January 1, 2028 — whichever is earlier. F64 is functionally equivalent to P1 for all Power BI features and additionally unlocks all Fabric workloads. F-SKUs also support pause and resume billing. See the Power BI Premium to Fabric migration guide for full SKU mapping and deadlines.
Yes. Fabric integrates with Microsoft Entra ID for identity, Microsoft Purview for data governance and lineage, and supports Private Links, VNET-compatible configurations, workspace RBAC, item-level permissions, OneLake folder security, row-level security, column masking, and customer-managed encryption keys (GA). Sensitivity labels from Microsoft 365 Purview inherit automatically to Fabric items. Outbound access protection for all Data Factory items reached GA in June 2026. GitHub Enterprise Cloud with data residency for Fabric Git integration is also GA. For the full governance architecture, see the Fabric data governance tutorial.
Fabric Apps is a new Fabric workload announced at Build 2026 (Preview, June 2026) that lets developers build and run full application backends directly on Fabric using the Rayfin SDK, built in partnership with Replit. Applications deployed via Rayfin store data directly in OneLake, making it immediately available to Lakehouses, Warehouses, semantic models, and AI engines — without separate ETL integration. Frontend assets are hosted via static hosting from OneLake. The app backend (API routing, Entra SSO authentication) does not incur separate CU charges. A perpetual free tier is planned. For teams building enterprise apps on top of Fabric data, this is the path from data to application without leaving the Fabric governance boundary.
⚠ Accuracy Disclaimer

Platform descriptions and feature status are based on official Microsoft Learn documentation, the Fabric June 2026 Feature Summary (June 2, 2026), and the Microsoft Build 2026 Azure blog. Preview feature timelines are subject to change. Verify current status at learn.microsoft.com/fabric and the Fabric Updates Blog. UIG Data Lab is an independent publication, not affiliated with or endorsed by Microsoft Corporation.

A.J. Data Engineering Researcher & Technical Writer · UIG Data Lab
A.J. researches and writes about data engineering, analytics architecture, Microsoft Fabric, and modern cloud data platforms. Coverage spans Microsoft Fabric, Power BI, Azure Data Engineering, Databricks, Snowflake, Apache Spark, dbt, Apache Airflow, and modern cloud data infrastructure. The focus is practitioner-level content that helps data professionals understand platform capabilities, evaluate technology decisions, optimise costs, and implement practical solutions using official documentation, product updates, community insights, and industry best practices. His writing covers real decisions from real deployments — not documentation rewrites.
Microsoft Fabric OneLake Direct Lake Fabric IQ Power BI Delta Lake Apache Spark Azure Data Engineering SaaS Analytics

Scroll to Top