Copyright (c) 2026 MindMesh Academy. All rights reserved. This content is proprietary and may not be reproduced or distributed without permission.
2.2.1. Choosing the Right Data Store
💡 First Principle: Fabric offers multiple data stores optimized for different workloads. Choosing correctly impacts performance, cost, and query capabilities.
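Scenario: You need to store: (1) raw CSV files from external systems, (2) transformed relational data for SQL analysts, (3) real-time sensor data for operational dashboards. Each requires a different store: the raw files belong in OneLake file storage, the relational data in a Data Warehouse, and the sensor data in a KQL Database.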
Data Store Selection Guide
| Data Store | Best For | Query Language | Storage Format |
|---|---|---|---|
| Lakehouse | Big data, data science, flexible schema | Spark SQL, PySpark; T-SQL (read-only, via the SQL analytics endpoint) | Delta + Files |
| Data Warehouse | Structured analytics, SQL analysts | T-SQL | Delta |
| KQL Database | Real-time analytics, time-series | KQL | Optimized columnar |
| OneLake Files | Raw file storage, staging | N/A (file access) | Any |
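The selection guide can be read as a short decision procedure. The sketch below is illustrative only: `Workload`, its fields, and `choose_store` are hypothetical names invented for this example, not part of any Fabric API, and the rule order (real-time first, then raw files, then SQL-first) is an assumption about how to break ties.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Workload:
    """Hypothetical description of a workload; the fields mirror the table's 'Best For' column."""
    raw_files: bool = False   # unprocessed files (CSV, JSON, images) awaiting staging
    real_time: bool = False   # streaming or time-series data queried with low latency
    sql_first: bool = False   # structured data consumed mainly by T-SQL analysts


def choose_store(workload: Workload) -> str:
    """Return the data store the selection guide suggests for this workload."""
    if workload.real_time:
        return "KQL Database"
    if workload.raw_files:
        return "OneLake Files"
    if workload.sql_first:
        return "Data Warehouse"
    # Default: big data, data science, or flexible-schema work
    return "Lakehouse"


# The three needs from the scenario, expressed as workloads
scenario = {
    "raw CSV files": Workload(raw_files=True),
    "relational data for SQL analysts": Workload(sql_first=True),
    "real-time sensor data": Workload(real_time=True),
}

for need, workload in scenario.items():
    print(f"{need} -> {choose_store(workload)}")
```

Real workloads rarely reduce to three flags, so treat the function as a memory aid for the table rather than a rule engine.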
Visual: Data Store Selection
⚠️ Common Pitfall: Storing structured data as files when a lakehouse table is more appropriate. Files require manual schema management; Delta tables provide schema enforcement, ACID transactions, and SQL access.