Copyright (c) 2026 MindMesh Academy. All rights reserved. This content is proprietary and may not be reproduced or distributed without permission.

2.4.4. Shared Storage: EFS and FSx Selection and Tuning

šŸ’” First Principle: EBS gives you a dedicated disk attached to one instance. But what if multiple instances need to share the same files? Shared file systems solve the concurrent access problem — multiple servers reading and writing to the same storage, with each seeing the same data in real time.

This matters for applications that weren't designed for distributed storage: content management systems storing user uploads, application servers sharing configuration files, or batch processing jobs reading from the same input dataset.

Amazon EFS (Elastic File System):
  • NFS protocol (Linux only)
  • Scales automatically to petabytes — no capacity planning
  • Multi-AZ by default (data replicated across AZs)
  • Thousands of concurrent connections from EC2, Lambda, ECS, EKS
EFS Performance Modes:
ModeUse CaseTradeoff
General PurposeDefault; web serving, content management, home directoriesBest latency; some IOPS limit
Max I/OBig data, media processing, >500 clientsHigher aggregate throughput; higher per-operation latency
EFS Throughput Modes:
ModeHow Throughput Works
Elastic (Recommended)Automatically scales up and down; pay per GB transferred
BurstingThroughput tied to storage size; earns credits when idle, spends during bursts
ProvisionedYou specify throughput regardless of storage size; use when you need consistent high throughput with small storage

EFS Lifecycle Policies move infrequently accessed files to the cheaper EFS Infrequent Access storage class. Files that haven't been accessed in 7, 14, 30, 60, or 90 days (you choose) are automatically moved. Files accessed again are moved back to standard storage.

Amazon FSx — When EFS Isn't Enough:
FSx VariantProtocolBest For
FSx for Windows File ServerSMBWindows applications, Active Directory integration, NTFS
FSx for LustreLustre (custom)High-performance computing, ML training, financial modeling — maximum throughput
FSx for NetApp ONTAPNFS, SMB, iSCSIHybrid cloud; existing NetApp workloads
FSx for OpenZFSNFSZFS features, data compression, deduplication
Key Decision Rule:

āš ļø Exam Trap: FSx for Lustre integrates natively with S3 — you can configure it to automatically load data from an S3 bucket and write results back to S3. This is the recommended pattern for HPC workloads processing data from S3. EFS does not have this native S3 integration.

Reflection Question: A media company runs Linux-based video transcoding on a cluster of 200 EC2 instances. Source videos are in S3. The cluster needs shared access to partially processed files with throughput exceeding 50 GB/s. Which storage service do you choose and why?

Alvin Varughese
Written byAlvin Varughese
Founder•15 professional certifications