Once a file is cached, it is served directly out of the cache to your compute instances or containers with consistent sub-millisecond latencies, up to hundreds of Gbps of throughput, and up to millions of IOPS. Your cache has a baseline throughput capacity of 1,000 MB/s per TiB of cache storage. If the requested data is not cached, it is copied to the cache from the linked data repository at speeds up to the baseline throughput capacity.
The cache’s throughput capacity is shared between clients accessing the cache and data movement between the cache and data repositories. For example, a 4.8 TiB cache with 4.8 GB/s of throughput can load files from on-premises file servers over a 10 Gbps (1.25 GB/s) connection while simultaneously supporting 3.55 GB/s of I/O from clients.
The rate at which Amazon File Cache copies files from your on-premises file server also depends on the bandwidth and roundtrip latency of your AWS Direct Connect/VPN link and the throughput supported by your on-premises file server. For the best performance, AWS Direct Connect is recommended.
See the Amazon File Cache Performance documentation for more details.