Elastic Fabric Adapter
Run HPC and ML applications at scale
Elastic Fabric Adapter (EFA)
Elastic Fabric Adapter (EFA) is a network interface for Amazon EC2 instances that enables customers to run applications requiring high levels of inter-node communications at scale on AWS. Its custom-built operating system (OS) bypass hardware interface enhances the performance of inter-instance communications, which is critical to scaling these applications. With EFA, High Performance Computing (HPC) applications using the Message Passing Interface (MPI) and Machine Learning (ML) applications using NVIDIA Collective Communications Library (NCCL) can scale to thousands of CPUs or GPUs. As a result, you get the application performance of on-premises HPC clusters with the on-demand elasticity and flexibility of the AWS cloud.
EFA is available as an optional EC2 networking feature that you can enable on any supported EC2 instance at no additional cost. Plus, it works with the most commonly used interfaces, APIs, and libraries for inter-node communications, so you can migrate your HPC applications to AWS with little or no modifications.
Benefits
EFA Performance

EFA provides a 4X improvement in scaling over ENA for a standard CFD simulation as shown in the chart above.
Solver for this benchmarking provided by Metacomp Technologies

AWS Customer CFD Direct
How it works

Use cases
Resources

Now Available – Elastic Fabric Adapter (EFA) for Tightly-Coupled HPC Workloads

AWS re:Invent 2018: Scaling HPC Applications on EC2 w/ Elastic Fabric Adapter

Deep Dive on OpenMPI and Elastic Fabric Adapter (EFA)
