The Amazon EC2 Inf2 instance family, generally available since April 13, 2023, is designed for deep learning inference and powered by AWS Inferentia2 chips. The instances pair these accelerators with dual AMD EPYC 7R13 host processors. At the top of the range they offer up to 100 Gbps of networking bandwidth and up to 384 GB of shared accelerator memory. Storage is EBS-only, so there are no local instance store volumes. Inf2 instances are optimized for generative AI models, with an emphasis on high throughput and low latency.
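To make the 384 GB figure more concrete, the following is a minimal sketch of a back-of-the-envelope check for whether a model's weights alone would fit in that memory. The function names, the 2-bytes-per-parameter default (16-bit weights), and the decimal definition of a gigabyte are assumptions made for illustration, not part of any AWS tool. The check ignores activations, caches, and runtime overhead, so it gives only a rough lower bound on memory needs.

```python
# Upper limit of shared accelerator memory quoted in the text above.
INF2_MAX_ACCELERATOR_MEMORY_GB = 384


def weights_size_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate size of the model weights in GB (assuming 1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def weights_fit(num_params: float,
                bytes_per_param: int = 2,
                memory_gb: float = INF2_MAX_ACCELERATOR_MEMORY_GB) -> bool:
    """Return True if the weights alone fit in the given accelerator memory.

    This is a necessary condition only: activations, caches, and runtime
    overhead are not counted.
    """
    return weights_size_gb(num_params, bytes_per_param) <= memory_gb


if __name__ == "__main__":
    # Hypothetical model sizes, chosen only to illustrate the arithmetic.
    for params in (7e9, 70e9, 175e9):
        size = weights_size_gb(params)
        print(f"{params / 1e9:.0f}B params at 16-bit: {size:.0f} GB, "
              f"fits: {weights_fit(params)}")
```

Under these assumptions, the same hypothetical 175-billion-parameter model that fits at 16-bit precision would not fit at 32-bit precision, which shows how strongly the result depends on the chosen weight format.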