Intel 5th Gen Xeon "Emerald Rapids" AVX-512 Performance

Written by Michael Larabel in Processors on 5 January 2024 at 10:35 AM EST. Page 4 of 5. 18 Comments.
TensorFlow benchmark with settings of Device: CPU, Batch Size: 16, Model: ResNet-50. Emerald Rapids: AVX-512 On was the fastest.
TensorFlow benchmark with settings of Device: CPU, Batch Size: 16, Model: ResNet-50. Emerald Rapids: AVX-512 On was the fastest.
TensorFlow benchmark with settings of Device: CPU, Batch Size: 16, Model: ResNet-50. Emerald Rapids: AVX-512 On was the fastest.
TensorFlow benchmark with settings of Device: CPU, Batch Size: 64, Model: ResNet-50. Emerald Rapids: AVX-512 On was the fastest.
TensorFlow benchmark with settings of Device: CPU, Batch Size: 64, Model: ResNet-50. Emerald Rapids: AVX-512 On was the fastest.
TensorFlow benchmark with settings of Device: CPU, Batch Size: 64, Model: ResNet-50. Emerald Rapids: AVX-512 On was the fastest.

While AVX-512 started off with much criticism over power and thermal implications, the latest Intel (and AMD) server processors with AVX-512 continue showing off very meaningful gains and without those early pain points.

OpenVINO benchmark with settings of Model: Face Detection FP16, Device: CPU. Emerald Rapids: AVX-512 On was the fastest.
OpenVINO benchmark with settings of Model: Person Detection FP16, Device: CPU. Emerald Rapids: AVX-512 On was the fastest.
OpenVINO benchmark with settings of Model: Person Detection FP32, Device: CPU. Emerald Rapids: AVX-512 On was the fastest.
OpenVINO benchmark with settings of Model: Vehicle Detection FP16, Device: CPU. Emerald Rapids: AVX-512 On was the fastest.
OpenVINO benchmark with settings of Model: Weld Porosity Detection FP16, Device: CPU. Emerald Rapids: AVX-512 On was the fastest.
OpenVINO benchmark with settings of Model: Road Segmentation ADAS FP16-INT8, Device: CPU. Emerald Rapids: AVX-512 On was the fastest.
OpenVINO benchmark with settings of Model: Weld Porosity Detection FP16-INT8, Device: CPU. Emerald Rapids: AVX-512 On was the fastest.
OpenVINO benchmark with settings of Model: Age Gender Recognition Retail 0013 FP16-INT8, Device: CPU. Emerald Rapids: AVX-512 On was the fastest.

AVX-512 and AMX continue to be very impactful for Intel's OpenVINO AI toolkit.


Related Articles