Share some FFT benchmark results

Recently I got interested in FFT again :slight_smile: There are some benchmark results on the forum, but they are about 5 years old, so I think it is time to redo some of them. I only benchmark the complex-to-complex, power-of-two, forward, single-precision FFT, but it should be reasonably representative. The compiler is AppleClang. I might have made some mistakes, especially regarding FFTW on the M chip (I also tried the brew install, with similar results) and pffft. The M chip uses NEON and the Intel chip uses AVX2 (which kind of explains why the peak FLOPS are similar, because the M4 Pro has almost twice the clock speed).

In case someone also wants to try it: GitHub - zsliu98/zlfft: FFT implementation and analysis
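For anyone wondering how a FLOPS figure for an FFT is usually derived: a common convention (used by FFTW's benchFFT, for example) is to count a complex transform of size N as 5 N log2(N) floating-point operations and divide by the measured time. The sketch below illustrates only that arithmetic. It assumes this convention, which may not be exactly what zlfft reports, and the names `fft_radix2`, `nominal_flops` and `benchmark_gflops` are made up for the example. The plain-Python radix-2 FFT is a stand-in workload, so its numbers are in no way comparable to the optimized libraries.

```python
import cmath
import math
import time


def fft_radix2(x):
    """Recursive radix-2 forward FFT; len(x) must be a power of two."""
    n = len(x)
    if n < 1 or n & (n - 1):
        raise ValueError("length must be a power of two")
    if n == 1:
        return list(x)
    even = fft_radix2(x[0::2])  # transform of even-indexed samples
    odd = fft_radix2(x[1::2])   # transform of odd-indexed samples
    half = n // 2
    out = [0j] * n
    for k in range(half):
        # Twiddle factor for a forward transform (negative exponent).
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + half] = even[k] - t
    return out


def nominal_flops(n):
    """Nominal operation count for one complex FFT of size n (assumed convention)."""
    return 5.0 * n * math.log2(n)


def benchmark_gflops(n, repeats=20):
    """Time the FFT several times and convert the best run to nominal GFLOPS."""
    data = [complex(i % 7, -(i % 3)) for i in range(n)]  # arbitrary test input
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fft_radix2(data)
        best = min(best, time.perf_counter() - start)
    return nominal_flops(n) / best / 1e9


if __name__ == "__main__":
    for size in (256, 1024, 4096):
        print(size, f"{benchmark_gflops(size):.4f} nominal GFLOPS")
```

Because the operation count is nominal, the result is best read as a normalized speed (time scaled by N log2(N)) rather than a count of instructions actually executed, which is why implementations with different real operation counts can still be compared on the same axis.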


Awesome! I'd be interested to see some Windows benchmarks as well.

I would guess FFT performance is more chip-related? Though Intel IPP is definitely a strong choice :slight_smile:


Added some results from GitHub runner images (fftw3 refers to planning with FFTW_MEASURE and fftw3_estimate refers to planning with FFTW_ESTIMATE).
