99
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Design of a modern fast Fourier transform and cache effective bit-reversal algorithm

& ORCID Icon
Pages 229-248 | Received 23 Jan 2023, Accepted 07 Feb 2023, Published online: 02 Mar 2023
 

ABSTRACT

This article deals with efficient vectorization of the fast Fourier transform algorithm while focusing on Cooley–Tukey versions with power-of-two radixes. Aside from examples of optimizations for 256 and 512-bit vectors, this work also discusses relations between individual radix-based versions, vectorization and OpenMP threading. Ideas are progressing into a timeless design of the FFT algorithm, which can work with any vector size and radix version through conversion into radix-2 output permutation. Furthermore, the implementation of the Cache Optimized Bit-Reversal algorithm, which doubles the performance of its predecessor, is introduced.

Data availability statement

The source codes of this project are available in the GitHub repository under the name SAFFT [Citation25].

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the Grant Agency of the Czech Technical University in Prague [grant number SGS20/212/OHK3/3T/18].

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.