Optimization algorithms for energy-efficient data centers
Hendrik F. Hamann
InterPACK 2013
This paper presents results on a communications-intensive kernel, the three-dimensional fast Fourier transform (3D FFT), running on the 2,048-node Blue Gene®/L (BG/L) prototype. Two implementations of the volumetric FFT algorithm were characterized, one built on the Message Passing Interface library and another built on an active packet Application Program Interface supported by the hardware bring-up environment, the BG/L advanced diagnostics environment. Preliminary performance experiments on the BG/L prototype indicate that both of our implementations scale well up to 1,024 nodes for 3D FFTs of size 128 × 128 × 128. The performance of the volumetric FFT is also compared with that of the Fastest Fourier Transform in the West (FFTW) library. In general, the volumetric FFT outperforms a port of the FFTW Version 2.1.5 library on large-node-count partitions. © Copyright 2005 by International Business Machines Corporation.
Hendrik F. Hamann
InterPACK 2013
Sai Zeng, Angran Xiao, et al.
CAD Computer Aided Design
Arun Viswanathan, Nancy Feldman, et al.
IEEE Communications Magazine
Frank R. Libsch, S.C. Lien
IBM J. Res. Dev