Multi - FFT vectorization for the cell multicore processor

J. Barhen, T. Humble, P. Mitra, M. Traweek

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

The emergence of streaming multicore processors with multi-SIMD architectures and ultra-low power operation combined with real-time compute and I/O reconfigurability opens unprecedented opportunities for executing sophisticated signal processing algorithms faster and within a much lower energy budget. Here, we present an unconventional FFT implementation scheme for the IBM Cell, named transverse vectorization. It is shown to outperform (both in terms of timing or GFLOP throughput) the fastest FFT results reported to date in the open literature.

Original languageEnglish
Title of host publicationCCGrid 2010 - 10th IEEE/ACM International Conference on Cluster, Cloud, and Grid Computing
Pages780-785
Number of pages6
DOIs
StatePublished - 2010
Event10th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2010 - Melbourne, VIC, Australia
Duration: May 17 2010May 20 2010

Publication series

NameCCGrid 2010 - 10th IEEE/ACM International Conference on Cluster, Cloud, and Grid Computing

Conference

Conference10th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2010
Country/TerritoryAustralia
CityMelbourne, VIC
Period05/17/1005/20/10

Keywords

  • FFT
  • IBM cell
  • Multicore processors
  • Transverse vectorization

Fingerprint

Dive into the research topics of 'Multi - FFT vectorization for the cell multicore processor'. Together they form a unique fingerprint.

Cite this