Alexandre Eichenberger

Title

Principal RSM, Compiler Optimization for Parallelism

Bio

Design and development of high-performance computing systems.

I work as a Principal Research Staff Member in the Z Research group at the IBM T.J. Watson Research Center. My research interests focus on the interaction between compiler technology and micro-architecture design.

My most recent work focuses on accelerating Deep Neural Networks on CPUs as well as on custom AI hardware accelerators such as the IBM Telum dedicated on-chip accelerator for AI inference. I am the lead of the open-source ONNX-MLIR project, which aims to lower ONNX neural network models into optimized code using the MLIR infrastructure and the LLVM optimizing backend. I am also on the ONNX steering committee, representing IBM.

My prior work covered OpenMP, GPU acceleration, multi-threading, and SIMD code generation.

Publications

Performance analysis of OpenMP on a GPU using a CORAL proxy application
- - Gheorghe-Teodor Bercea
  - Carlo Bertolli
  - et al.
- 2015
- PMBS/SC 2015
Integrating GPU support for OpenMP offloading directives into clang
- - Carlo Bertolli
  - Samuel F. Antao
  - et al.
- 2015
- LLVM-HPC 2015
Coordinating GPU threads for OpenMP 4.0 in LLVM
- - Carlo Bertolli
  - Samuel F. Antao
  - et al.
- 2014
- LLVM-HPC/SC 2014
Automatic creation of tile size selection models
- - Tomofumi Yuki
  - Lakshminarayanan Renganarayanan
  - et al.
- 2010
- CGO 2010
Compact multi-dimensional kernel extraction for register tiling
- - Lakshminarayanan Renganarayana
  - Uday Bondhugula
  - et al.
- 2009
- SC 2009
Exploiting parallelism with dependence-aware scheduling
- - Xiaotong Zhuang
  - Alexandre E. Eichenberger
  - et al.
- 2009
- PACT 2009
Hybrid Access-Specific Software Cache Techniques for the Cell BE Architecture
- - Marc Gonzàlez
  - Nikola Vujic
  - et al.
- 2008
- PACT 2008
Overview of the IBM Blue Gene/P project
- - Gheorghe Almasi
  - Sameh Asaad
  - et al.
- 2008
- IBM J. Res. Dev
Using advanced compiler technology to exploit the performance of the Cell Broadband Engine™ architecture
- - Alexandre E. Eichenberger
  - John Kevin O'Brien
  - et al.
- 2006
- IBM Systems Journal
Efficient SIMD code generation for runtime alignment and length conversion
- - Peng Wu
  - Alexandre E. Eichenberger
  - et al.
- 2005
- CGO 2005


Top collaborators

Ramon Bertran Monfort

Senior Research Scientist @ Efficient and Resilient Systems Research Group

Alper Buyuktosunoglu

Principal Research Scientist

Patents

Framework For Efficient Code Generation Using Loop Peeling For SIMD Loop Code With Multiple Misaligned Statements

Efficient Data Reorganization To Satisfy Data Alignment Constraints

Framework For Integrated Intra- And Inter-loop Aggregation Of Contiguous Memory Accesses For SIMD Vectorization

Method And Apparatus For Eliminating The Need For Register Assignment, Allocation, Spilling And Re-filling
