FloatX: A C++ library for customized floating-point arithmeticGoran FlegarFlorian Scheideggeret al.2019ACM TOMS
High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa SegmentationThomas GrutzmacherHartwig Anztet al.2018IA3 2018
Systematic derivation of time and power models for linear algebra kernels on multicore architecturesCristiano MalossiYves Ineichenet al.2015SUSCOM
Performance and Energy-Aware Characterization of the Sparse Matrix-Vector Multiplication on Multithreaded ArchitecturesCristiano MalossiYves Ineichenet al.2014ICPPW 2014
Deriving dense linear algebra librariesPaolo BientinesiJohn A. Gunnelset al.2013Formal Aspects of Computing
The science of deriving dense linear algebra algorithmsPaolo BientinesiJohn A. Gunnelset al.2005ACM TOMS