QAttn: Efficient GPU Kernels for mixed-precision vision transformers. Piotr Sebastian Kluska, Adrián Castelló, et al. 2024. CVPR 2024.