December 06, 2024
BLADE: Energy-Efficient Attention Accelerator with Fused Kernel and Bit-Level Redunda...
Zhiwei Lin, Yubin Qin, Jiachen Wang, et al.