« Back
INT-FlashAttention: Enabling Flash Attention for INT8 Quantization
arxiv.org
Submitted by PaulHoule 2 days ago