ThriftAttention: Selective Mixed Precision for Long-Context FP4 Attention