MoonMath AI released an MIT-licensed bf16 forward attention kernel for AMD's MI300X GPU, written in HIP. The kernel beats AMD's own AITER v3 on every tested shape and rounding mode. Bare-metal access came from HotAisle, an AMD cloud provider. The kernel targets the gfx942 ISA only.
Tap to vote and see what everyone thinks.
Summary by ByteBrief
FishMonger's arsenal upgraded: SprySOCKS for Windows