Triton is a language and compiler for writing highly efficient custom deep-learning primitives. Not officially supported on Windows, but a fork provides pre-built wheels. 3.6.x RTX 50xx (Blackwell), ...