Enhance speed by using nn.layernorm and nn.groupnorm (triton-lang/triton#5712) b1e34ec verified zhiyuan8 commited on 10 days ago