---
license: apache-2.0
---

Very interesting. RWKV-5 is great at benchmarks (excellent zero-shot performance), but it generates noticeably worse music despite its lower loss. This fits my theory: Dot-product is good for uncreative work, while Channelwise is good for creative work.