---
license: apache-2.0
---

Use rwkv pip package 0.8.5+ for RWKV-5.

Very interesting. RWKV-5 is great at benchmarks (excellent zero-shot performance), but it generates noticeably worse music (just like GPT models) despite reaching a lower loss.

This fits my theory: Dot-product is good for uncreative work, while Channelwise is good for creative work.
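
A quick way to confirm the version requirement above is to check the installed `rwkv` package before loading an RWKV-5 model. This is only a minimal sketch using the standard library; the helper names `parse_version` and `rwkv_is_new_enough` are made up for illustration and are not part of the `rwkv` package. Note that the simple parser treats pre-release strings such as `0.8.5rc1` as older than `0.8.5`.

```python
from importlib.metadata import PackageNotFoundError, version

MIN_RWKV = (0, 8, 5)  # minimum rwkv version stated above for RWKV-5


def parse_version(text):
    """Turn '0.8.22' into (0, 8, 22); stops at the first non-numeric part."""
    parts = []
    for piece in text.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def rwkv_is_new_enough(installed):
    """Return True if the given version string meets the minimum."""
    return parse_version(installed) >= MIN_RWKV


if __name__ == "__main__":
    try:
        installed = version("rwkv")  # reads the installed package metadata
    except PackageNotFoundError:
        print("rwkv is not installed; try: pip install -U rwkv")
    else:
        status = "OK for RWKV-5" if rwkv_is_new_enough(installed) else "too old, upgrade to 0.8.5+"
        print(f"rwkv {installed}: {status}")
```

If the check reports an older version, `pip install -U rwkv` should bring it up to date.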