RedPajama: an Open Dataset for Training Large Language Models
Paper
•
2411.12372
•
Published
•
47
None defined yet.
mamba
is now available in transformers. Thanks to
@tridao
and
@albertgu
for this brilliant model! 🚀 and the amazing mamba-ssm
kernels powering this!