Could you please share the initial weights of one of the experts from jamba?
3
#4 opened 8 months ago
by
danielpark
Example Code for Initializing from Scratch
1
#3 opened 8 months ago
by
tanimazsin130
Fast Mamba kernels are not available. Make sure to they are installed and that the mamba module is on a CUDA device
1
#2 opened 8 months ago
by
BalajiAJ
[Request] Potential Release Of Training Code?
2
#1 opened 8 months ago
by
Lyte