Microscopic-Mamba-2.1B
A self-trained microscopic Mamba model with around 2.1B parameters.
The tokenizer is the one from https://huggingface.co/state-spaces/mamba-2.8b-hf.
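Since the tokenizer comes from an existing public repository, it can be loaded on its own. The following is a minimal sketch, assuming the `transformers` library is installed and that the repository loads through `AutoTokenizer`. The helper names `load_tokenizer` and `count_tokens` are hypothetical and only for illustration; the repo id is taken from the URL above.

```python
# Repo id taken from the tokenizer URL given above.
TOKENIZER_REPO = "state-spaces/mamba-2.8b-hf"


def load_tokenizer(repo_id: str = TOKENIZER_REPO):
    """Download (or load from the local cache) the tokenizer for repo_id.

    Hypothetical helper: assumes the `transformers` package is installed.
    """
    # Imported lazily so the module can be imported without transformers.
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(repo_id)


def count_tokens(tokenizer, text: str) -> int:
    """Return how many token ids the tokenizer produces for text."""
    return len(tokenizer(text)["input_ids"])
```

To try it, call `tok = load_tokenizer()` and then `count_tokens(tok, "Hello, Mamba!")`. The first call needs network access to fetch the tokenizer files.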
It is being trained on around 400B tokens, and this checkpoint is from step 15.5k.
Evaluation is currently in progress.
This model is available under the Apache 2.0 License.
Join our Discord server here.
Eager to buy me a $2 cup of coffee or iced tea? 🍵☕ Sure, here is the link: https://ko-fi.com/drnicefellow. Please add a note saying which one you want me to drink.