-
MambaByte: Token-free Selective State Space Model
Paper • 2401.13660 • Published • 47 -
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper • 2312.00752 • Published • 131 -
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Paper • 2401.04081 • Published • 68 -
hustvl/Vim-tiny
Updated • 17
Michael Schock
mjschock
AI & ML interests
None yet
Organizations
Collections
1
spaces
1
models
23
mjschock/leagaleasy-llama-3-instruct-v1
Updated
mjschock/llama-7b-qlora-ultrachat
Updated
mjschock/mamba-130m-peft-lora
Updated
mjschock/mamba-130m-ppo
Text Generation
•
Updated
mjschock/mamba-130m
Feature Extraction
•
Updated
•
4
•
1
mjschock/mamba-1.4b
Text Generation
•
Updated
•
1
mjschock/mamba-790m
Text Generation
•
Updated
•
4
mjschock/mamba-370m
Text Generation
•
Updated
•
1
mjschock/zephyr-7b-sft-qlora
Updated
mjschock/MobileVLM-1.7B
Updated
datasets
None public yet