thrunlab (ThrunLab)

Genghan

authored a paper 2 months ago

Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity

Paper • 2501.16295 • Published Jan 27 • 8

Genghan

authored a paper 9 months ago

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5, 2024 • 32

Genghan

authored a paper 10 months ago

MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

Paper • 2406.14909 • Published Jun 21, 2024 • 15

vxbrandon

updated 13 models 11 months ago

vxbrandon

updated 4 models 12 months ago

thrunlab/relu_mistral_7b_refined_web_relu_2024-04-14

Text Generation • Updated Apr 14, 2024 • 4

thrunlab/sparse_mistral_7b_refined_web_90p_2024-04-14

Updated Apr 14, 2024

thrunlab/sparse_mistral_7b_refined_web_70p_2024-04-14

Updated Apr 14, 2024

thrunlab/sparse_mistral_7b_refined_web_50p_2024-04-14

Text Generation • Updated Apr 14, 2024 • 5

ThrunLab

AI & ML interests

thrunlab's activity

thrunlab/sparse_llama_7b_hf2_refined_web_50p_2024-05-14

thrunlab/sparse_llama_7b_hf2_refined_web_90p_2024-05-12

thrunlab/sparse_llama_7b_hf2_refined_web_70p_2024-05-12

thrunlab/sparse_llama_7b_hf2_refined_web_50p_2024-05-12

thrunlab/relu_mistral_7b_refined_web_relu_2024-05-11

thrunlab/sparse_mistral_7b_refined_web_50p_2024-05-11

thrunlab/cats_exp

thrunlab/sparse_llama_7b_hf2_refined_web_70p_2024-05-11

thrunlab/sparse_llama_7b_hf2_refined_web_50p_2024-05-11

thrunlab/sparse_mistral_7b_refined_web_90p_2024-05-11

thrunlab/sparse_mistral_7b_refined_web_70p_2024-05-11

thrunlab/relu_llama_7b_hf2_refined_web_relu_2024-05-11

thrunlab/sparse_llama_7b_hf2_refined_web_90p_2024-05-11

Team members 3
