SparseLLMs

community

AI & ML interests

None defined yet.

Recent Activity

Raincleared new activity 16 days ago

SparseLLM/prosparse-llama-2-7b: Model not running on CPU, due to flash_attn package requirement.

Raincleared new activity 23 days ago

SparseLLM/ReluLLaMA-7B: Adding `safetensors` variant of this model

Raincleared new activity about 1 month ago

SparseLLM/sparsing-law-0.1b-relu: Adding `safetensors` variant of this model

SparseLLM's activity

Raincleared

in SparseLLM/prosparse-llama-2-7b 16 days ago

Model not running on CPU, due to flash_attn package requirement.

#8 opened 18 days ago by Akash1003

Raincleared

in SparseLLM/ReluLLaMA-7B 23 days ago

Adding `safetensors` variant of this model

#3 opened 23 days ago by SFconvertbot

Raincleared

in SparseLLM/sparsing-law-0.1b-relu about 1 month ago

Adding `safetensors` variant of this model

#1 opened about 1 month ago by SFconvertbot

demerzel-iv

authored a paper about 2 months ago

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published Nov 4, 2024 • 11

demerzel-iv

updated 8 models 2 months ago

Raincleared

authored a paper 2 months ago

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published Nov 4, 2024 • 11

demerzel-iv

updated a model 2 months ago

SparseLLM/sparsing-law-0.1b-silu

Text Generation • Updated Nov 5, 2024 • 5

Raincleared

updated a model 2 months ago

SparseLLM/sparsing-law-0.1b-relu

Text Generation • Updated about 1 month ago • 118 • 1

demerzel-iv

updated a model 2 months ago

SparseLLM/sparsing-law-0.1b-relu

Text Generation • Updated about 1 month ago • 118 • 1

Raincleared

authored a paper 4 months ago

Configurable Foundation Models: Building LLMs from a Modular Perspective

Paper • 2409.02877 • Published Sep 4, 2024 • 28

ZhengyanZhang

authored a paper 4 months ago

Configurable Foundation Models: Building LLMs from a Modular Perspective

Paper • 2409.02877 • Published Sep 4, 2024 • 28

ZhengyanZhang

authored a paper 7 months ago

Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

Paper • 2406.05955 • Published Jun 10, 2024 • 23

yixinsong

authored a paper 7 months ago

Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

Paper • 2406.05955 • Published Jun 10, 2024 • 23

Team members 7
