11 16 7

Chenyang Song

Raincleared

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

new activity 3 months ago

SparseLLM/prosparse-llama-2-7b:Model not running on CPU, due to flash_attn package requirement.

new activity 3 months ago

SparseLLM/ReluLLaMA-7B:Adding `safetensors` variant of this model

View all activity

Organizations

Raincleared's activity

upvoted a paper 8 days ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published 9 days ago • 63

New activity in SparseLLM/prosparse-llama-2-7b 3 months ago

Model not running on CPU, due to flash_attn package requirement.

#8 opened 3 months ago by

Akash1003

New activity in SparseLLM/ReluLLaMA-7B 3 months ago

Adding `safetensors` variant of this model

#3 opened 3 months ago by

SFconvertbot

New activity in SparseLLM/sparsing-law-0.1b-relu 3 months ago

Adding `safetensors` variant of this model

#1 opened 3 months ago by

SFconvertbot

upvoted a paper 3 months ago

Densing Law of LLMs

Paper • 2412.04315 • Published Dec 5, 2024 • 19

New activity in openbmb/MiniCPM-S-1B-sft 4 months ago

Adding `safetensors` variant of this model

#1 opened 4 months ago by

SFconvertbot

authored a paper 4 months ago

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published Nov 4, 2024 • 11

updated a model 4 months ago

SparseLLM/sparsing-law-0.1b-relu

Text Generation • Updated Dec 12, 2024 • 27 • 2

upvoted a paper 4 months ago

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published Nov 4, 2024 • 11

commented a paper 4 months ago

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published Nov 4, 2024 • 11 •

New activity in SparseLLM/prosparse-llama-2-7b 5 months ago

why does this model use FP32??

#6 opened 5 months ago by

purejomo

upvoted a paper 6 months ago

Configurable Foundation Models: Building LLMs from a Modular Perspective

Paper • 2409.02877 • Published Sep 4, 2024 • 29

authored a paper 6 months ago

Configurable Foundation Models: Building LLMs from a Modular Perspective

Paper • 2409.02877 • Published Sep 4, 2024 • 29

liked a model 8 months ago

mistralai/Mistral-Large-Instruct-2407

Updated Oct 16, 2024 • 9.86k • 825

updated a model 9 months ago

openbmb/MiniCPM-S-1B-sft-gguf

Updated Jul 4, 2024 • 29 • 6

updated a collection 9 months ago

MiniCPM

Collection

The MiniCPM family of LLMs and VLLMs. • 32 items • Updated Jan 19 • 67

liked a model 9 months ago

openbmb/MiniCPM-S-1B-sft-gguf

Updated Jul 4, 2024 • 29 • 6

upvoted a paper 9 months ago

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models

Paper • 2406.15718 • Published Jun 22, 2024 • 14

updated a collection 9 months ago

MiniCPM

Collection

The MiniCPM family of LLMs and VLLMs. • 32 items • Updated Jan 19 • 67

liked a model 9 months ago

openbmb/MiniCPM-S-1B-sft-llama-format

Text Generation • Updated Sep 7, 2024 • 19 • 4