arxiv:2407.02068
Kaixin Xu
kartmannXu
AI & ML interests
Neural Network Compression, Efficient AI
Recent Activity
Updated a model 24 days ago: kartmannXu/MiniCPM-2B-128k-pruned-0.3-onnx
Authored a paper about 1 month ago: Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks
Authored a paper about 1 month ago: LPViT: Low-Power Semi-structured Pruning for Vision Transformers
Organizations
None yet
Papers
3
Models
12
kartmannXu/MiniCPM-2B-128k-pruned-0.3-onnx
kartmannXu/MiniCPM-2B-128k-q4f16_1_mlc
kartmannXu/MiniCPM-2B-128k-prune-ch-0.3-q0f16-mlc
kartmannXu/MiniCPM-2B-128k-prune-ch-0.25-q0f16-mlc
kartmannXu/MiniCPM-2B-128k-prune-ch-0.3-q3f16_1-mlc
kartmannXu/MiniCPM-2B-128k-prune-ch-0.3-q4f16_2-mlc
kartmannXu/MiniCPM-2B-128k-prune-ch-0.25-q3f16_1-mlc
kartmannXu/MiniCPM-2B-128k-prune-ch-0.25-q4f16_2-mlc
kartmannXu/MiniCPM-2B-128k-prune-bl-0.3-q3f16_1-mlc
kartmannXu/MiniCPM-2B-128k-q4f16_2_mlc
Datasets
None public yet