microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 5 days ago • 637k • 1.17k
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • Updated 22 days ago • 1.64M • 1.05k
SliceGPT: Compress Large Language Models by Deleting Rows and Columns Paper • 2401.15024 • Published Jan 26, 2024 • 72
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning Paper • 2303.02861 • Published Mar 6, 2023 • 2