microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 9 days ago • 741k • 1.2k
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • Updated 25 days ago • 1.71M • • 1.07k
SliceGPT: Compress Large Language Models by Deleting Rows and Columns Paper • 2401.15024 • Published Jan 26, 2024 • 73
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning Paper • 2303.02861 • Published Mar 6, 2023 • 2