Gradient-based Parameter Selection for Efficient Fine-Tuning Paper • 2312.10136 • Published Dec 15, 2023 • 1
Cross-modal Information Flow in Multimodal Large Language Models Paper • 2411.18620 • Published about 1 month ago • 2
Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective Paper • 2411.18615 • Published about 1 month ago • 1
The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models Paper • 2406.19999 • Published Jun 28, 2024 • 3
The LAMBADA dataset: Word prediction requiring a broad discourse context Paper • 1606.06031 • Published Jun 20, 2016
Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change Analysis Paper • 2305.11993 • Published May 19, 2023