Attention is Not All You Need: Pure Attention Loses Rank Doubly Exponentially with Depth Paper • 2103.03404 • Published Mar 5, 2021 • 2
LANISTR: Multimodal Learning from Structured and Unstructured Data Paper • 2305.16556 • Published May 26, 2023 • 2
Learned Feature Importance Scores for Automated Feature Engineering Paper • 2406.04153 • Published Jun 6, 2024 • 1
Metadata Conditioning Accelerates Language Model Pre-training Paper • 2501.01956 • Published Jan 3, 2025 • 1
Attention is Not All You Need: Pure Attention Loses Rank Doubly Exponentially with Depth Paper • 2103.03404 • Published Mar 5, 2021 • 2
Learned Feature Importance Scores for Automated Feature Engineering Paper • 2406.04153 • Published Jun 6, 2024 • 1