Collections
Collections including paper arxiv:2406.19380
- MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
  Paper • 2405.07526 • Published • 17
- Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach
  Paper • 2405.15613 • Published • 13
- A Touch, Vision, and Language Dataset for Multimodal Alignment
  Paper • 2402.13232 • Published • 13
- How Do Large Language Models Acquire Factual Knowledge During Pretraining?
  Paper • 2406.11813 • Published • 30

- End-to-End Object Detection with Transformers
  Paper • 2005.12872 • Published • 5
- Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models
  Paper • 2404.06209 • Published • 4
- TabReD: A Benchmark of Tabular Machine Learning in-the-Wild
  Paper • 2406.19380 • Published • 47
- SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
  Paper • 2407.09025 • Published • 128

- FMViT: A multiple-frequency mixing Vision Transformer
  Paper • 2311.05707 • Published • 5
- DocLLM: A layout-aware generative language model for multimodal document understanding
  Paper • 2401.00908 • Published • 181
- LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
  Paper • 2405.00732 • Published • 118
- An Introduction to Vision-Language Modeling
  Paper • 2405.17247 • Published • 85