Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model Paper • 2406.15275 • Published Jun 21 • 11
How Do Large Language Models Acquire Factual Knowledge During Pretraining? Paper • 2406.11813 • Published Jun 17 • 30
Gradient Ascent Post-training Enhances Language Model Generalization Paper • 2306.07052 • Published Jun 12, 2023
EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records Paper • 2301.07695 • Published Jan 16, 2023 • 1
Exploring the Benefits of Training Expert Language Models over Instruction Tuning Paper • 2302.03202 • Published Feb 7, 2023 • 1
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning Paper • 2305.14045 • Published May 23, 2023 • 5
Aligning Large Language Models through Synthetic Feedback Paper • 2305.13735 • Published May 23, 2023 • 1
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners Paper • 2210.02969 • Published Oct 6, 2022
Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation Paper • 2401.06591 • Published Jan 12 • 3
Contextualized Sparse Representations for Real-Time Open-Domain Question Answering Paper • 1911.02896 • Published Nov 7, 2019 • 1
Zero-Shot Dense Video Captioning by Jointly Optimizing Text and Moment Paper • 2307.02682 • Published Jul 5, 2023 • 1
Can Large Language Models Truly Understand Prompts? A Case Study with Negated Prompts Paper • 2209.12711 • Published Sep 26, 2022
Knowledge Unlearning for Mitigating Privacy Risks in Language Models Paper • 2210.01504 • Published Oct 4, 2022
Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index Paper • 1906.05807 • Published Jun 13, 2019 • 1
Towards Continual Knowledge Learning of Language Models Paper • 2110.03215 • Published Oct 7, 2021 • 1