ooibp
's Collections
LLM Papers
updated
Attention Is All You Need
Paper
•
1706.03762
•
Published
•
34
BERT: Pre-training of Deep Bidirectional Transformers for Language
Understanding
Paper
•
1810.04805
•
Published
•
11
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and
lighter
Paper
•
1910.01108
•
Published
•
9
Language Models are Few-Shot Learners
Paper
•
2005.14165
•
Published
•
9
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper
•
2201.11903
•
Published
•
7
Training language models to follow instructions with human feedback
Paper
•
2203.02155
•
Published
•
11
PaLM: Scaling Language Modeling with Pathways
Paper
•
2204.02311
•
Published
•
1
The Flan Collection: Designing Data and Methods for Effective
Instruction Tuning
Paper
•
2301.13688
•
Published
•
8
LLaMA: Open and Efficient Foundation Language Models
Paper
•
2302.13971
•
Published
•
11
Paper
•
2303.08774
•
Published
•
3
Paper
•
2305.10403
•
Published
•
4
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Paper
•
2305.10601
•
Published
•
7
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper
•
2307.09288
•
Published
•
233
Attention Is Not All You Need Anymore
Paper
•
2308.07661
•
Published
•
1
Paper
•
2310.06825
•
Published
•
41
Gemini: A Family of Highly Capable Multimodal Models
Paper
•
2312.11805
•
Published
•
44
Gemini 1.5: Unlocking multimodal understanding across millions of tokens
of context
Paper
•
2403.05530
•
Published
•
49
Gemma: Open Models Based on Gemini Research and Technology
Paper
•
2403.08295
•
Published
•
41
OpenELM: An Efficient Language Model Family with Open-source Training
and Inference Framework
Paper
•
2404.14619
•
Published
•
117