Continual Training - a Collection by imamnurby

Continual Training

updated Mar 14

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13