
How do you extend the context to 16K?

#8
by plancktree - opened

The model card says the 4K context can be extended to 16K — how is that done?

  1. Try linear position interpolation (PI): edit config.json and add
    "rope_scaling": {
      "factor": 4.0,
      "type": "linear"
    }
    This works without any further training.

  2. Edit config.json and raise rope_theta (the RoPE base) to 100,000 or 1,000,000, then finetune on long-context data (16K or 32K).
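The two options above manipulate the same quantity: the rotary angle assigned to each position. A minimal sketch of that math, assuming a toy `rope_angles` helper (illustrative only, not the model's actual implementation):

```python
import math

def rope_angles(pos, dim=8, base=10000.0, scale=1.0):
    """Rotary angle per frequency pair: theta_i = (pos / scale) / base^(2i/dim)."""
    return [(pos / scale) / base ** (2 * i / dim) for i in range(dim // 2)]

# Option 1, linear PI: factor 4.0 squeezes positions 0..16K into the
# trained 0..4K range, so position 16000 gets the same angles the model
# already saw for position 4000 -- hence no extra training is needed.
assert rope_angles(16000, scale=4.0) == rope_angles(4000)

# Option 2, larger base: raising rope_theta slows the angle growth at
# the higher frequencies, stretching the usable range; unlike PI this
# changes the rotation geometry, which is why finetuning is required.
a_base_10k = rope_angles(16000, base=10000.0)
a_base_1m = rope_angles(16000, base=1000000.0)
assert all(big < small for big, small in zip(a_base_1m[1:], a_base_10k[1:]))
```

Note the first frequency (i = 0) is unaffected by the base, since base**0 is 1; only the higher-index pairs rotate more slowly.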

chencyudel changed discussion status to closed
