---
library_name: transformers
tags: []
---

# Malaysian Llama-3 8B 65536 context length

Llama-3 8B extended to a 65536-token context length, with RoPE theta set to 15300000.
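As a rough illustration of why a larger RoPE theta goes with a longer context, the sketch below computes the standard RoPE inverse frequencies and the longest rotary wavelength for the theta above. It is not taken from the training code: the helper names are hypothetical, `HEAD_DIM = 128` assumes the usual Llama-3 8B head dimension, and `BASE_THETA = 500000` assumes the stock Llama-3 value for comparison.

```python
import math

# Values stated in this model card.
CONTEXT_LENGTH = 65536
ROPE_THETA = 15_300_000.0

# Assumptions, not stated in this card: Llama-3 8B head dimension
# and the stock Llama-3 RoPE theta used as a baseline.
HEAD_DIM = 128
BASE_THETA = 500_000.0


def rope_inv_freq(theta: float, head_dim: int) -> list[float]:
    """Standard RoPE inverse frequencies: theta ** (-2i / head_dim)."""
    return [theta ** (-(2 * i) / head_dim) for i in range(head_dim // 2)]


def longest_wavelength(theta: float, head_dim: int) -> float:
    """Wavelength, in tokens, of the slowest-rotating dimension pair."""
    return 2 * math.pi / rope_inv_freq(theta, head_dim)[-1]


for name, theta in [("baseline", BASE_THETA), ("this model", ROPE_THETA)]:
    wavelength = longest_wavelength(theta, HEAD_DIM)
    print(f"{name}: theta={theta:.0f}, longest wavelength={wavelength:.0f} tokens, "
          f"ratio to context={wavelength / CONTEXT_LENGTH:.1f}")
```

Raising theta slows the rotation of every dimension pair except the first, so positions far apart in a 65536-token window still map to distinguishable rotary angles.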

WandB, https://wandb.ai/huseinzol05/EasyContext-65536?nw=nwuserhuseinzol05

Source code, https://github.com/mesolitica/malaya/tree/master/session/llama3#extend-1m-context-length

Special thanks to https://github.com/jzhang38/EasyContext for wrapping https://github.com/zhuzilin/ring-flash-attention for distributed training!