File size: 462 Bytes
915c013 e187fae 915c013 e187fae 915c013 e187fae 915c013 e187fae 915c013 d5263ed |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 |
---
library_name: transformers
tags: []
---
# Malaysian Llama-3 8B 65536 context length
65536 context length and 15300000 RoPE Theta.
WanDB, https://wandb.ai/huseinzol05/EasyContext-65536?nw=nwuserhuseinzol05
Source code, https://github.com/mesolitica/malaya/tree/master/session/llama3#extend-1m-context-length
Special thanks to https://github.com/jzhang38/EasyContext for wrapping https://github.com/zhuzilin/ring-flash-attention for distributed training!
|