Update README.md
README.md CHANGED
@@ -24,6 +24,7 @@ tags:
 
 "좋아(Joah)" by AsianSoul
 
+
 ## Merge Details
 
 
@@ -37,6 +38,13 @@ Don't worry even if you don't get the results you want.
 
 I'll find the answer for you.
 
+Coming soon: PoSE to extend Llama's context length to 64k, combined with my merge method ["reborn"](https://medium.com/@puffanddmx82/reborn-elevating-model-adaptation-with-merging-for-superior-nlp-performance-f604e8e307b2).
+
+256k is not possible yet; my computer runs out of memory. If you support me, I will try it on a computer with maximum specifications.
+
+If you support me, I would also like to run thorough tests for you by building out a network with high-capacity traffic and high-speed 10G connections.
+
+
 ### Merge Method
 
 This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method with [NousResearch/Meta-Llama-3-8B](https://huggingface.co/NousResearch/Meta-Llama-3-8B) as a base.
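DARE and TIES are complementary steps: DARE sparsifies each fine-tune's task vector by random dropping and rescaling, and TIES resolves sign conflicts among the surviving deltas before folding them back into the base. Below is a minimal sketch of that combination over raw state dicts, assuming floating-point torch tensors with matching keys; the `dare_ties_merge` helper and its defaults are illustrative, not the tooling that produced this model.

```python
import torch

def dare_ties_merge(base, tuned, drop_p=0.9, weight=1.0):
    """Merge fine-tuned state dicts into a base via DARE dropping + TIES sign election."""
    merged = {}
    for name, base_w in base.items():
        # Task vectors: each fine-tune's difference from the base weights.
        deltas = [sd[name] - base_w for sd in tuned]

        # DARE: drop a fraction drop_p of each delta's entries at random,
        # rescaling survivors by 1/(1 - drop_p) to keep the expectation unchanged.
        kept = []
        for d in deltas:
            mask = (torch.rand_like(d) >= drop_p).to(d.dtype)
            kept.append(d * mask / (1.0 - drop_p))

        # TIES: elect a per-entry sign from the summed deltas, zero out
        # entries that disagree, then average the agreeing survivors.
        stacked = torch.stack(kept)
        elected = torch.sign(stacked.sum(dim=0))
        agree = (torch.sign(stacked) == elected).to(stacked.dtype)
        counts = agree.sum(dim=0).clamp(min=1.0)
        merged_delta = (stacked * agree).sum(dim=0) / counts

        merged[name] = base_w + weight * merged_delta
    return merged
```

In practice, merges like this one are driven by dedicated tooling (e.g. mergekit) configured with per-model densities and weights rather than a hand-rolled loop.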
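On the PoSE plan above: PoSE ([arXiv:2309.10400](https://arxiv.org/abs/2309.10400)) extends the context window by training on short chunks whose position ids are given random skips, so the model sees relative positions from the full target window without ever attending over it. A toy sketch of that manipulation follows; the 2k training window, 64k target, and two-segment split are assumptions for illustration, not this model's actual training setup.

```python
import random

def pose_position_ids(train_len=2048, target_len=65536):
    """Sample skipped position ids: train_len tokens spanning a target_len window."""
    # Split the short training chunk into two segments...
    split = random.randint(1, train_len - 1)
    # ...and push the second segment deeper into the simulated long context.
    skip = random.randint(0, target_len - train_len)
    first = list(range(split))
    second = [split + skip + i for i in range(train_len - split)]
    return first + second

# The model still attends over only 2048 tokens per step, but the relative
# positions it encounters range over the full 64k target window.
ids = pose_position_ids()
assert len(ids) == 2048 and max(ids) < 65536
```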