davidxmle commited on
Commit
0be80ae
1 Parent(s): 9a115a3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -40,10 +40,11 @@ datasets:
40
  - Original Model creator: [Meta Llama from Meta](https://huggingface.co/meta-llama)
41
  - Original model: [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B)
42
  - Built with Meta Llama 3
43
- - Quantized by [Astronomer](https://astronomer.io)
44
 
45
  ## MUST READ: Very Important!! Note About Untrained Special Tokens in Llama 3 Base (Non-instruct) Models & Fine-tuning Llama 3 Base
46
- - Special tokens such as the ones used for instruct are undertrained in Llama 3 base models. (discovered by Daniel Han https://twitter.com/danielhanchen/status/1781395882925343058)
 
47
  - ![image/png](https://cdn-uploads.huggingface.co/production/uploads/655ad0f8727df37c77a09cb9/1U2rRrx60p1pNeeAZw8Rd.png)
48
  - A patch function is under way, fine-tuning this model for instruction following may cause `NaN` graidents unless this problem is addressed.
49
 
 
40
  - Original Model creator: [Meta Llama from Meta](https://huggingface.co/meta-llama)
41
  - Original model: [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B)
42
  - Built with Meta Llama 3
43
+ - Quantized by [David Xue](https://www.linkedin.com/in/david-xue-uva/) from [Astronomer](https://astronomer.io)
44
 
45
  ## MUST READ: Very Important!! Note About Untrained Special Tokens in Llama 3 Base (Non-instruct) Models & Fine-tuning Llama 3 Base
46
+ - Special tokens such as the ones used for instruct are undertrained in Llama 3 base models.
47
+ - Credits: discovered by Daniel Han https://twitter.com/danielhanchen/status/1781395882925343058
48
  - ![image/png](https://cdn-uploads.huggingface.co/production/uploads/655ad0f8727df37c77a09cb9/1U2rRrx60p1pNeeAZw8Rd.png)
49
  - A patch function is under way, fine-tuning this model for instruction following may cause `NaN` graidents unless this problem is addressed.
50