sho-takase committed
Commit 3fa4cb4
1 Parent(s): b083c4b

Update README.md
README.md
CHANGED
@@ -11,7 +11,9 @@ This repository provides Japanese language models trained by [SB Intuitions](htt
 
 ## How to use
 
-
+Please set **use_fast=False** to use our tokenizer properly.
+
+```python
 import torch
 from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline, set_seed
 
@@ -35,7 +37,7 @@ for t in text:
 
 ## Configuration
 
-| Parameters | Vocab size |
+| Parameters | Vocab size | Training tokens | Architecture | Position type | Layers | Hidden dim | Attention heads |
 | :-----: | :-----------: | :-------------: | :----------- | :-----------: | :----: | :--------: | :-------------: |
 | [7B](https://huggingface.co/sbintuitions/sarashina1-7b) | 51200 | 1.0T | GPTNeoX | RoPE | 32 | 4096 | 32 |
 | [13B](https://huggingface.co/sbintuitions/sarashina1-13b) | 51200 | 1.0T | GPTNeoX | RoPE | 40 | 5120 | 40 |
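
For context, the **use_fast=False** note added in the first hunk applies to tokenizer loading: in `transformers`, `use_fast=False` selects the slow (pure-Python) tokenizer class instead of the Rust-backed fast one. Below is a minimal sketch of how that flag fits together with the imports shown in the diff; the checkpoint name, prompt, and sampling settings are illustrative assumptions, not necessarily the README's exact example.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline, set_seed

model_name = "sbintuitions/sarashina1-7b"  # illustrative; the 13B checkpoint loads the same way

# use_fast=False selects the slow tokenizer class, as the updated README instructs.
tokenizer = AutoTokenizer.from_pretrained(model_name, use_fast=False)
# device_map="auto" assumes the accelerate package is installed.
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16, device_map="auto")

generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
set_seed(42)

# Prompt and generation settings are illustrative, not the README's exact values.
text = generator(
    "おはようございます、今日の天気は",
    max_length=30,
    do_sample=True,
    num_return_sequences=3,
)
for t in text:
    print(t["generated_text"])
```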
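
The second hunk restores the full header of the Configuration table. As a quick cross-check of the listed values, one could read them back from the published configs; this sketch assumes the checkpoints expose the standard GPTNeoX config attributes (`vocab_size`, `num_hidden_layers`, `hidden_size`, `num_attention_heads`), matching the Architecture column in the table.

```python
from transformers import AutoConfig

# Cross-check the Configuration table against the published configs.
for name in ("sbintuitions/sarashina1-7b", "sbintuitions/sarashina1-13b"):
    cfg = AutoConfig.from_pretrained(name)
    print(
        name,
        cfg.vocab_size,           # table: Vocab size
        cfg.num_hidden_layers,    # table: Layers
        cfg.hidden_size,          # table: Hidden dim
        cfg.num_attention_heads,  # table: Attention heads
    )
```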