Text Generation
Transformers
Safetensors
Indonesian
bloom
conversational
Inference Endpoints
text-generation-inference
haidlir committed
Commit 61ea9c1
1 Parent(s): 376034c

Update README.md (#2)


- Update README.md (6db5b387dc0536e8ffdb6e805971de02049dd79c)

Files changed (1)
  1. README.md +14 -6
README.md CHANGED
@@ -20,11 +20,19 @@ pipeline_tag: text-generation
  - https://huggingface.co/datasets/jakartaresearch/indoqa
 
 
- **Task**: Chat or Conversational
- **Input**: User's prompt containing chat templated text in string format
- **Output**: Model's generated text in string format
+ **Task**:
+ Chat or Conversational
+
+ **Input**:
+ User's prompt containing chat templated text in string format
+
+ **Output**:
+ Model's generated text in string format
 
  **Experiment**:
- - Use bos and eos token to replace <|im_start|> and <|im_end|> in ChatML. (Inspired by: https://asmirnov.xyz/doppelganger)
- - Penggunaan padding dan truncation sesuai max_length.
- - Max length = 256, karena telah mengkonsumsi 33.7 GB.
+ - Use bos_token and eos_token to replace <|im_start|> and <|im_end|> in ChatML. (Inspired by: https://asmirnov.xyz/doppelganger)
+ - Use left padding and left truncation to conform to max_length.
+ - Set max_length = 256 in the training process, which consumes 33.7 GB of memory.
+
+ **Notebook**:
+ - https://drive.google.com/file/d/11FiaWxGt2HxUirZrHTNLaVmiqrUwejwV/view?usp=drive_link
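
For reference, below is a minimal sketch of how the input/output contract and the experiment settings described in the updated README could be exercised at inference time with the Transformers library. It is an illustration only, not the author's code: the repo id, the role names, the newline layout of the prompt, and the `build_prompt` helper are assumptions; the authoritative training code is in the linked notebook.

```python
# Minimal sketch, assuming the Transformers library and a BLOOM-based checkpoint.
# MODEL_ID, the role names, and build_prompt are placeholders/assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "bigscience/bloom-560m"  # placeholder: substitute the fine-tuned repo id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)


def build_prompt(user_message: str) -> str:
    """ChatML-style turns, with bos_token/eos_token standing in for
    <|im_start|> and <|im_end|> (for BLOOM these are "<s>" and "</s>")."""
    bos, eos = tokenizer.bos_token, tokenizer.eos_token
    return f"{bos}user\n{user_message}{eos}\n{bos}assistant\n"


# Left padding and left truncation so the most recent part of the chat survives
# when the templated text exceeds max_length (256 tokens in the reported run).
tokenizer.padding_side = "left"
tokenizer.truncation_side = "left"

inputs = tokenizer(
    build_prompt("Apa ibu kota Indonesia?"),
    return_tensors="pt",
    truncation=True,
    max_length=256,
)

output_ids = model.generate(
    **inputs,
    max_new_tokens=128,
    eos_token_id=tokenizer.eos_token_id,  # a generated turn ends at eos_token
)

# Input is a chat-templated string; output is the generated reply as a string.
reply = tokenizer.decode(
    output_ids[0, inputs["input_ids"].shape[1]:],
    skip_special_tokens=True,
)
print(reply)
```

The left `padding_side`/`truncation_side` settings matter mainly when batching templated chats during fine-tuning: they keep the most recent turns and drop the oldest ones once a conversation exceeds the 256-token max_length.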