haidlir
/

bloom-chatml-id

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Edit model card

Notebook Info

Reference:

Task: Chat or Conversational

Input: User's prompt containing chat templated text in string format

Output: Model's generated text in string format

Experiment:

Use bos_token and eos_token to replace <|im_start|> and <|im_end|> in ChatML. (Inspired by: https://asmirnov.xyz/doppelganger)
Use left padding and left truncation to conform to max_length.
Set max_length = 256 in the training process, which consumes 33.7 GB of memory.

Notebook:

https://drive.google.com/file/d/11FiaWxGt2HxUirZrHTNLaVmiqrUwejwV/view?usp=drive_link

Downloads last month: 7

Safetensors

Model size

1.07B params

Tensor type

F32

·

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Datasets used to train haidlir/bloom-chatml-id

Space using haidlir/bloom-chatml-id 1