Edit model card

Notebook Info

Reference:

Task: Chat or Conversational

Input: User's prompt containing chat templated text in string format

Output: Model's generated text in string format

Experiment:

  • Use bos_token and eos_token to replace <|im_start|> and <|im_end|> in ChatML. (Inspired by: https://asmirnov.xyz/doppelganger)
  • Use left padding and left truncation to conform to max_length.
  • Set max_length = 256 in the training process, which consumes 33.7 GB of memory.

Notebook:

Downloads last month
7
Safetensors
Model size
1.07B params
Tensor type
F32
·
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Datasets used to train haidlir/bloom-chatml-id

Space using haidlir/bloom-chatml-id 1