CausalLM
/

35b-beta-long

Text Generation

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

Edit model card

TBA

Tokenizer is different from cohere - and chat template is ChatML - fully fine-tuned at 128K+

No loras, no quants, no tricks, 30M+ sft data.

Pressure Testing from: https://github.com/LeonEricsson/llmcontext

Downloads last month: 2,601

Safetensors

Model size

35B params

Tensor type

BF16

·

Datasets used to train CausalLM/35b-beta-long

Collection including CausalLM/35b-beta-long

Farewell Gifts

5 items • Updated Apr 13 • 1