TBA
The tokenizer is different from Cohere's, the chat template is ChatML, and the model is fully fine-tuned at 128K+ context.
No LoRAs, no quants, no tricks.
Pressure testing from: https://github.com/LeonEricsson/llmcontext
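Since the chat template is ChatML, prompts follow the standard `<|im_start|>`/`<|im_end|>` turn delimiters. A minimal sketch of rendering a conversation in that format (plain Python, no tokenizer dependency; the exact special tokens are the standard ChatML ones, assumed to match this model's template):

```python
def to_chatml(messages):
    """Render a list of {'role', 'content'} dicts as a ChatML prompt."""
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n")
    # Open an assistant turn to cue the model's reply.
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)

prompt = to_chatml([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize this document."},
])
print(prompt)
```

In practice the same result comes from `tokenizer.apply_chat_template(messages, add_generation_prompt=True)` once the model's tokenizer is loaded.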