Aleph-Alpha 's Collections

Tfree-HAT-7b-pretrained

Tokenizer free models based on Hierarchical Autoregressive Transformer (https://arxiv.org/abs/2501.10322) trained from scratch.