Text Generation
Transformers
PyTorch
code
gpt2
custom_code
Eval Results
text-generation-inference

Question about model architecture

#40
by sh0416 - opened

Hello,

I'm just wondering that the architecture is different from starcoder.

Starcoder uses GPTBigCode, while this use custom architecture.

If it differs, could you elaborate details?

Thanks.

BigCode org

AFAIK Santa Coder was an early experiment. Please use the starcoder series models.

Sign up or log in to comment