SEWYLM 2

New architecture using a blend of the following:

  • nGPT
  • LOCONUT (Limited COCONUT) (variation of COCONUT)
  • Gemma2
  • Differential Transformer
  • NeuTRENO

As of 16th dec. 2024, you need to use my library to use this model

SewyLM

link if not visible https://github.com/AarushCodes/SewyLM

LICENSE

GNU GPL v3

Downloads last month
131
Safetensors
Model size
187M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.