Edit model card

GALAXY-16B-v1.0

image/png

Technical notes

  • 72 layers,DUS procedure, mistral(32)->SOLAR(48)->GALAXY(72)
  • 16B parameters
  • model created as a extension of depth upscaling procedure used for SOLAR by upstage

Results

  • model can and will produce NSFW content
  • waiting for eval results
Downloads last month
3
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Datasets used to train TeeZee/GALAXY-16B-v1.0-bpw6.5-h8-exl2

Collection including TeeZee/GALAXY-16B-v1.0-bpw6.5-h8-exl2