Text Generation
Transformers
Safetensors
English
llama
Not-For-All-Audiences
conversational
Inference Endpoints
text-generation-inference
TeeZee's picture
Upload 10 files
4822545 verified
metadata
language:
  - en
license: apache-2.0
tags:
  - not-for-all-audiences
datasets:
  - Intel/orca_dpo_pairs
  - athirdpath/DPO_Pairs-Roleplay-Alpaca-NSFW
  - Open-Orca/SlimOrca
  - MinervaAI/Aesir-Preview
  - allenai/ultrafeedback_binarized_cleaned

GALAXY-16B-v1.0

image/png

Technical notes

  • 72 layers,DUS procedure, mistral(32)->SOLAR(48)->GALAXY(72)
  • 16B parameters
  • model created as a extension of depth upscaling procedure used for SOLAR by upstage

Results

  • model can and will produce NSFW content
  • waiting for eval results