---
license: apache-2.0
pipeline_tag: text-generation
library_name: mlx
tags:
  - vllm
  - mlx
base_model: openai/gpt-oss-120b
---

See gpt-oss-120b 6.5bit MLX in action - demonstration video

In our testing, the q6.5bit quant typically achieves a perplexity of 1.128, which matches the perplexity of the q8 quant (1.128).
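For readers unfamiliar with the metric, perplexity is conventionally the exponential of the negative mean log-probability the model assigns to each token, so lower is better and 1.0 is the minimum. The sketch below illustrates only that definition. The function name `perplexity` and the sample values are hypothetical, and it does not reproduce the evaluation setup behind the figures above, which is not described here.

```python
import math


def perplexity(token_logprobs):
    """Return exp(-mean(log p)) for a list of natural-log token probabilities."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    mean_logprob = sum(token_logprobs) / len(token_logprobs)
    return math.exp(-mean_logprob)


# Hypothetical per-token log-probabilities, for illustration only.
sample_logprobs = [-0.10, -0.15, -0.11]
print(f"perplexity: {perplexity(sample_logprobs):.3f}")
```

Two quants that report the same perplexity on the same text are assigning, on average, the same probability to the reference tokens.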

Usage Notes