cortexso
/

openhermes-2.5

Inference Endpoints

Model card Files Files and versions Community

openhermes-2.5 / model.yml

jan-hq's picture

Create model.yml

6d230ea verified 5 months ago

535 Bytes

	name: openhermes-2.5
	model: openhermes-2.5:7B
	version: 1

	files:
	- llama_model_path: model.gguf

	# Results Preferences
	top_p: 0.95
	temperature: 0.7
	frequency_penalty: 0
	presence_penalty: 0
	max_tokens: 4096 # Infer from base config.json -> max_position_embeddings
	stream: true # true \| false

	# Engine / Model Settings
	ngl: 33 # Infer from base config.json -> num_attention_heads
	ctx_len: 4096 # Infer from base config.json -> max_position_embeddings
	engine: cortex.llamacpp
	prompt_template: "{system_message} [INST] {prompt} [/INST]"