NewstaR
/

StableGalen-6b

	---
	license: other
	datasets:
	- Photolens/MedText-DoctorLLaMa-OpenOrca-formatted
	- shibing624/medical
	language:
	- en
	tags:
	- medicine
	- doctor
	---
	# This model is DeciLM-6b-Instruct, fine-tuned specifically for medicine

	Galen uses either of the following prompt templates:

	```
	### User: {prompt}

	### Response:
	```

	or

	```
	{prompt}
	```
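As a minimal sketch of how the two templates could be applied in code, the helper below fills in the `{prompt}` placeholder. The function name `build_prompt`, the `use_template` flag, and the trailing newline after `### Response:` are assumptions made for illustration; the card itself only specifies the template text.

```python
def build_prompt(user_message: str, use_template: bool = True) -> str:
    """Format a message with one of the two prompt templates from the card.

    Hypothetical helper: the name and the flag are not part of the model card.
    """
    if use_template:
        # Template 1: "### User:" header, blank line, then "### Response:".
        # The trailing newline is an assumption; the card does not say
        # whether generation should start on a new line.
        return f"### User: {user_message}\n\n### Response:\n"
    # Template 2: the bare prompt, passed through unchanged.
    return user_message


if __name__ == "__main__":
    print(build_prompt("What are common symptoms of anemia?"))
```

The resulting string can be passed to whichever generation interface is in use. Which template gives better answers is not stated in the card, so it may be worth trying both.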

	# Galen Training Recipe:
	- target_modules = ["q_proj", "v_proj", "gate_proj", "down_proj", "up_proj", "k_proj", "o_proj"]
	- Learning Rate: 4e-4
	- LR Scheduler: constant
	- Steps: 250

	<img src="Loss.png" alt="Loss" width="300" height="200" />
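The recipe above could be captured in code roughly as follows. This is a sketch under stated assumptions, not the author's training script: the `target_modules` list suggests a LoRA-style adapter setup, but the card does not give the rank, alpha, dropout, batch size, or optimizer, so those are deliberately left out. The class `GalenRecipe` and both helper functions are hypothetical. The dictionary keys are chosen to mirror argument names used by `peft.LoraConfig` (`target_modules`) and `transformers.TrainingArguments` (`learning_rate`, `lr_scheduler_type`, `max_steps`), on the assumption that similar tooling was used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GalenRecipe:
    """Hyperparameters listed in the model card (hypothetical container)."""
    target_modules: tuple = (
        "q_proj", "v_proj", "gate_proj", "down_proj",
        "up_proj", "k_proj", "o_proj",
    )
    learning_rate: float = 4e-4
    lr_scheduler_type: str = "constant"
    max_steps: int = 250


def adapter_kwargs(recipe: GalenRecipe) -> dict:
    # Only the adapter setting the card states; rank, alpha, and dropout
    # are not given there and must be supplied by the user.
    return {"target_modules": list(recipe.target_modules)}


def trainer_kwargs(recipe: GalenRecipe) -> dict:
    # Optimizer-schedule settings taken directly from the card.
    return {
        "learning_rate": recipe.learning_rate,
        "lr_scheduler_type": recipe.lr_scheduler_type,
        "max_steps": recipe.max_steps,
    }


if __name__ == "__main__":
    recipe = GalenRecipe()
    print(adapter_kwargs(recipe))
    print(trainer_kwargs(recipe))
```

Keeping the stated values in one place makes it easier to see which settings come from the card and which would have to be chosen independently when attempting a similar run.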

	## T3: 1 Hour