---
pipeline_tag: text-generation
license: apache-2.0
language:
- en
tags:
- Open-platypus-Commercial
base_model: bardsai/jaskier-7b-dpo-v6.1
datasets:
- kyujinpy/Open-platypus-Commercial
model-index:
- name: T3Q-Platypus-Mistral7B
results: []
---
Update @ 2024.03.07
## T3Q-Platypus-Mistral7B
This model is a fine-tuned version of bardsai/jaskier-7b-dpo-v6.1, trained on the kyujinpy/Open-platypus-Commercial dataset.

**Model Developers** Chihoon Lee (chlee10), T3Q
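
A minimal inference sketch with Hugging Face Transformers is shown below; the repository id `chlee10/T3Q-Platypus-Mistral7B` and the instruction-style prompt format are assumptions, not confirmed by this card.

```python
# Inference sketch (assumed repository id and prompt format)
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "chlee10/T3Q-Platypus-Mistral7B"  # assumption: Hub repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

prompt = "### Instruction:\nSummarize what LoRA fine-tuning does.\n\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```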
## Training hyperparameters
The following hyperparameters were used during training:
```python
# ๋ฐ์ดํ„ฐ์…‹๊ณผ ํ›ˆ๋ จ ํšŸ์ˆ˜์™€ ๊ด€๋ จ๋œ ํ•˜์ดํผ ํŒŒ๋ผ๋ฏธํ„ฐ
batch_size = 16
num_epochs = 1
micro_batch = 1
gradient_accumulation_steps = batch_size // micro_batch
# Hyperparameters for the training method
cutoff_len = 4096
lr_scheduler = 'cosine'
warmup_ratio = 0.06 # warmup_steps = 100
learning_rate = 4e-4
optimizer = 'adamw_torch'
weight_decay = 0.01
max_grad_norm = 1.0
# LoRA config
lora_r = 16
lora_alpha = 16
lora_dropout = 0.05
lora_target_modules = ["gate_proj", "down_proj", "up_proj"]
# Options for how tokenizer inputs are handled
train_on_inputs = False
add_eos_token = False
# NEFTune params
noise_alpha = 5
```
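
For reference, a hedged sketch of how these values could be mapped onto a `peft.LoraConfig` and `transformers.TrainingArguments` is given below; this is an assumed reconstruction, not the actual training script used for this model.

```python
# Assumed reconstruction: wiring the hyperparameters above into peft / transformers objects.
from peft import LoraConfig
from transformers import TrainingArguments

lora_config = LoraConfig(
    r=16,                                    # lora_r
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["gate_proj", "down_proj", "up_proj"],
    task_type="CAUSAL_LM",
)

training_args = TrainingArguments(
    output_dir="T3Q-Platypus-Mistral7B",     # assumed output path
    num_train_epochs=1,
    per_device_train_batch_size=1,           # micro_batch
    gradient_accumulation_steps=16,          # batch_size // micro_batch
    learning_rate=4e-4,
    lr_scheduler_type="cosine",
    warmup_ratio=0.06,
    optim="adamw_torch",
    weight_decay=0.01,
    max_grad_norm=1.0,
    neftune_noise_alpha=5,                   # NEFTune; requires a recent transformers version
)
```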