harishnair04
/

gemma_instruct_medtr_2b

Text Generation

Model card Files Files and versions Community

gemma_instruct_medtr_2b / README.md

harishnair04's picture

Update README.md

e00cd3c verified about 2 months ago

|

929 Bytes

	---
	library_name: keras-hub
	pipeline_tag: text-generation
	license: apache-2.0
	datasets:
	- harishnair04/mtsamples
	language:
	- en
	base_model:
	- google/gemma-2-2b
	tags:
	- LoRA
	---
	This is a [`Gemma` model](https://keras.io/api/keras_hub/models/gemma) uploaded using the KerasHub library and can be used with JAX, TensorFlow, and PyTorch backends.
	This model is related to a `CausalLM` task.

	Model config:
	* name: gemma_backbone
	* trainable: True
	* vocabulary_size: 256000
	* num_layers: 18
	* num_query_heads: 8
	* num_key_value_heads: 1
	* hidden_dim: 2048
	* intermediate_dim: 32768
	* head_dim: 256
	* layer_norm_epsilon: 1e-06
	* dropout: 0
	* query_head_dim_normalize: True
	* use_post_ffw_norm: False
	* use_post_attention_norm: False
	* final_logit_soft_cap: None
	* attention_logit_soft_cap: None
	* sliding_window_size: 4096
	* use_sliding_window_attention: False