Gemma 2b Instruction Fine-Tuned Model

Introduction

This repository contains the code and model for fine-tuning the Gemma 2b Instruction model using a dataset of 20k medium articles. The fine-tuned model is designed to generate responses based on input prompts related to instructional queries in English.

Model Details:

Model Name: Gemma 2b Instruct Medium FT
Framework: Keras with JAX backend
Model Type: Causal Language Model
Pre-trained Model: Gemma 2b Instruction Model

Fine-Tuning Process

The Gemma 2b Instruction Fine-Tuned Model was fine-tuned using the following process:

Data Preparation: 20k medium articles were used as the training dataset.
Model Configuration: Gemma 2b Instruction model was used as the base model.
Hyperparameters Tuning: AdamW optimizer was used with customized learning rates and weight decays.
Training: The model was trained for 2 epochs with a batch size of 2, Because of Less GPU resources of Kaggle It was finetuned for 5hrs.

Usage

Environment Setup

Ensure you have the necessary dependencies installed:

pip install keras keras-nlp

Inference

To use the fine-tuned model for inference, follow these steps:

Download the Model:
- The fine-tuned model file version_finetuned.keras can be downloaded from the provided link.
Inference:

from keras.models import load_model

# Load the saved model
loaded_model = load_model("version_finetuned.keras")
instruction = "How to code in python and c#"
response = ""

# Create the input prompt
prompt = f"Instruction:\n{instruction}\n\nResponse:\n{response}"
# Generate inference using the loaded model
inference_result = loaded_model.generate(prompt, max_length=1024)

# Print or use the generated response
print(inference_result)

Model Card

For more information about the model, check out its model card on Hugging Face Model Hub.

License

This project is licensed under the MIT License.