Instructions to use coder1969/gemma-2-2b-scientific-summarizer with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use coder1969/gemma-2-2b-scientific-summarizer with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("google/gemma-2-2b") model = PeftModel.from_pretrained(base_model, "coder1969/gemma-2-2b-scientific-summarizer") - Notebooks
- Google Colab
- Kaggle
Model Card for gemma-2-2b-scientific-summarizer
This is a Parameter-Efficient Fine-Tuning (PEFT) LoRA adapter for google/gemma-2-2b optimized for scientific lay summarization and key-point extraction.
Model Details
Model Description
- Developed by: coder1969
- Model type: PEFT (LoRA) adapter for Causal Language Modeling
- Language(s) (NLP): English
- License: Apache-2.0
- Finetuned from model:
google/gemma-2-2b
Uses
Direct Use
This model is directly intended for taking scientific abstract/literature context and generating structured lay summaries or key points.
Out-of-Scope Use
- Clinical or medical diagnostics without peer review.
- Automated code generation or general chatbot applications.
Bias, Risks, and Limitations
Users should be aware that language models can hallucinate or omit critical details from context. Outputs should be verified against original sources.
How to Get Started with the Model
Use the code below to load the base model and apply the fine-tuned adapter weights:
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
base_model_name = "google/gemma-2-2b"
adapter_model_name = "coder1969/gemma-2-2b-scientific-summarizer"
# Load tokenizer
tokenizer = AutoTokenizer.from_pretrained(base_model_name)
if tokenizer.pad_token is None:
tokenizer.pad_token = tokenizer.eos_token
# Load base model
model = AutoModelForCausalLM.from_pretrained(
base_model_name,
torch_dtype=torch.float16,
device_map="auto"
)
# Apply adapters
model = PeftModel.from_pretrained(model, adapter_model_name)
# Inference Example
prompt = "Document:\nTopic: quantum machine learning\n\nRelevant Literature:\n[Insert relevant abstracts or papers here]\n\nSummary:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
outputs = model.generate(
**inputs,
max_new_tokens=256,
do_sample=True,
temperature=0.7,
top_p=0.9
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Training Details
Training Data
Finetuned on the tomasg25/scientific_lay_summarisation dataset (subset: plos), containing pairs of scientific articles and summaries.
Training Hyperparameters
- LoRA Config:
- Rank (r): 8
- Alpha: 16
- Dropout: 0.1
- Target Modules: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
- Optimization parameters:
- Learning Rate: 2e-05
- Batch Size: 8
- Gradient Accumulation Steps: 4
- Epochs: 3
- Weight Decay: 0.01
- Precision: Mixed precision (FP16)
Framework versions
- PEFT 0.19.1
- Transformers 4.40.0+
- PyTorch 2.0+
- Downloads last month
- 23
Model tree for coder1969/gemma-2-2b-scientific-summarizer
Base model
google/gemma-2-2b