Text Generation
PEFT
Safetensors
lora
dpo
trl
tinyllama
pharmaceutical
medical
preference-tuning
alignment
Instructions to use ssuvetha/pharma-tinyllama-dpo-lora-adapter with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use ssuvetha/pharma-tinyllama-dpo-lora-adapter with PEFT:
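```python
from peft import PeftModel
from transformers import AutoModelForCausalLM

# The base model is loaded from a local filesystem path, not from a Hub repo ID.
base_model = AutoModelForCausalLM.from_pretrained("/content/pharma_tinyllama_instruction_merged_model")
# Attach the LoRA adapter weights on top of the base model.
model = PeftModel.from_pretrained(base_model, "ssuvetha/pharma-tinyllama-dpo-lora-adapter")
```
- Notebooks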
- Google Colab
- Kaggle
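
The PEFT loading snippet above stops once the adapter is attached. As a minimal sketch of running text generation with the result, the helpers below wrap the same two loading calls and add a greedy decoding step. Several details are assumptions rather than documented facts about this adapter: the function names `load_model_and_tokenizer` and `generate` are hypothetical, the tokenizer is assumed to be saved alongside the base model, and the prompt is passed as plain text because the expected prompt format is not stated here.

```python
ADAPTER_ID = "ssuvetha/pharma-tinyllama-dpo-lora-adapter"
# Local path copied from the loading snippet; replace it with wherever
# your copy of the base model is stored.
BASE_MODEL_PATH = "/content/pharma_tinyllama_instruction_merged_model"


def load_model_and_tokenizer(base_path=BASE_MODEL_PATH, adapter_id=ADAPTER_ID):
    """Load the base model, attach the LoRA adapter, return (model, tokenizer)."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the tokenizer files live in the same directory as the base model.
    tokenizer = AutoTokenizer.from_pretrained(base_path)
    base_model = AutoModelForCausalLM.from_pretrained(base_path)
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()  # inference mode: disables dropout
    return model, tokenizer


def generate(model, tokenizer, prompt, max_new_tokens=128):
    """Return a greedy completion for a plain-text prompt."""
    import torch

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )
    # Slice off the prompt tokens so only newly generated text is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example usage (hypothetical prompt; requires the base model at BASE_MODEL_PATH):
# model, tokenizer = load_model_and_tokenizer()
# print(generate(model, tokenizer, "What is metformin commonly prescribed for?"))
```

Greedy decoding (`do_sample=False`) is used only to keep the output repeatable between runs. Sampling parameters can be passed to `model.generate` instead if more varied output is wanted.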