Model Card for Model ID
This bot gives a bitter review fn any paper you submit. See https://hippocampus-garden.com/tiny_llama_dpo_lora/ for full details.
Model Details
Model Description
- Developed by: Shion Honda
- Model type: Text Generation
- Language(s) (NLP): English
- License: MIT
- Finetuned from model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
Model Card Contact
[More Information Needed]
Framework versions
- PEFT 0.10.0
- Downloads last month
- 4
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.
Model tree for shionhonda/tiny-llama-reviewer2-1.1B-dpo-lora
Base model
TinyLlama/TinyLlama-1.1B-Chat-v1.0