Edit model card

Model Card for Model ID

This bot gives a bitter review fn any paper you submit. See https://hippocampus-garden.com/tiny_llama_dpo_lora/ for full details.

Model Details

Model Description

  • Developed by: Shion Honda
  • Model type: Text Generation
  • Language(s) (NLP): English
  • License: MIT
  • Finetuned from model: TinyLlama/TinyLlama-1.1B-Chat-v1.0

Model Card Contact

[More Information Needed]

Framework versions

  • PEFT 0.10.0
Downloads last month
4
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Adapter for

Dataset used to train shionhonda/tiny-llama-reviewer2-1.1B-dpo-lora