Model Card for Model ID

This bot gives a bitter review fn any paper you submit. See https://hippocampus-garden.com/tiny_llama_dpo_lora/ for full details.

Model Details

Model Description

  • Developed by: Shion Honda
  • Model type: Text Generation
  • Language(s) (NLP): English
  • License: MIT
  • Finetuned from model: TinyLlama/TinyLlama-1.1B-Chat-v1.0

Model Card Contact

[More Information Needed]

Framework versions

  • PEFT 0.10.0
Downloads last month
1
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for shionhonda/tiny-llama-reviewer2-1.1B-dpo-lora

Adapter
(887)
this model

Dataset used to train shionhonda/tiny-llama-reviewer2-1.1B-dpo-lora