Solshine's picture
Update README.md
577aa4c verified
|
raw
history blame
848 Bytes
metadata
base_model: Solshine/reflection-llama-3.1-8B-Solshine-trainround1-16bit
language:
  - en
license: llama3.1
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - llama
  - trl
  - sft
  - reflection
datasets:
  - Harshkmr/orca-math-word-reflection

Uploaded model

  • Developed by: Solshine
  • License: LLama 3.1 License
  • Finetuned from model : Solshine/reflection-llama-3.1-8B-Solshine-trainround1-16bit

Inspired by and featuring the Reflection Tuning technique pioneered by Matt Shumer (possibly earlier innovated by the team at Anthropic.)

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.