AronXiang
/

RetrospexLLaMA3

Model card Files Files and versions Community

Model Card for Model ID

This model is trained by lora for Retrospex based on AgentInstruct and ShareGPT datasets. The base model is Llama-3-8B-Instruct.

Model Details

Model Description

Developed by: Convai NJU
Shared by [optional]: Convai NJU
Model type: Llama model
Language(s) (NLP): en
License: llama3
Finetuned from model [optional]: Llama-3-8B-Instruct

Model Sources

Repository: https://github.com/Yufei-Xiang/Retrospex.git

Training Details

Training Data

AgentInstruct: https://huggingface.co/datasets/THUDM/AgentInstruct

ShareGPT: https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered

Training Hyperparameters

fp16: True
lr: 2e-5
batch size: 8
lora r: 16
lora alpha: 64

Downloads last month: 1

Safetensors

Model size

8.03B params

Tensor type

F32

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for AronXiang/RetrospexLLaMA3

Quantizations

Datasets used to train AronXiang/RetrospexLLaMA3