Rhino-Mistral-7B / README.md
Locutusque's picture
Create README.md
8a3ca34 verified
|
raw
history blame
No virus
4.58 kB
metadata
license: cc-by-nc-4.0
datasets:
  - M4-ai/Rhino
language:
  - en
base_model: mistralai/Mistral-7B-v0.1
co2_eq_emissions:
  emissions: 3.8

Model Card for Model ID

This model aims to be a high-performance chatbot. During training, examples that have a quality score of less than 0.03 are skipped.

Model Details

Model Description

This model is to be used as a general-purpose chatbot/assistant. Trained on about 300,000 examples of M4-ai/Rhino, examples with a quality score lower than 0.03 are removed. During validation, this model achieved a loss of 0.55

  • Developed by: Locutusque
  • Model type: mistral
  • Language(s) (NLP): English
  • License: cc-by-nc-4.0
  • Finetuned from model: mistralai/Mistral-7B-v0.1

Uses

This model is to be used as a general-purpose assistant, and may need to be further fine-tuned on DPO to detoxify the model or SFT for a more specific task.

Direct Use

This model should be used as a general assistant. this model is Capable of writing code, answering questions, and following instructions.

Downstream Use [optional]

[More Information Needed]

Out-of-Scope Use

[More Information Needed]

Bias, Risks, and Limitations

[More Information Needed]

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

Training Details

Training Data

[More Information Needed]

Training Procedure

Training Hyperparameters

  • Training regime: bf16 non-mixed precision

Evaluation

Testing Data, Factors & Metrics

Testing Data

First 100 examples of M4-ai/Rhino. Training data does not include these examples.

Results

Test loss - 0.55

Summary

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: 8 TPU V3s
  • Hours used: 24
  • Cloud Provider: Kaggle
  • Compute Region: [More Information Needed]
  • Carbon Emitted: 3.8

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]