ringoreality commited on
Commit
b792fa5
1 Parent(s): 6f66d33

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Model Card for climategpt/climategpt-70b
2
+ - This model is the 70B parameter variant of the ClimateGPT model release.
3
+ - Starting from Llama2 70B weights, the model undergoes continued pretraining and instruction finetuning on climate data.
4
+ - The model is capable of answering questions and following instructions, especially tailored for the climate domain.
5
+
6
+ ## Overview
7
+ - **Developed by:** AppTek, Eqtylab, Erasmus AI
8
+ - **Model type:** decoder-only Transformer
9
+ - **Language(s) (NLP):** natively supported: English; supported via cascaded MT on web interface: Arabic, Bangla, Chinese (simplified), Dutch, Finnougoric, French, Germanic, Greek, Hebrew, Indonesian, Japenese, Korean, Lithuanian, Pashto, Persian, Portuguese, Russian, Spanish, Thai, Turkish, Vietnamese,
10
+ - **License:** TO BE ADDED
11
+ - **Finetuned from model:** Llama2 70B
12
+ - **Repository:** https://huggingface.co/climategpt/climategpt-70b
13
+ - **Paper:** TO BE ADDED
14
+ - **Demo:** TO BE ADDED
15
+
16
+ ## Uses
17
+ - This model is intended to be directly used as a question answering model that is specialized in the climate domain.
18
+ - The model is aimed at providing useful feedback for decision makers, scientists and jounalists involved in climate discussions.
19
+ - The model can also be used as a starting point for interested developers for further finetuning.
20
+ - The model is NOT intended to be a general-purpose chatbot (although it has chat capabilities).
21
+ - For the full system including cascaded MT, RAG, etc., we recommend the user to go to our demo website: TO BE ADDED.
22
+ - For hands-on finetuning deployment and inference, we recommend the user to directly use the Huggingface helpers.
23
+ - For in-depth model conversion and finetuning, we recommend the user to use https://github.com/epfLLM/Megatron-LLM/.
24
+ - **Despite the efforts from the development team to elimite them, as every other chat-capable LLMs, this model may generate biased, offensive, inaccurate responses.**
25
+
26
+ ## How to Get Started with the Model
27
+ After downloading the HF formatted model, the HF helpers should work out-of-the-box.
28
+ For example, it is possible to evaluate the model with https://github.com/EleutherAI/lm-evaluation-harness by plugging in the model identifier ```--model_args pretrained=climategpt/climategpt-70b```.
29
+
30
+ ## Training
31
+ - For the Llama2 training data, we refer the user to https://huggingface.co/meta-llama/Llama-2-70b-chat-hf.
32
+ - For continued pretraining, 4.2B climate domain tokens (tokenized by the Llama tokenizer) are used.
33
+ - For instruction finetuning, about 579K instruction-completion pairs (both in the climate domain but also general domain) are used.
34
+
35
+ ## Evaluation
36
+ Automatic evaluation is done via https://github.com/EleutherAI/lm-evaluation-harness, into which we also implemented custom evaluation tasks.
37
+ TO BE ADDED
38
+ We also perform human evaluation with experts in the climate domain.
39
+ TO BE ADDED
40
+
41
+ ## Environmental Impact
42
+ - **Hardware Type:** H100
43
+ - **Hours used:** 2300 hrs
44
+ - **Cloud Provider:** TO BE ADDED
45
+ - **Compute Region:** TO BE ADDED
46
+ - **Carbon Emitted:** TO BE ADDED
47
+
48
+ ## Citation
49
+ **BibTeX:** TO BE ADDED