webersni commited on
Commit
afa4bd2
1 Parent(s): 81225f2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +26 -1
README.md CHANGED
@@ -3,9 +3,34 @@ language: en
3
  license: apache-2.0
4
  ---
5
 
 
 
 
 
 
 
 
 
6
  Using the [DistilRoBERTa](https://huggingface.co/distilroberta-base) model as starting point, the ClimateBERT Language Model is additionally pretrained on a text corpus comprising climate-related research paper abstracts, corporate and general news and reports from companies. The underlying methodology can be found in our [language model research paper](https://arxiv.org/abs/2110.12010).
7
 
8
- ### BibTeX entry and citation info
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  ```bibtex
10
  @article{wkbl2021,
11
  title={ClimateBERT: A Pretrained Language Model for Climate-Related Text},
 
3
  license: apache-2.0
4
  ---
5
 
6
+ # Model Card for distilroberta-base-climate-s
7
+
8
+ ## Model Description
9
+
10
+ This is the ClimateBERT language model based on the SIM-SELECT sample selection strategy.
11
+
12
+ *Note: We generally recommend choosing the [distilroberta-base-climate-f](https://huggingface.co/climatebert/distilroberta-base-climate-f) language model over this language model (unless you have good reasons not to).*
13
+
14
  Using the [DistilRoBERTa](https://huggingface.co/distilroberta-base) model as starting point, the ClimateBERT Language Model is additionally pretrained on a text corpus comprising climate-related research paper abstracts, corporate and general news and reports from companies. The underlying methodology can be found in our [language model research paper](https://arxiv.org/abs/2110.12010).
15
 
16
+ ## Climate performance card
17
+
18
+ | distilroberta-base-climate-s | |
19
+ |--------------------------------------------------------------------------|----------------|
20
+ | 1. Is the resulting model publicly available? | Yes |
21
+ | 2. How much time does the training of the final model take? | 8 hours |
22
+ | 3. How much time did all experiments take (incl. hyperparameter search)? | 288 hours |
23
+ | 4. What was the energy consumption (GPU/CPU)? | 0.7 kW |
24
+ | 5. At which geo location were the computations performed? | Germany |
25
+ | 6. What was the energy mix at the geo location? | 470 gCO2eq/kWh |
26
+ | 7. How much CO2eq was emitted to train the final model? | 2.63 kg |
27
+ | 8. How much CO2eq was emitted for all experiments? | 94.75 kg |
28
+ | 9. What is the average CO2eq emission for the inference of one sample? | 0.62 mg |
29
+ | 10. Which positive environmental impact can be expected from this work? | This work can be categorized as a building block tools following Jin et al (2021). It supports the training of NLP models in the field of climate change and, thereby, have a positive environmental impact in the future. |
30
+ | 11. Comments | Block pruning could decrease CO2eq emissions |
31
+
32
+ ### Citation Information
33
+
34
  ```bibtex
35
  @article{wkbl2021,
36
  title={ClimateBERT: A Pretrained Language Model for Climate-Related Text},