marcellopoliti commited on
Commit
c137590
1 Parent(s): 455da5c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -9
README.md CHANGED
@@ -1,11 +1,3 @@
1
- ##RemoteSensing Distilbert
2
- The field of earth observation is expanding all the time. More and more data scientists are interested in this domain, and they're developing computer vision applications that do amazing things, while NLP doesn't seem to be given much consideration in this area
3
- That's why I posted Chramer/remote-sensing-distilbert-cased. A masked language model trained on a corpus of technical information about space missions, instruments, and sensors.
4
-
5
- The model is distilbert-based, but I didn't have the chance to play with the hyperparameters of the model because of the limited computational capabilities I have.
6
- It was fun to publish my first model on hugging face!
7
-
8
-
9
  ---
10
 
11
  widget:
@@ -13,4 +5,34 @@ widget:
13
  - text: "Multiple [MASK] channels enable full polarimetry"
14
  - text: "The [MASK] is capable of measuring in limb and nadir geometry"
15
 
16
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
 
3
  widget:
 
5
  - text: "Multiple [MASK] channels enable full polarimetry"
6
  - text: "The [MASK] is capable of measuring in limb and nadir geometry"
7
 
8
+ ---
9
+
10
+ # RemoteSensing Distilbert
11
+
12
+ ![alt text](https://media.istockphoto.com/photos/space-communications-satellite-in-low-orbit-around-the-earth-elements-picture-id1062473882?b=1&k=20&m=1062473882&s=170667a&w=0&h=KWJwGSiXBffLgKdaQTxY-eY7ljJE5_3khXgQyAQHPbU=)
13
+
14
+ The field of earth observation is increasingly growing. More and more data scientists are interested about this domain, and they're developing computer vision applications that do amazing things, while NLP doesn't seem to be given much consideration in this area
15
+ That's why I posted [Chramer/remote-sensing-distilbert-cased](https://huggingface.co/Chramer/remote-sensing-distilbert-cased). This is masked language model trained on a corpus of technical information about space missions, instruments, and sensors.
16
+
17
+ The model is based on [distilbert-base-cased](https://huggingface.co/distilbert-base-uncased), but I didn't have the chance to play with the hyperparameters of the model because of the limited computational capabilities I have. So there's a lot to improve! 😆
18
+
19
+ It was fun to publish my first model on hugging face! 🤩
20
+
21
+ # Perplexity
22
+
23
+ Test set: 4.5k sentences about technical space stuff.
24
+
25
+ | Model | Perplexity |
26
+ | ------ | ------ |
27
+ | remote-sensing-distilbert-cased | **6.45** |
28
+ | distilbert-base-cased | 33.77 |
29
+
30
+
31
+ # Usage
32
+
33
+ ```python
34
+ from transformers import AutoModel, AutoTokenizer
35
+ model_name = "Chramer/remote-sensing-distilbert-cased"
36
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
37
+ model = AutoModel.from_pretrained(model_name)
38
+ ```