julien-c HF staff committed on
Commit
ab0a023
1 Parent(s): 23f4690

Migrate model card from transformers-repo


Read announcement at https://discuss.huggingface.co/t/announcement-all-model-cards-will-be-migrated-to-hf-co-model-repos/2755
Original file history: https://github.com/huggingface/transformers/commits/master/model_cards/aliosm/ai-soco-cpp-roberta-tiny-clas/README.md

Files changed (1)
  1. README.md +56 -0
README.md ADDED
@@ -0,0 +1,56 @@
+ ---
+ language: "c++"
+ tags:
+ - exbert
+ - authorship-identification
+ - fire2020
+ - pan2020
+ - ai-soco
+ - classification
+ license: "mit"
+ datasets:
+ - ai-soco
+ metrics:
+ - accuracy
+ ---
+
+ # ai-soco-c++-roberta-tiny-clas
+
+ ## Model description
+
+ The `ai-soco-c++-roberta-tiny` model fine-tuned on the [AI-SOCO](https://sites.google.com/view/ai-soco-2020) task.
+
+ #### How to use
+
+ You can use the model directly after tokenizing the input with the tokenizer provided alongside the model files.
+
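As a minimal sketch of the usage described above (not the author's own script), the snippet below loads the tokenizer and classifier with the `transformers` Auto classes and returns the predicted class index for one C++ source file. Several parts are assumptions: the hub id `aliosm/ai-soco-cpp-roberta-tiny-clas` is taken from the model-card path in the commit message, the helper names `spaces_to_tabs` and `predict_author` are hypothetical, and the 4-spaces-to-tab step simply mirrors the preprocessing mentioned in the card's training notes, implemented here with a plain left-to-right string replacement.

```python
MODEL_ID = "aliosm/ai-soco-cpp-roberta-tiny-clas"  # assumption: id taken from the model-card path
MAX_LENGTH = 512  # maximum sequence length stated in the card


def spaces_to_tabs(source: str) -> str:
    """Replace each run of 4 consecutive spaces with one tab (left to right)."""
    return source.replace("    ", "\t")


def predict_author(source: str, model_id: str = MODEL_ID) -> int:
    """Return the predicted class index for one C++ source file."""
    # Imported lazily so spaces_to_tabs works without torch/transformers installed.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()

    # Apply the same whitespace normalization as in training, then truncate to 512 tokens.
    inputs = tokenizer(
        spaces_to_tabs(source),
        truncation=True,
        max_length=MAX_LENGTH,
        return_tensors="pt",
    )
    with torch.no_grad():
        logits = model(**inputs).logits
    return int(logits.argmax(dim=-1).item())
```

Calling `predict_author(open("solution.cpp").read())` (the file name is a placeholder) downloads the model on first use. How the returned index maps to an author id depends on the label mapping stored in the model config, which this card does not describe.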
+ #### Limitations and bias
+
+ The model is limited to the C++ programming language.
+
+ ## Training data
+
+ The model was initialized from the [`ai-soco-c++-roberta-tiny`](https://github.com/huggingface/transformers/blob/master/model_cards/aliosm/ai-soco-c++-roberta-tiny) model and trained on the [AI-SOCO](https://sites.google.com/view/ai-soco-2020) dataset for text classification.
+
+ ## Training procedure
+
+ The model was trained on Google Colab using a V100 GPU for 10 epochs with a batch size of 32 and a maximum sequence length of 512 (longer sequences were truncated). Every run of 4 consecutive spaces was converted to a single tab character (`\t`) before tokenization.
+
+ ## Eval results
+
+ The model achieved 87.66%/87.46% accuracy on the AI-SOCO task and ranked 9th.
+
+ ### BibTeX entry and citation info
+
+ ```bibtex
+ @inproceedings{ai-soco-2020-fire,
+ title = "Overview of the {PAN@FIRE} 2020 Task on {Authorship Identification of SOurce COde (AI-SOCO)}",
+ author = "Fadel, Ali and Musleh, Husam and Tuffaha, Ibraheem and Al-Ayyoub, Mahmoud and Jararweh, Yaser and Benkhelifa, Elhadj and Rosso, Paolo",
+ booktitle = "Proceedings of The 12th meeting of the Forum for Information Retrieval Evaluation (FIRE 2020)",
+ year = "2020"
+ }
+ ```
+
+ <a href="https://huggingface.co/exbert/?model=aliosm/ai-soco-c++-roberta-tiny-clas">
+ <img width="300px" src="https://cdn-media.huggingface.co/exbert/button.png">
+ </a>