---
language:
  - java
  - code
license: apache-2.0
datasets:
- code_search_net
widget:
  - text: 'public <mask> isOdd(Integer num){if (num % 2 == 0) {return "even";} else {return "odd";}}'
---
## JavaRoBERTa-Tara 
A RoBERTa model pretrained on Java source code from the code_search_net dataset.

### Training Data
The model was trained on 10,223,695 Java files retrieved from open-source projects on GitHub.

### Training Objective
The model was trained with a masked language modeling (MLM) objective.
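
As a rough illustration of what an MLM objective asks of a model, the sketch below hides a random subset of tokens in a Java snippet and records the originals the model would have to recover. It is a simplification under stated assumptions: it splits on whitespace instead of using the model's subword tokenizer, and the `mask_tokens` helper, the masking rate, and the seed are hypothetical, not taken from this model's training setup.

```python
import random

def mask_tokens(code, mask_prob=0.15, mask_token="<mask>", seed=0):
    """Replace a random subset of whitespace-separated tokens with mask_token.

    Returns the masked string and a dict mapping each masked position
    to the original token at that position.
    """
    rng = random.Random(seed)  # fixed seed keeps the example reproducible
    tokens = code.split()
    labels = {}
    for i, tok in enumerate(tokens):
        if rng.random() < mask_prob:
            labels[i] = tok          # what the model should predict
            tokens[i] = mask_token   # what the model actually sees
    return " ".join(tokens), labels

# Assumed example snippet and masking rate, for illustration only.
masked, labels = mask_tokens(
    "public int add(int a, int b) { return a + b; }", mask_prob=0.3
)
print(masked)
print(labels)
```

During training, the loss is computed only on the masked positions, so the model has to infer each hidden token from the surrounding code.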

### Usage
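```python
from transformers import pipeline

pipe = pipeline('fill-mask', model='emre/java-RoBERTa-Tara-small')

# Replace with your own Java code; use '<mask>' to mask tokens/words in the code.
CODE = 'public <mask> isOdd(Integer num){if (num % 2 == 0) {return "even";} else {return "odd";}}'
output = pipe(CODE)
```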
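
For an input with a single `<mask>`, the `fill-mask` pipeline returns a list of dictionaries, each with `score`, `token`, `token_str`, and `sequence` keys. The sketch below shows one way to pull out the top candidates. The `top_predictions` helper is hypothetical, and the sample entries are placeholder values used to show the shape of the data, not actual predictions from this model.

```python
def top_predictions(results, k=3):
    """Return the k highest-scoring (token, score) pairs from fill-mask output.

    Assumes `results` is the list of dicts returned for a single-mask input.
    """
    ranked = sorted(results, key=lambda r: r["score"], reverse=True)
    return [(r["token_str"].strip(), round(r["score"], 4)) for r in ranked[:k]]

# Placeholder entries (NOT real model output) with the same keys the pipeline uses.
sample = [
    {"score": 0.25, "token": 0, "token_str": " int", "sequence": "public int isOdd(...)"},
    {"score": 0.60, "token": 1, "token_str": " String", "sequence": "public String isOdd(...)"},
    {"score": 0.05, "token": 2, "token_str": " void", "sequence": "public void isOdd(...)"},
]

for token, score in top_predictions(sample, k=2):
    print(f"{token}\t{score}")
```

With the real pipeline, pass `output` from the snippet above in place of `sample`. Inputs containing more than one `<mask>` return a nested list and would need one call per mask position.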

### Why Tara?
Tara is the name of my little baby girl :)