Add pipeline usage to the model card
README.md CHANGED
@@ -7,7 +7,29 @@ This is a codebert model for detecting Python clone codes, fine-tuned on the dat
 # How to use

-To use the model, you can follow the
+To use the model, you can follow the original inference source code at https://github.com/sangHa0411/CloneDetection/blob/main/inference.py
+
+More conveniently, a pipeline has been implemented for this model, and you can initialize it in two lines of code:
+```python
+from transformers import pipeline
+
+pipe = pipeline(model="Lazyhope/python-clone-detection", trust_remote_code=True)
+```
+To use it, pass the two code snippets to compare as a tuple:
+```python
+code1 = """def token_to_inputs(feature):
+    inputs = {}
+    for k, v in feature.items():
+        inputs[k] = torch.tensor(v).unsqueeze(0)
+
+    return inputs"""
+code2 = """def f(feature):
+    return {k: torch.tensor(v).unsqueeze(0) for k, v in feature.items()}"""
+
+is_clone = pipe((code1, code2))
+is_clone
+# {False: 1.3705984201806132e-05, True: 0.9999862909317017}
+```
 
 # Credits
 
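The pipeline call added in this change returns a dictionary that maps `False` and `True` to probabilities. As a rough sketch of how a caller might turn that dictionary into a yes/no decision, the helper below applies a cutoff to the `True` probability. The name `to_verdict` and the `threshold` parameter are hypothetical and not part of the model repository. The sketch assumes the output always has the `{False: p, True: p}` shape shown in the README example, and it does not load the model.

```python
from typing import Dict, Tuple


def to_verdict(scores: Dict[bool, float], threshold: float = 0.5) -> Tuple[bool, float]:
    """Return (is_clone, clone_probability) from a {False: p, True: p} dict.

    Hypothetical helper: assumes the dict shape shown in the README example.
    """
    p_clone = scores[True]
    return p_clone >= threshold, p_clone


# Scores copied from the example output in the README change above.
example_scores = {False: 1.3705984201806132e-05, True: 0.9999862909317017}

is_clone, p_clone = to_verdict(example_scores)
print(f"clone={is_clone}, probability={p_clone}")
```

A stricter cutoff, for example `to_verdict(scores, threshold=0.9)`, trades recall for precision. Which value is suitable depends on the use case and is not stated in the model card.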