Commit d862046
Parent(s): 106ae52
Update README.md
README.md CHANGED
@@ -9,10 +9,10 @@ datasets:
 - argilla/alpaca-gigo-detector
 ---
 
-# 😵‍💫🦙 Alpaca
+# 😵‍💫🦙 Alpaca GarbageCollector
 
 A cross-lingual SetFit model to **detect bad instructions from Alpaca Datasets** and potentially other instruction-following datasets.
-`
+`GarbageCollector` can greatly speed up the validation of Alpaca Datasets across many languages, flagging examples that need to be fixed or simply discarded.
 
 
 <div style="text-align:center">
@@ -33,7 +33,7 @@ from setfit import SetFitModel
 
 # Download from Hub
 model = SetFitModel.from_pretrained(
-"argilla/alpaca-
+"argilla/alpaca-garbage-collector-multilingual"
 )
 
 text = """
@@ -88,7 +88,7 @@ Load the model:
 from setfit import SetFitModel
 
 # Download from Hub
-model = SetFitModel.from_pretrained("argilla/alpaca-
+model = SetFitModel.from_pretrained("argilla/alpaca-garbage-collector-multilingual")
 ```
 
 Perform inference and prediction col to your dataset: