Update README.md
README.md
CHANGED
@@ -7,11 +7,11 @@ datasets:
 
 **\*\*\*\*\*\*Update\*\*\*\*\*\***
 
-This update introduces a new set of
-For benchmarking purposes please use the existing
+This update introduces a new set of SPECTER2 models with the base transformer encoder pre-trained on an extended citation dataset containing more recent papers.
+For benchmarking purposes please use the existing SPECTER2 [models](https://huggingface.co/allenai/specter2) w/o the **aug2023refresh** suffix.
 
-#
-
+# SPECTER2 (Base)
+SPECTER2 is the successor to [SPECTER](https://huggingface.co/allenai/specter) and is capable of generating task specific embeddings for scientific tasks when paired with [adapters](https://huggingface.co/models?search=allenai/specter-2_).
 This is the base model to be used along with the adapters.
 Given the combination of title and abstract of a scientific paper or a short textual query, the model can be used to generate effective embeddings to be used in downstream applications.
 
@@ -23,7 +23,7 @@ Given the combination of title and abstract of a scientific paper or a short tex
 
 ## Model Description
 
-
+SPECTER2 has been trained on over 6M triplets of scientific paper citations, which are available [here](https://huggingface.co/datasets/allenai/scirepeval/viewer/cite_prediction_new/evaluation).
 Post that it is trained with additionally attached task format specific adapter modules on all the [SciRepEval](https://huggingface.co/datasets/allenai/scirepeval) training tasks.
 
 Task Formats trained on:
@@ -47,9 +47,9 @@ It builds on the work done in [SciRepEval: A Multi-Format Benchmark for Scientif
 
 <!-- Provide the basic links for the model. -->
 
-- **Repository:** [https://github.com/allenai/
+- **Repository:** [https://github.com/allenai/SPECTER2](https://github.com/allenai/SPECTER2)
 - **Paper:** [https://api.semanticscholar.org/CorpusID:254018137](https://api.semanticscholar.org/CorpusID:254018137)
-- **Demo:** [Usage](https://github.com/allenai/
+- **Demo:** [Usage](https://github.com/allenai/SPECTER2/blob/main/README.md)
 
 # Uses
 
@@ -141,9 +141,9 @@ We also evaluate and establish a new SoTA on [MDCR](https://github.com/zoranmedi
 |[SPECTER](https://huggingface.co/allenai/specter)|54.7|57.4|68.0|(30.6, 25.5)|
 |[SciNCL](https://huggingface.co/malteos/scincl)|55.6|57.8|69.0|(32.6, 27.3)|
 |[SciRepEval-Adapters](https://huggingface.co/models?search=scirepeval)|61.9|59.0|70.9|(35.3, 29.6)|
-|[
+|[SPECTER2-Adapters](https://huggingface.co/models?search=allenai/specter-2)|**62.3**|**59.2**|**71.2**|**(38.4, 33.0)**|
 
-Please cite the following works if you end up using
+Please cite the following works if you end up using SPECTER2:
 
 [SPECTER paper](https://api.semanticscholar.org/CorpusID:215768677):
 
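The README text in this diff says the base model produces embeddings from a paper's title and abstract (or a short query). A minimal sketch of that flow follows, assuming the checkpoint loads with the standard Hugging Face `transformers` classes and that the first-token hidden state is taken as the embedding. The helper names `build_inputs` and `embed` are hypothetical, the model ID is left to the caller because the diff does not state it, and attaching the task-specific adapters is omitted.

```python
from typing import Dict, List, Optional


def build_inputs(papers: List[Dict[str, Optional[str]]], sep_token: str) -> List[str]:
    """Join each paper's title and abstract with the tokenizer's separator token.

    A missing or empty abstract (e.g. a short query) contributes an empty string.
    """
    return [p["title"] + sep_token + (p.get("abstract") or "") for p in papers]


def embed(papers: List[Dict[str, Optional[str]]], model_id: str):
    """Return one embedding per paper as a tensor of shape (len(papers), hidden_size).

    Assumption: `model_id` names a checkpoint loadable by AutoModel/AutoTokenizer.
    Only the base encoder is used here; no adapter is attached.
    """
    # Imported lazily so build_inputs stays usable without these packages.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    model.eval()

    batch = tokenizer(
        build_inputs(papers, tokenizer.sep_token),
        padding=True,
        truncation=True,
        max_length=512,
        return_tensors="pt",
    )
    with torch.no_grad():
        output = model(**batch)
    # Assumption: the first token's final hidden state serves as the embedding.
    return output.last_hidden_state[:, 0, :]
```

Calling `embed(papers, model_id)` downloads the checkpoint on first use; `build_inputs` can be checked on its own without any model.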