allenai
/

specter2

Adapters

bert

Model card Files Files and versions Community

aps6992 commited on Aug 15, 2023

Commit

4952134

•

1 Parent(s): b6edb72

Update README.md

Browse files

Files changed (1) hide show

README.md +25 -7

README.md CHANGED Viewed

@@ -41,21 +41,38 @@ adapter_name = model.load_adapter("allenai/specter2", source="hf", set_active=Tr
 <!-- Provide a quick summary of what the model is/does. -->
-SPECTER 2.0 is the successor to [SPECTER](allenai/specter) and is capable of generating task specific embeddings for scientific tasks when paired with [adapters](https://huggingface.co/models?search=allenai/specter-2).
-This is the proximity adapter and should be used for all general embedding purposes.
 Given the combination of title and abstract of a scientific paper or a short texual query, the model can be used to generate effective embeddings to be used in downstream applications.
 # Model Details
 ## Model Description
 SPECTER 2.0 has been trained on over 6M triplets of scientific paper citations, which are available [here](https://huggingface.co/datasets/allenai/scirepeval/viewer/cite_prediction_new/evaluation).
-Post that it is trained on all the [SciRepEval](https://huggingface.co/datasets/allenai/scirepeval) training tasks, with task format specific adapters.
 Task Formats trained on:
 - Classification
 - Regression
-- Proximity
 - Adhoc Search
 This is a retrieval specific adapter. For tasks where given a paper query, other relevant papers have to be retrieved from a corpus, use this adapter to generate the embeddings.
@@ -87,12 +104,13 @@ It builds on the work done in [SciRepEval: A Multi-Format Benchmark for Scientif
 |Model|Name and HF link|Description|
 |--|--|--|
-|Retrieval*|[allenai/specter2_proximity](https://huggingface.co/allenai/specter2)|Encode papers as queries and candidates eg. Link Prediction, Nearest Neighbor Search|
-|Adhoc Query|[allenai/specter2_adhoc_query](https://huggingface.co/allenai/specter2_adhoc_query)|Encode short raw text queries for search tasks. (Candidate papers can be encoded with proximity)|
 |Classification|[allenai/specter2_classification](https://huggingface.co/allenai/specter2_classification)|Encode papers to feed into linear classifiers as features|
 |Regression|[allenai/specter2_regression](https://huggingface.co/allenai/specter2_regression)|Encode papers to feed into linear regressors as features|
-*Retrieval model should suffice for downstream task types not mentioned above
 ```python
 from transformers import AutoTokenizer, AutoModel

 <!-- Provide a quick summary of what the model is/does. -->
+**Aug 2023 Update:**
+1. The SPECTER 2.0 Base and proximity adapter models have been renamed in Hugging Face based upon usage patterns as follows:
+|Old Name|New Name|
+|--|--|
+|allenai/specter2|[allenai/specter2_base](https://huggingface.co/allenai/specter2_base)|
+|allenai/specter2_proximity|[allenai/specter2](https://huggingface.co/allenai/specter2)|
+2. We have a parallel version (termed [aug2023refresh](https://huggingface.co/allenai/specter2_aug2023refresh)) where the base transformer encoder version is pre-trained on a collection of newer papers (published after 2018).
+   However, for benchmarking purposes, please continue using the current version.
+SPECTER 2.0 is the successor to [SPECTER](https://huggingface.co/allenai/specter) and is capable of generating task specific embeddings for scientific tasks when paired with [adapters](https://huggingface.co/models?search=allenai/specter-2_).
+This is the base model to be used along with the adapters.
 Given the combination of title and abstract of a scientific paper or a short texual query, the model can be used to generate effective embeddings to be used in downstream applications.
+**Note:For general embedding purposes, please use [allenai/specter2](https://huggingface.co/allenai/specter2).**
+**To get the best performance on a downstream task type please load the associated adapter with the base model as in the example below.**
 # Model Details
 ## Model Description
 SPECTER 2.0 has been trained on over 6M triplets of scientific paper citations, which are available [here](https://huggingface.co/datasets/allenai/scirepeval/viewer/cite_prediction_new/evaluation).
+Post that it is trained with additionally attached task format specific adapter modules on all the [SciRepEval](https://huggingface.co/datasets/allenai/scirepeval) training tasks.
 Task Formats trained on:
 - Classification
 - Regression
+- Proximity (Retrieval)
 - Adhoc Search
 This is a retrieval specific adapter. For tasks where given a paper query, other relevant papers have to be retrieved from a corpus, use this adapter to generate the embeddings.
 |Model|Name and HF link|Description|
 |--|--|--|
+|Proximity*|[allenai/specter2](https://huggingface.co/allenai/specter2)|Encode papers as queries and candidates eg. Link Prediction, Nearest Neighbor Search|
+|Adhoc Query|[allenai/specter2_adhoc_query](https://huggingface.co/allenai/specter2_adhoc_query)|Encode short raw text queries for search tasks. (Candidate papers can be encoded with the proximity adapter)|
 |Classification|[allenai/specter2_classification](https://huggingface.co/allenai/specter2_classification)|Encode papers to feed into linear classifiers as features|
 |Regression|[allenai/specter2_regression](https://huggingface.co/allenai/specter2_regression)|Encode papers to feed into linear regressors as features|
+*Proximity model should suffice for downstream task types not mentioned above
 ```python
 from transformers import AutoTokenizer, AutoModel