allenai
/

specter2_regression

Adapters

bert

Model card Files Files and versions Community

aps6992 commited on Aug 15, 2023

Commit

c8176e1

•

1 Parent(s): 6401e6b

Update README.md

Browse files

Files changed (1) hide show

README.md +24 -6

README.md CHANGED Viewed

@@ -34,20 +34,38 @@ adapter_name = model.load_adapter("allenai/specter2_regression", source="hf", se
 <!-- Provide a quick summary of what the model is/does. -->
-SPECTER 2.0 is the successor to [SPECTER](allenai/specter) and is capable of generating task specific embeddings for scientific tasks when paired with [adapters](https://huggingface.co/models?search=allenai/specter-2).
 Given the combination of title and abstract of a scientific paper or a short texual query, the model can be used to generate effective embeddings to be used in downstream applications.
 # Model Details
 ## Model Description
 SPECTER 2.0 has been trained on over 6M triplets of scientific paper citations, which are available [here](https://huggingface.co/datasets/allenai/scirepeval/viewer/cite_prediction_new/evaluation).
-Post that it is trained on all the [SciRepEval](https://huggingface.co/datasets/allenai/scirepeval) training tasks, with task format specific adapters.
 Task Formats trained on:
 - Classification
 - Regression
-- Proximity
 - Adhoc Search
 **This is the regression specific adapter. For generating embeddings which can be used as input to downstream regression models like SVRs to generate a continuous value as the result.**
@@ -79,12 +97,12 @@ It builds on the work done in [SciRepEval: A Multi-Format Benchmark for Scientif
 |Model|Name and HF link|Description|
 |--|--|--|
-|Retrieval*|[allenai/specter2_proximity](https://huggingface.co/allenai/specter2)|Encode papers as queries and candidates eg. Link Prediction, Nearest Neighbor Search|
-|Adhoc Query|[allenai/specter2_adhoc_query](https://huggingface.co/allenai/specter2_adhoc_query)|Encode short raw text queries for search tasks. (Candidate papers can be encoded with proximity)|
 |Classification|[allenai/specter2_classification](https://huggingface.co/allenai/specter2_classification)|Encode papers to feed into linear classifiers as features|
 |Regression|[allenai/specter2_regression](https://huggingface.co/allenai/specter2_regression)|Encode papers to feed into linear regressors as features|
-*Retrieval model should suffice for downstream task types not mentioned above
 ```python
 from transformers import AutoTokenizer, AutoModel

 <!-- Provide a quick summary of what the model is/does. -->
+**Aug 2023 Update:**
+1. The SPECTER 2.0 Base and proximity adapter models have been renamed in Hugging Face based upon usage patterns as follows:
+|Old Name|New Name|
+|--|--|
+|allenai/specter2|[allenai/specter2_base](https://huggingface.co/allenai/specter2_base)|
+|allenai/specter2_proximity|[allenai/specter2](https://huggingface.co/allenai/specter2)|
+2. We have a parallel version (termed [aug2023refresh](https://huggingface.co/allenai/specter2_aug2023refresh)) where the base transformer encoder version is pre-trained on a collection of newer papers (published after 2018).
+   However, for benchmarking purposes, please continue using the current version.
+SPECTER 2.0 is the successor to [SPECTER](https://huggingface.co/allenai/specter) and is capable of generating task specific embeddings for scientific tasks when paired with [adapters](https://huggingface.co/models?search=allenai/specter-2_).
+This is the base model to be used along with the adapters.
 Given the combination of title and abstract of a scientific paper or a short texual query, the model can be used to generate effective embeddings to be used in downstream applications.
+**Note:For general embedding purposes, please use [allenai/specter2](https://huggingface.co/allenai/specter2).**
+**To get the best performance on a downstream task type please load the associated adapter with the base model as in the example below.**
 # Model Details
 ## Model Description
 SPECTER 2.0 has been trained on over 6M triplets of scientific paper citations, which are available [here](https://huggingface.co/datasets/allenai/scirepeval/viewer/cite_prediction_new/evaluation).
+Post that it is trained with additionally attached task format specific adapter modules on all the [SciRepEval](https://huggingface.co/datasets/allenai/scirepeval) training tasks.
 Task Formats trained on:
 - Classification
 - Regression
+- Proximity (Retrieval)
 - Adhoc Search
 **This is the regression specific adapter. For generating embeddings which can be used as input to downstream regression models like SVRs to generate a continuous value as the result.**
 |Model|Name and HF link|Description|
 |--|--|--|
+|Proximity*|[allenai/specter2](https://huggingface.co/allenai/specter2)|Encode papers as queries and candidates eg. Link Prediction, Nearest Neighbor Search|
+|Adhoc Query|[allenai/specter2_adhoc_query](https://huggingface.co/allenai/specter2_adhoc_query)|Encode short raw text queries for search tasks. (Candidate papers can be encoded with the proximity adapter)|
 |Classification|[allenai/specter2_classification](https://huggingface.co/allenai/specter2_classification)|Encode papers to feed into linear classifiers as features|
 |Regression|[allenai/specter2_regression](https://huggingface.co/allenai/specter2_regression)|Encode papers to feed into linear regressors as features|
+*Proximity model should suffice for downstream task types not mentioned above
 ```python
 from transformers import AutoTokenizer, AutoModel