TownsWu committed on
Commit f103047
1 Parent(s): fb77c59

Update README.md

Files changed (1)
  1. README.md +30 -11
README.md CHANGED
@@ -1,6 +1,14 @@
---
language:
- zh
+ pipeline_tag: sentence-similarity
+ tags:
+ - PEG
+ - feature-extraction
+ - sentence-similarity
+ - transformers
+ license: apache-2.0
+ library_name: transformers
---
# Model Card for Model ID

@@ -10,21 +18,32 @@ This modelcard aims to be a base template for new models. It has been generated

## Model Details
We propose the PEG model (a Progressively Learned Textual Embedding), which progressively adjusts the weights of samples contributing to the loss within an extremely large batch, based on the difficulty levels of negative samples.
- We have collected a large-scale retrieval training dataset, consisting of 110 million queries, where each query is paired with one positive sample and five carefully selected hard negatives.
- ### Model Sources [optional]
-
- <!-- Provide the basic links for the model. -->
-
- - **Repository:** [More Information Needed]
- - **Paper [optional]:** [More Information Needed]
- - **Demo [optional]:** [More Information Needed]
-
- ## Uses
-
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+ We have amassed an extensive collection of over 110 million training examples, spanning a wide range of fields such as general knowledge, finance, tourism, medicine, and more.
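+
+ To give a rough intuition for this objective, the sketch below re-weights the negatives in a contrastive loss by their difficulty (similarity to the query). This is an illustrative simplification, not the exact PEG formulation; the function and its `alpha`/`tau` parameters are hypothetical:
+
+ ```python
+ import torch
+ import torch.nn.functional as F
+
+ def weighted_contrastive_loss(q, pos, negs, tau=0.05, alpha=1.0):
+     """Illustrative only -- not the exact PEG objective.
+     q: (d,) query; pos: (d,) positive; negs: (k, d) hard negatives."""
+     q, pos, negs = (F.normalize(t, dim=-1) for t in (q, pos, negs))
+     pos_sim = q @ pos        # similarity to the positive
+     neg_sims = negs @ q      # (k,) similarities to the negatives
+     # Hypothetical difficulty weights: harder negatives (higher similarity
+     # to the query) contribute more to the loss.
+     weights = torch.softmax(alpha * neg_sims, dim=0).detach()
+     logits = torch.cat([pos_sim[None], neg_sims]) / tau
+     denom = logits[0].exp() + negs.size(0) * (weights * logits[1:].exp()).sum()
+     return -(logits[0] - denom.log())
+ ```
+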
+ ## Usage (HuggingFace Transformers)
+
+ Install transformers:
+ ```bash
+ pip install transformers
+ ```
+
+ Then load the model and compute embeddings:
+ ```python
+ from transformers import AutoModel, AutoTokenizer
+ import torch
+
+ # Load the model and tokenizer from the Hugging Face Hub
+ tokenizer = AutoTokenizer.from_pretrained('TownsWu/PEG')
+ model = AutoModel.from_pretrained('TownsWu/PEG')
+
+ # Example queries (Chinese): "How do I change the bank card linked to Huabei?"
+ # and "Change the bank card linked to Huabei"
+ sentences = ['如何更换花呗绑定银行卡', '花呗更改绑定银行卡']
+
+ # Tokenize sentences
+ inputs = tokenizer(sentences, padding=True, truncation=True, return_tensors='pt')
+
+ # Compute sentence embeddings: take the last hidden state of the [CLS] token
+ with torch.no_grad():
+     last_hidden_state = model(**inputs, return_dict=True).last_hidden_state
+ embeddings = last_hidden_state[:, 0]
+ print("embeddings:")
+ print(embeddings)
+ ```
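+
+ The two embeddings can then be compared with cosine similarity to score how semantically close the sentences are (a minimal follow-up to the snippet above):
+
+ ```python
+ import torch.nn.functional as F
+
+ # Cosine similarity between the two sentence embeddings computed above
+ score = F.cosine_similarity(embeddings[0], embeddings[1], dim=0)
+ print(f"cosine similarity: {score.item():.4f}")
+ ```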