Edit model card

SentenceTransformer based on sentence-transformers/paraphrase-multilingual-mpnet-base-v2

This is a sentence-transformers model finetuned from sentence-transformers/paraphrase-multilingual-mpnet-base-v2 on the sscc, sscd, apnc and apnd datasets. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 128, 'do_lower_case': False}) with Transformer model: XLMRobertaModel 
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("Anakeen/datasets_ssc_apn_nlpaug-s_sts_1")
# Run inference
sentences = [
    'The liability of the Reinsured to indemnify Capricorn (Isle of Man) Limited who in turn offer a discretionary deed of indemnity to Capricorn Mutual Limited ABN 24 104 601 194 for the protections set out below.',
    'The liability of the Reinsured to compensate Capricorn the goat (Islet of Human being) Limited world health organization in turn offer a discretional human activity of indemnity to Goat Mutual Limited ABN two dozen 104 601 194 for the protective cover set come out of the closet below.',
    '" Non - Public Personally Identifiable Info " shall be define as this full term operating room a similar term make up define in any applicable land; provincial; territorial dominion; surgery union law. Disclosing or use this data for any purpose non authorize by applicable police force be expressly forbidden without the anterior consent of the Caller.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]

Training Details

Training Datasets

sscc

  • Dataset: sscc
  • Size: 4,732 training samples
  • Columns: sentence and label
  • Approximate statistics based on the first 1000 samples:
    sentence label
    type string int
    details
    • min: 14 tokens
    • mean: 109.09 tokens
    • max: 128 tokens
    • 0: ~1.50%
    • 1: ~0.10%
    • 2: ~0.20%
    • 3: ~0.30%
    • 4: ~0.30%
    • 5: ~0.70%
    • 6: ~0.20%
    • 7: ~0.10%
    • 8: ~0.10%
    • 9: ~0.10%
    • 10: ~0.10%
    • 11: ~0.10%
    • 12: ~0.20%
    • 13: ~0.70%
    • 14: ~0.20%
    • 15: ~0.20%
    • 16: ~0.80%
    • 17: ~0.50%
    • 18: ~0.50%
    • 19: ~0.60%
    • 20: ~8.00%
    • 21: ~0.40%
    • 22: ~0.90%
    • 23: ~0.40%
    • 24: ~0.40%
    • 25: ~0.90%
    • 26: ~0.10%
    • 27: ~0.10%
    • 28: ~0.60%
    • 29: ~0.30%
    • 30: ~0.50%
    • 31: ~0.20%
    • 32: ~0.30%
    • 33: ~0.10%
    • 34: ~0.40%
    • 35: ~0.10%
    • 36: ~0.10%
    • 37: ~0.50%
    • 38: ~0.70%
    • 39: ~0.10%
    • 40: ~0.20%
    • 41: ~0.20%
    • 42: ~0.70%
    • 43: ~0.10%
    • 44: ~0.60%
    • 45: ~0.30%
    • 46: ~0.80%
    • 47: ~3.90%
    • 48: ~0.30%
    • 49: ~0.10%
    • 50: ~0.40%
    • 51: ~0.60%
    • 52: ~0.50%
    • 53: ~1.70%
    • 54: ~0.10%
    • 55: ~0.30%
    • 56: ~0.20%
    • 57: ~0.60%
    • 58: ~0.10%
    • 59: ~0.10%
    • 60: ~0.20%
    • 61: ~0.30%
    • 62: ~0.30%
    • 63: ~0.50%
    • 64: ~0.20%
    • 65: ~0.30%
    • 66: ~0.20%
    • 67: ~0.20%
    • 68: ~0.50%
    • 69: ~0.20%
    • 70: ~0.20%
    • 71: ~0.10%
    • 72: ~0.10%
    • 73: ~3.30%
    • 74: ~0.70%
    • 75: ~0.60%
    • 76: ~0.20%
    • 77: ~0.40%
    • 78: ~0.30%
    • 79: ~3.50%
    • 80: ~0.50%
    • 81: ~0.40%
    • 82: ~0.10%
    • 83: ~0.20%
    • 84: ~0.50%
    • 85: ~1.80%
    • 86: ~0.30%
    • 87: ~2.70%
    • 88: ~0.20%
    • 89: ~2.90%
    • 90: ~0.30%
    • 91: ~1.60%
    • 92: ~0.10%
    • 93: ~0.50%
    • 94: ~0.40%
    • 95: ~0.10%
    • 96: ~0.60%
    • 97: ~0.40%
    • 98: ~0.10%
    • 99: ~0.20%
    • 100: ~0.10%
    • 101: ~0.30%
    • 102: ~0.30%
    • 103: ~0.80%
    • 104: ~0.10%
    • 105: ~0.20%
    • 106: ~0.50%
    • 107: ~3.60%
    • 108: ~0.10%
    • 109: ~0.10%
    • 110: ~0.20%
    • 111: ~0.40%
    • 112: ~0.20%
    • 113: ~0.20%
    • 114: ~0.10%
    • 115: ~0.10%
    • 116: ~0.10%
    • 117: ~0.20%
    • 118: ~0.40%
    • 119: ~0.10%
    • 120: ~0.10%
    • 121: ~0.10%
    • 122: ~0.10%
    • 123: ~0.10%
    • 124: ~0.20%
    • 125: ~0.10%
    • 127: ~0.10%
    • 128: ~0.30%
    • 129: ~0.30%
    • 130: ~1.20%
    • 131: ~0.10%
    • 132: ~0.20%
    • 133: ~0.10%
    • 134: ~0.20%
    • 135: ~2.10%
    • 136: ~0.30%
    • 137: ~0.50%
    • 138: ~0.40%
    • 139: ~0.70%
    • 140: ~0.10%
    • 141: ~0.30%
    • 142: ~1.00%
    • 143: ~0.20%
    • 144: ~0.60%
    • 145: ~0.10%
    • 146: ~0.30%
    • 147: ~0.10%
    • 148: ~0.10%
    • 149: ~0.20%
    • 150: ~0.70%
    • 151: ~0.10%
    • 152: ~0.30%
    • 153: ~0.10%
    • 154: ~0.20%
    • 155: ~0.10%
    • 156: ~0.10%
    • 157: ~0.10%
    • 158: ~0.10%
    • 159: ~0.30%
    • 160: ~0.20%
    • 161: ~3.70%
    • 162: ~0.10%
    • 163: ~0.10%
    • 164: ~0.10%
    • 165: ~0.20%
    • 166: ~0.20%
    • 167: ~0.10%
    • 168: ~0.20%
    • 169: ~0.20%
    • 170: ~0.20%
    • 171: ~0.50%
    • 172: ~0.10%
    • 173: ~0.10%
    • 174: ~0.30%
    • 175: ~0.80%
    • 176: ~0.80%
    • 177: ~0.50%
    • 178: ~0.40%
    • 179: ~0.30%
    • 180: ~0.30%
    • 181: ~0.20%
    • 182: ~0.10%
    • 183: ~0.10%
    • 184: ~0.10%
    • 185: ~0.10%
    • 186: ~0.10%
    • 187: ~1.00%
    • 188: ~0.10%
    • 189: ~0.20%
    • 190: ~0.40%
    • 191: ~0.10%
    • 192: ~0.20%
    • 193: ~0.10%
    • 194: ~0.10%
    • 195: ~0.50%
    • 196: ~0.10%
    • 197: ~0.50%
    • 198: ~0.10%
    • 199: ~0.20%
    • 200: ~0.20%
    • 201: ~0.10%
    • 202: ~0.10%
    • 203: ~0.20%
    • 204: ~0.10%
    • 205: ~0.10%
    • 206: ~0.10%
    • 207: ~0.10%
    • 208: ~1.10%
    • 209: ~0.10%
    • 210: ~0.20%
    • 211: ~0.10%
    • 212: ~0.10%
    • 213: ~0.10%
    • 214: ~0.10%
    • 215: ~0.10%
    • 216: ~0.30%
    • 217: ~0.30%
    • 218: ~0.10%
    • 219: ~0.10%
    • 220: ~0.10%
    • 221: ~0.10%
    • 222: ~0.10%
    • 223: ~0.10%
    • 224: ~0.10%
    • 225: ~0.20%
    • 226: ~0.10%
    • 227: ~0.10%
    • 228: ~0.10%
    • 229: ~0.10%
    • 230: ~0.10%
    • 231: ~0.10%
    • 232: ~0.10%
    • 233: ~0.10%
    • 234: ~0.10%
  • Samples:
    sentence label
    1. This Contract does not cover any loss or liability accruing to the Reassured; directly or indirectly and whether as Insurer or Reinsurer; from any Pool of Insurers or Reinsurers formed for the purpose of covering Atomic or Nuclear Energy risks.
    2. Without in any way restricting the operation of paragraph (1) of this Clause; this Contract does not cover any loss or liability accruing to the Reassured; directly or indirectly and whether as Insurer or Reinsurer; from any insurance against Physical Damage (including business interruption or consequential loss arising out of such Physical Damage) to:
    I. Nuclear reactor power plants including all auxiliary property on the site; or
    II. Any other nuclear reactor installation; including laboratories handling radioactive materials in connection with reactor installations; and “critical facilities” as such; or
    III. Installations for fabricating complete fuel elements or for processing substantial quantities of “special nuclear material;” and for reprocessing; salvaging; chemically separating; storing or disposing of “spent” nuclear fuel or waste materials; or
    IV. Installations other than those listed in paragraph (2) III above using substantial quantities of radioactive isotopes or other products of nuclear fission.
    3. Without in any way restricting the operations of paragraphs (1) and (2) hereof; this Contract does not cover any loss or liability by radioactive contamination accruing to the Reassured; directly or indirectly; and whether as Insurer or Reinsurer; from any insurance on property which is on the same site as a nuclear reactor power plant or other nuclear installation and which normally would be insured therewith except that this paragraph (3) shall not operate
    (a) where the Reassured does not have knowledge of such nuclear reactor power plant or nuclear installation; or
    (b) where said insurance contains a provision excluding coverage for damage to property caused by or resulting from radioactive contamination; however caused. However on and after 1st January 1960 this sub-paragraph (b) shall only apply provided the said radioactive contamination exclusion provision has been approved by the Governmental Authority having jurisdiction thereof.
    4. Without in any way restricting the operations of paragraphs (1); (2) and (3) hereof; this Contract does not cover any loss or liability by radioactive contamination accruing to the Reassured; directly or indirectly; and whether as Insurer or Reinsurer; when such radioactive contamination is a named hazard specifically insured against.
    5. It is understood and agreed that this Clause shall not extend to risks using radioactive isotopes in any form where the nuclear exposure is not considered by the Reassured to be the primary hazard.
    6. The term “special nuclear material” shall have the meaning given it in the Atomic Energy Act of 1954 or by any law amendatory thereof.
    7. The Reassured to be sole judge of what constitutes:
    (a) substantial quantities; and
    (b) the extent of installation; plant or site.
    Note. Without in any way restricting the operation of paragraph (1) hereof; it is understood and agreed that
    (a) all policies issued by the Reassured on or before 31st December 1957 shall be free from the application of the other provisions of this Clause until expiry date or 31st December 1960 whichever first occurs whereupon all the provisions of this Clause shall apply;
    (b) with respect to any risk located in Canada policies issued by the Reassured on or before 31st December 1958 shall be free from the application of the other provisions of this Clause until expiry date or 31st December 1960 whichever first occurs whereupon all the provisions of this Clause shall apply.
    0
    Downgrading clause ~ ABR1001 (Amended)

    Reinsurer with an S&P Rating
    Unless otherwise agreed by the Reinsured; the Reinsurer shall at all times during the Period of this Contract maintain an Insurer Financial Strength (IFS) rating from Standard & Poor's Rating Group of 55 Water Street; New York; NY 10041; USA ("S&P") equal to or greater than a rating of A minus as applied by S&P to that Reinsurer.
    1
    Dispute Resolution ~ ABR1004
    Where any dispute or difference between the parties arising out of or in connection with this Contract; including formation and validity and whether arising during or after the period of this Contract; has not been settled through negotiation; both parties agree to try in good faith to settle such dispute by non- binding mediation; before resorting to arbitration in the manner set out below.
    2
  • Loss: BatchAllTripletLoss

sscd

  • Dataset: sscd
  • Size: 6,232 training samples
  • Columns: sentence and label
  • Approximate statistics based on the first 1000 samples:
    sentence label
    type string int
    details
    • min: 3 tokens
    • mean: 78.01 tokens
    • max: 128 tokens
    • 0: ~0.10%
    • 1: ~1.40%
    • 2: ~0.40%
    • 3: ~3.70%
    • 4: ~0.40%
    • 5: ~0.50%
    • 6: ~0.50%
    • 7: ~0.50%
    • 8: ~2.10%
    • 9: ~0.30%
    • 10: ~0.10%
    • 11: ~0.10%
    • 12: ~0.10%
    • 13: ~0.10%
    • 14: ~0.10%
    • 15: ~0.20%
    • 16: ~0.10%
    • 17: ~0.10%
    • 18: ~0.10%
    • 19: ~1.60%
    • 20: ~0.40%
    • 21: ~0.70%
    • 22: ~0.10%
    • 23: ~0.20%
    • 24: ~0.40%
    • 25: ~0.20%
    • 26: ~0.40%
    • 27: ~0.10%
    • 28: ~0.10%
    • 29: ~1.30%
    • 30: ~0.10%
    • 31: ~0.20%
    • 32: ~3.90%
    • 33: ~1.90%
    • 34: ~0.80%
    • 35: ~0.10%
    • 36: ~2.40%
    • 37: ~0.10%
    • 38: ~0.10%
    • 39: ~1.50%
    • 40: ~0.10%
    • 41: ~0.10%
    • 42: ~0.10%
    • 43: ~0.30%
    • 44: ~0.10%
    • 45: ~0.10%
    • 46: ~0.20%
    • 47: ~0.10%
    • 48: ~0.30%
    • 49: ~0.10%
    • 50: ~0.10%
    • 51: ~0.10%
    • 52: ~0.10%
    • 53: ~0.10%
    • 54: ~5.70%
    • 55: ~0.10%
    • 56: ~0.10%
    • 57: ~0.10%
    • 58: ~0.30%
    • 59: ~0.10%
    • 60: ~0.40%
    • 61: ~0.50%
    • 62: ~1.30%
    • 63: ~1.50%
    • 64: ~0.50%
    • 65: ~0.10%
    • 66: ~0.80%
    • 67: ~0.10%
    • 68: ~0.50%
    • 69: ~1.10%
    • 70: ~0.20%
    • 71: ~0.20%
    • 72: ~0.10%
    • 73: ~0.20%
    • 74: ~1.30%
    • 75: ~0.20%
    • 76: ~0.10%
    • 77: ~0.10%
    • 78: ~0.40%
    • 79: ~0.30%
    • 80: ~0.50%
    • 81: ~0.20%
    • 82: ~0.40%
    • 83: ~0.50%
    • 84: ~1.70%
    • 85: ~0.50%
    • 86: ~0.10%
    • 87: ~0.20%
    • 88: ~0.90%
    • 89: ~0.60%
    • 90: ~0.10%
    • 91: ~0.60%
    • 92: ~0.10%
    • 93: ~0.20%
    • 94: ~0.10%
    • 95: ~0.20%
    • 96: ~0.10%
    • 97: ~0.10%
    • 98: ~0.10%
    • 99: ~0.10%
    • 100: ~0.10%
    • 101: ~1.10%
    • 102: ~0.20%
    • 103: ~0.10%
    • 104: ~0.50%
    • 105: ~0.10%
    • 106: ~0.10%
    • 107: ~0.10%
    • 108: ~0.70%
    • 109: ~0.50%
    • 110: ~0.20%
    • 111: ~0.10%
    • 112: ~0.20%
    • 113: ~0.20%
    • 114: ~0.10%
    • 115: ~0.20%
    • 116: ~0.20%
    • 117: ~0.30%
    • 118: ~0.20%
    • 119: ~0.20%
    • 120: ~0.50%
    • 121: ~0.20%
    • 122: ~0.10%
    • 123: ~0.10%
    • 124: ~0.30%
    • 125: ~0.20%
    • 126: ~0.20%
    • 127: ~0.20%
    • 128: ~0.30%
    • 129: ~0.40%
    • 130: ~0.30%
    • 131: ~0.10%
    • 132: ~0.30%
    • 133: ~0.10%
    • 134: ~0.20%
    • 135: ~0.20%
    • 136: ~0.20%
    • 137: ~0.10%
    • 138: ~0.40%
    • 139: ~0.10%
    • 140: ~0.10%
    • 141: ~0.20%
    • 142: ~0.50%
    • 143: ~0.70%
    • 144: ~0.10%
    • 145: ~0.10%
    • 146: ~0.20%
    • 147: ~0.10%
    • 148: ~0.10%
    • 149: ~0.20%
    • 150: ~0.20%
    • 151: ~0.40%
    • 152: ~0.10%
    • 153: ~0.20%
    • 154: ~0.10%
    • 155: ~0.20%
    • 156: ~0.10%
    • 157: ~0.10%
    • 158: ~0.10%
    • 159: ~0.10%
    • 160: ~0.20%
    • 161: ~0.10%
    • 162: ~0.10%
    • 163: ~0.20%
    • 164: ~0.10%
    • 165: ~0.10%
    • 166: ~0.20%
    • 167: ~0.40%
    • 168: ~0.20%
    • 169: ~0.20%
    • 170: ~0.10%
    • 171: ~0.10%
    • 172: ~0.20%
    • 173: ~0.10%
    • 174: ~0.10%
    • 175: ~0.10%
    • 176: ~0.10%
    • 177: ~0.10%
    • 178: ~0.20%
    • 179: ~0.20%
    • 180: ~0.30%
    • 181: ~0.10%
    • 182: ~0.10%
    • 183: ~1.20%
    • 184: ~0.10%
    • 185: ~0.30%
    • 186: ~0.10%
    • 187: ~0.10%
    • 188: ~0.10%
    • 189: ~0.10%
    • 190: ~0.10%
    • 191: ~0.10%
    • 192: ~0.10%
    • 193: ~0.10%
    • 194: ~0.20%
    • 195: ~0.10%
    • 196: ~0.10%
    • 197: ~0.10%
    • 198: ~0.10%
    • 199: ~0.10%
    • 200: ~0.10%
    • 201: ~0.30%
    • 202: ~0.10%
    • 203: ~1.00%
    • 204: ~0.20%
    • 205: ~0.10%
    • 206: ~0.10%
    • 207: ~0.10%
    • 208: ~0.10%
    • 209: ~0.10%
    • 210: ~0.10%
    • 211: ~0.10%
    • 212: ~0.10%
    • 213: ~0.10%
    • 214: ~0.10%
    • 215: ~0.10%
    • 216: ~0.10%
    • 217: ~0.10%
    • 218: ~0.10%
    • 219: ~0.10%
    • 220: ~0.10%
    • 221: ~0.10%
    • 222: ~0.10%
    • 223: ~0.10%
    • 224: ~0.10%
    • 225: ~0.10%
    • 226: ~0.10%
    • 227: ~0.10%
    • 228: ~0.10%
    • 229: ~0.10%
    • 230: ~0.10%
    • 231: ~0.10%
    • 232: ~0.10%
    • 233: ~0.10%
    • 234: ~0.20%
    • 235: ~0.10%
    • 236: ~0.10%
    • 237: ~0.10%
    • 238: ~0.10%
    • 239: ~0.10%
    • 240: ~0.30%
    • 241: ~0.20%
    • 242: ~0.10%
    • 243: ~0.90%
    • 244: ~0.60%
    • 245: ~0.10%
    • 246: ~0.70%
    • 247: ~0.10%
    • 248: ~0.40%
    • 249: ~0.20%
    • 250: ~0.10%
    • 251: ~0.10%
    • 252: ~0.10%
    • 253: ~0.10%
    • 254: ~0.20%
    • 255: ~0.10%
    • 256: ~0.20%
    • 257: ~0.10%
    • 258: ~0.10%
    • 259: ~0.20%
    • 260: ~0.10%
    • 261: ~0.20%
    • 262: ~0.20%
    • 263: ~0.10%
    • 264: ~0.10%
    • 265: ~0.20%
    • 266: ~0.10%
    • 267: ~0.10%
    • 268: ~0.10%
    • 269: ~0.10%
    • 270: ~0.10%
    • 271: ~0.10%
    • 272: ~0.10%
    • 273: ~0.20%
    • 274: ~0.10%
    • 275: ~0.10%
    • 276: ~0.10%
    • 277: ~0.10%
    • 278: ~0.10%
    • 279: ~0.10%
    • 280: ~0.10%
    • 281: ~0.10%
    • 282: ~0.10%
    • 283: ~0.10%
    • 284: ~0.10%
    • 285: ~0.10%
    • 286: ~0.10%
    • 287: ~0.20%
    • 288: ~0.10%
    • 289: ~0.10%
    • 290: ~0.20%
    • 291: ~0.10%
    • 292: ~0.20%
    • 293: ~0.20%
    • 294: ~0.10%
    • 295: ~0.10%
    • 296: ~0.10%
    • 297: ~0.10%
    • 298: ~0.10%
    • 299: ~0.10%
    • 300: ~0.10%
    • 301: ~0.10%
    • 302: ~0.10%
    • 303: ~0.10%
    • 304: ~0.10%
    • 305: ~0.10%
    • 306: ~0.10%
    • 307: ~0.20%
    • 308: ~0.20%
    • 309: ~0.10%
    • 310: ~0.10%
    • 311: ~0.10%
    • 312: ~0.20%
    • 313: ~0.10%
    • 314: ~0.20%
    • 315: ~0.10%
    • 316: ~0.10%
    • 317: ~0.10%
    • 318: ~0.10%
    • 319: ~0.10%
    • 320: ~0.10%
    • 321: ~0.10%
    • 322: ~0.10%
    • 323: ~0.20%
    • 324: ~0.10%
    • 325: ~0.10%
    • 327: ~0.10%
    • 328: ~0.10%
    • 329: ~0.10%
    • 330: ~0.20%
    • 331: ~0.10%
    • 332: ~0.30%
    • 333: ~0.10%
    • 334: ~0.10%
    • 335: ~0.10%
    • 336: ~0.10%
    • 337: ~0.10%
    • 338: ~0.10%
    • 339: ~0.10%
    • 340: ~0.10%
    • 341: ~0.10%
    • 342: ~0.10%
    • 343: ~0.10%
    • 344: ~0.10%
    • 345: ~0.10%
    • 346: ~0.10%
    • 347: ~0.10%
    • 348: ~0.10%
    • 349: ~0.10%
    • 350: ~0.10%
    • 351: ~0.10%
    • 352: ~0.10%
    • 353: ~0.10%
    • 354: ~0.10%
    • 355: ~0.10%
    • 356: ~0.10%
    • 357: ~0.10%
    • 358: ~0.20%
    • 359: ~0.10%
    • 360: ~0.10%
    • 361: ~0.10%
    • 362: ~0.10%
    • 363: ~0.10%
    • 364: ~0.10%
    • 365: ~0.10%
    • 366: ~0.20%
    • 368: ~0.20%
    • 369: ~0.20%
    • 370: ~0.20%
    • 371: ~0.20%
    • 372: ~0.20%
    • 373: ~0.10%
    • 375: ~0.10%
    • 376: ~0.10%
    • 378: ~0.10%
    • 379: ~0.20%
    • 380: ~0.10%
    • 381: ~0.10%
    • 382: ~0.10%
    • 383: ~0.20%
    • 384: ~0.10%
    • 385: ~0.10%
    • 386: ~0.30%
    • 387: ~0.10%
    • 388: ~0.10%
    • 389: ~0.20%
    • 390: ~0.10%
    • 391: ~0.10%
    • 392: ~0.10%
    • 393: ~0.10%
    • 394: ~0.10%
    • 395: ~0.10%
    • 396: ~0.10%
    • 397: ~0.10%
    • 398: ~0.10%
  • Samples:
    sentence label
    “North American CAT Perils” means certain Named Storms and Earthquake; each as defined below; in respect of that portion of losses which occur in the United States and Canada and their possessions and territories; excluding the Territory of Guam; the Territory of American Samoa; the Commonwealth of the Northern Mariana Islands; Wake Island; Johnston Atoll; Palmyra Atoll; and the State of Hawaiiterritory of Guam. 0
    For the purposes of this Paragraph A.; “Named Storm” means any windstorm or windstorm system that has been named by a Reporting Agency at any time in its lifecycle and ensuing losses therefrom. 1
    For the purposes of this Paragraph A.; “Earthquake” means earthquake shake and ensuing losses therefrom. 2
  • Loss: BatchAllTripletLoss

apnc

  • Dataset: apnc
  • Size: 4,732 training samples
  • Columns: anchor, positive, and negative
  • Approximate statistics based on the first 1000 samples:
    anchor positive negative
    type string string string
    details
    • min: 14 tokens
    • mean: 109.09 tokens
    • max: 128 tokens
    • min: 14 tokens
    • mean: 109.09 tokens
    • max: 128 tokens
    • min: 9 tokens
    • mean: 104.08 tokens
    • max: 128 tokens
  • Samples:
    anchor positive negative
    1. This Contract does not cover any loss or liability accruing to the Reassured; directly or indirectly and whether as Insurer or Reinsurer; from any Pool of Insurers or Reinsurers formed for the purpose of covering Atomic or Nuclear Energy risks.
    2. Without in any way restricting the operation of paragraph (1) of this Clause; this Contract does not cover any loss or liability accruing to the Reassured; directly or indirectly and whether as Insurer or Reinsurer; from any insurance against Physical Damage (including business interruption or consequential loss arising out of such Physical Damage) to:
    I. Nuclear reactor power plants including all auxiliary property on the site; or
    II. Any other nuclear reactor installation; including laboratories handling radioactive materials in connection with reactor installations; and “critical facilities” as such; or
    III. Installations for fabricating complete fuel elements or for processing substantial quantities of “special nuclear material;” and for reprocessing; salvaging; chemically separating; storing or disposing of “spent” nuclear fuel or waste materials; or
    IV. Installations other than those listed in paragraph (2) III above using substantial quantities of radioactive isotopes or other products of nuclear fission.
    3. Without in any way restricting the operations of paragraphs (1) and (2) hereof; this Contract does not cover any loss or liability by radioactive contamination accruing to the Reassured; directly or indirectly; and whether as Insurer or Reinsurer; from any insurance on property which is on the same site as a nuclear reactor power plant or other nuclear installation and which normally would be insured therewith except that this paragraph (3) shall not operate
    (a) where the Reassured does not have knowledge of such nuclear reactor power plant or nuclear installation; or
    (b) where said insurance contains a provision excluding coverage for damage to property caused by or resulting from radioactive contamination; however caused. However on and after 1st January 1960 this sub-paragraph (b) shall only apply provided the said radioactive contamination exclusion provision has been approved by the Governmental Authority having jurisdiction thereof.
    4. Without in any way restricting the operations of paragraphs (1); (2) and (3) hereof; this Contract does not cover any loss or liability by radioactive contamination accruing to the Reassured; directly or indirectly; and whether as Insurer or Reinsurer; when such radioactive contamination is a named hazard specifically insured against.
    5. It is understood and agreed that this Clause shall not extend to risks using radioactive isotopes in any form where the nuclear exposure is not considered by the Reassured to be the primary hazard.
    6. The term “special nuclear material” shall have the meaning given it in the Atomic Energy Act of 1954 or by any law amendatory thereof.
    7. The Reassured to be sole judge of what constitutes:
    (a) substantial quantities; and
    (b) the extent of installation; plant or site.
    Note. Without in any way restricting the operation of paragraph (1) hereof; it is understood and agreed that
    (a) all policies issued by the Reassured on or before 31st December 1957 shall be free from the application of the other provisions of this Clause until expiry date or 31st December 1960 whichever first occurs whereupon all the provisions of this Clause shall apply;
    (b) with respect to any risk located in Canada policies issued by the Reassured on or before 31st December 1958 shall be free from the application of the other provisions of this Clause until expiry date or 31st December 1960 whichever first occurs whereupon all the provisions of this Clause shall apply.
    1. This Contract does not cover any loss or liability accruing to the Reassured; directly or indirectly and whether as Insurer or Reinsurer; from any Pool of Insurers or Reinsurers formed for the purpose of covering Atomic or Nuclear Energy risks.
    2. Without in any way restricting the operation of paragraph (1) of this Clause; this Contract does not cover any loss or liability accruing to the Reassured; directly or indirectly and whether as Insurer or Reinsurer; from any insurance against Physical Damage (including business interruption or consequential loss arising out of such Physical Damage) to:
    I. Nuclear reactor power plants including all auxiliary property on the site; or
    II. Any other nuclear reactor installation; including laboratories handling radioactive materials in connection with reactor installations; and “critical facilities” as such; or
    III. Installations for fabricating complete fuel elements or for processing substantial quantities of “special nuclear material;” and for reprocessing; salvaging; chemically separating; storing or disposing of “spent” nuclear fuel or waste materials; or
    IV. Installations other than those listed in paragraph (2) III above using substantial quantities of radioactive isotopes or other products of nuclear fission.
    3. Without in any way restricting the operations of paragraphs (1) and (2) hereof; this Contract does not cover any loss or liability by radioactive contamination accruing to the Reassured; directly or indirectly; and whether as Insurer or Reinsurer; from any insurance on property which is on the same site as a nuclear reactor power plant or other nuclear installation and which normally would be insured therewith except that this paragraph (3) shall not operate
    (a) where the Reassured does not have knowledge of such nuclear reactor power plant or nuclear installation; or
    (b) where said insurance contains a provision excluding coverage for damage to property caused by or resulting from radioactive contamination; however caused. However on and after 1st January 1960 this sub-paragraph (b) shall only apply provided the said radioactive contamination exclusion provision has been approved by the Governmental Authority having jurisdiction thereof.
    4. Without in any way restricting the operations of paragraphs (1); (2) and (3) hereof; this Contract does not cover any loss or liability by radioactive contamination accruing to the Reassured; directly or indirectly; and whether as Insurer or Reinsurer; when such radioactive contamination is a named hazard specifically insured against.
    5. It is understood and agreed that this Clause shall not extend to risks using radioactive isotopes in any form where the nuclear exposure is not considered by the Reassured to be the primary hazard.
    6. The term “special nuclear material” shall have the meaning given it in the Atomic Energy Act of 1954 or by any law amendatory thereof.
    7. The Reassured to be sole judge of what constitutes:
    (a) substantial quantities; and
    (b) the extent of installation; plant or site.
    Note. Without in any way restricting the operation of paragraph (1) hereof; it is understood and agreed that
    (a) all policies issued by the Reassured on or before 31st December 1957 shall be free from the application of the other provisions of this Clause until expiry date or 31st December 1960 whichever first occurs whereupon all the provisions of this Clause shall apply;
    (b) with respect to any risk located in Canada policies issued by the Reassured on or before 31st December 1958 shall be free from the application of the other provisions of this Clause until expiry date or 31st December 1960 whichever first occurs whereupon all the provisions of this Clause shall apply.
    This Contract shall exclude: a) Business defined by the Reinsured as Liability Business (unless included in Cargo or Engineering All Risks/Contractors All Risks Business). b) Space and related risks. c) Marine business; but not applying to pleasure craft. d) Disease losses in respect of Fish Farm. This Contract shall also be subject to the following exclusion clauses: a) War and Civil War Exclusion NMA 464.
    Downgrading clause ~ ABR1001 (Amended)

    Reinsurer with an S&P Rating
    Unless otherwise agreed by the Reinsured; the Reinsurer shall at all times during the Period of this Contract maintain an Insurer Financial Strength (IFS) rating from Standard & Poor's Rating Group of 55 Water Street; New York; NY 10041; USA ("S&P") equal to or greater than a rating of A minus as applied by S&P to that Reinsurer.
    Downgrading clause ~ ABR1001 (Amended)

    Reinsurer with an S&P Rating
    Unless otherwise agreed by the Reinsured; the Reinsurer shall at all times during the Period of this Contract maintain an Insurer Financial Strength (IFS) rating from Standard & Poor's Rating Group of 55 Water Street; New York; NY 10041; USA ("S&P") equal to or greater than a rating of A minus as applied by S&P to that Reinsurer.
    Communicable disease clause - LMA 5394
    1. Notwithstanding any provision to the contrary within this reinsurance agreement; this reinsurance agreement excludes any loss; damage; liability; claim; cost or expense of whatsoever nature; directly or indirectly caused by; contributed to by; resulting from; arising out of; or in connection with a Communicable Disease or the fear or threat (whether actual or perceived) of a Communicable Disease regardless of any other cause or event contributing concurrently or in any other sequence thereto.
    2. As used herein; a Communicable Disease means any disease which can be transmitted by means of any substance or agent from any organism to another organism where: 2.1. the substance or agent includes; but is not limited to; a virus; bacterium; parasite or other organism or any variation thereof; whether deemed living or not; and 2.2. the method of transmission; whether direct or indirect; includes but is not limited to; airborne transmission; bodily fluid transmission; transmission from or to any surface or object; solid; liquid or gas or between organisms; and 2.3. the disease; substance or agent can cause or threaten damage to human health or human welfare or can cause or threaten damage to; deterioration of; loss of value of; marketability of or loss of use of property.
    Dispute Resolution ~ ABR1004
    Where any dispute or difference between the parties arising out of or in connection with this Contract; including formation and validity and whether arising during or after the period of this Contract; has not been settled through negotiation; both parties agree to try in good faith to settle such dispute by non- binding mediation; before resorting to arbitration in the manner set out below.
    Dispute Resolution ~ ABR1004
    Where any dispute or difference between the parties arising out of or in connection with this Contract; including formation and validity and whether arising during or after the period of this Contract; has not been settled through negotiation; both parties agree to try in good faith to settle such dispute by non- binding mediation; before resorting to arbitration in the manner set out below.
    Brokerage for this Contract is 15.00% of gross ceded premium. No brokerage will be paid on reinstatement premium.
  • Loss: CachedMultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

apnd

  • Dataset: apnd
  • Size: 6,232 training samples
  • Columns: anchor, positive, and negative
  • Approximate statistics based on the first 1000 samples:
    anchor positive negative
    type string string string
    details
    • min: 3 tokens
    • mean: 78.01 tokens
    • max: 128 tokens
    • min: 3 tokens
    • mean: 78.01 tokens
    • max: 128 tokens
    • min: 5 tokens
    • mean: 74.13 tokens
    • max: 128 tokens
  • Samples:
    anchor positive negative
    “North American CAT Perils” means certain Named Storms and Earthquake; each as defined below; in respect of that portion of losses which occur in the United States and Canada and their possessions and territories; excluding the Territory of Guam; the Territory of American Samoa; the Commonwealth of the Northern Mariana Islands; Wake Island; Johnston Atoll; Palmyra Atoll; and the State of Hawaiiterritory of Guam. “North American CAT Perils” means certain Named Storms and Earthquake; each as defined below; in respect of that portion of losses which occur in the United States and Canada and their possessions and territories; excluding the Territory of Guam; the Territory of American Samoa; the Commonwealth of the Northern Mariana Islands; Wake Island; Johnston Atoll; Palmyra Atoll; and the State of Hawaiiterritory of Guam. 'Insurance Compensation' shall mean any compensation; interest or Allocated Expenses paid or payable by the Reinsured in respect of any loss occurrence under Policies covered under this Agreement.
    For the purposes of this Paragraph A.; “Named Storm” means any windstorm or windstorm system that has been named by a Reporting Agency at any time in its lifecycle and ensuing losses therefrom. For the purposes of this Paragraph A.; “Named Storm” means any windstorm or windstorm system that has been named by a Reporting Agency at any time in its lifecycle and ensuing losses therefrom. ‘Contingency policies’ means contracts of contingency insurance unless: a) written as an integral component of General Cover or b) the subject of a binding written commitment on or before 31 December 2018 and incepting or renewing on or before 31 March 2019.
    For the purposes of this Paragraph A.; “Earthquake” means earthquake shake and ensuing losses therefrom. For the purposes of this Paragraph A.; “Earthquake” means earthquake shake and ensuing losses therefrom. Means any programme code; programming instruction or other set of instructions intentionally constructed with the ability to damage; interfere with or otherwise adversely affect computer programmes; data files or operations (whether involving self-replication or not); including but not limited to “Virus;” “Trojan Horses;” “Worms;” “Logic Bombs;” or “Denial of Service Attack.”
  • Loss: CachedMultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

Evaluation Datasets

sscc

  • Dataset: sscc
  • Size: 100 evaluation samples
  • Columns: sentence and label
  • Approximate statistics based on the first 1000 samples:
    sentence label
    type string int
    details
    • min: 20 tokens
    • mean: 104.63 tokens
    • max: 128 tokens
    • 1: ~1.00%
    • 4: ~2.00%
    • 5: ~3.00%
    • 11: ~1.00%
    • 12: ~1.00%
    • 18: ~1.00%
    • 19: ~6.00%
    • 20: ~8.00%
    • 21: ~3.00%
    • 30: ~1.00%
    • 32: ~1.00%
    • 34: ~1.00%
    • 45: ~1.00%
    • 46: ~2.00%
    • 47: ~2.00%
    • 50: ~1.00%
    • 51: ~1.00%
    • 53: ~4.00%
    • 65: ~1.00%
    • 68: ~2.00%
    • 69: ~1.00%
    • 74: ~1.00%
    • 79: ~5.00%
    • 88: ~1.00%
    • 89: ~1.00%
    • 93: ~1.00%
    • 107: ~3.00%
    • 115: ~1.00%
    • 126: ~1.00%
    • 143: ~1.00%
    • 150: ~1.00%
    • 161: ~1.00%
    • 175: ~1.00%
    • 202: ~1.00%
    • 220: ~1.00%
    • 222: ~1.00%
    • 227: ~1.00%
    • 229: ~1.00%
    • 231: ~1.00%
    • 232: ~1.00%
    • 235: ~1.00%
    • 236: ~1.00%
    • 251: ~2.00%
    • 275: ~1.00%
    • 276: ~1.00%
    • 296: ~1.00%
    • 305: ~1.00%
    • 309: ~1.00%
    • 314: ~1.00%
    • 334: ~1.00%
    • 342: ~1.00%
    • 368: ~1.00%
    • 376: ~1.00%
    • 381: ~1.00%
    • 400: ~1.00%
    • 402: ~1.00%
    • 404: ~1.00%
    • 419: ~1.00%
    • 473: ~1.00%
    • 496: ~1.00%
    • 547: ~1.00%
    • 548: ~1.00%
    • 553: ~1.00%
    • 585: ~1.00%
    • 594: ~2.00%
    • 605: ~1.00%
    • 606: ~1.00%
    • 662: ~1.00%
    • 798: ~1.00%
  • Samples:
    sentence label
    Article 26 - Federal Excise Tax (BRMA 17D) A.The Reinsurer has agreed to allow for the purpose of paying the Federal Excise Tax the applicable percentage of the premium payable hereon (as imposed under Section 4371 of the Internal Revenue Code) to the extent such premium is subject to the Federal Excise Tax. 32
    Notwithstanding any provision to the contrary within this Reinsurance Contract; this Reinsurance Contract excludes any loss; damage; liability; claim; cost or expense of whatsoever nature; directly or indirectly caused by; contributed to by; resulting from; arising out of; or in connection with a Communicable Disease or the fear or threat (whether actual or perceived) of a Communicable Disease regardless of any other cause or event contributing concurrently or in any other sequence thereto. 79
    CYBER LOSS LIMITED EXCLUSION CLAUSE (PROPERTY TREATY REINSURANCE)
    Based on LMA 5410 - Amended to clarify consistency of coverage in the write-back
    1. Notwithstanding any provision to the contrary within this reinsurance agreement or any endorsement thereto; this reinsurance agreement excludes all loss; damage; liability; cost or expense of whatsoever nature directly or indirectly caused by; contributed to by; resulting from; arising out of or in connection with:
    1.1 any loss of; alteration of; or damage to or a reduction in the functionality; availability or operation of a Computer System; unless subject to the provisions of paragraph 2;
    1.2 any loss of use; reduction in functionality; repair; replacement; restoration or reproduction of any Data; including any amount pertaining to the value of such Data.
    2. Subject to the other terms; conditions and exclusions contained in this reinsurance agreement; this reinsurance agreement will cover physical damage to property insured under the original policies and any Time Element Loss directly resulting therefrom where such physical damage is directly occasioned by a peril otherwise covered hereunder.
    88
  • Loss: BatchAllTripletLoss

sscd

  • Dataset: sscd
  • Size: 100 evaluation samples
  • Columns: sentence and label
  • Approximate statistics based on the first 1000 samples:
    sentence label
    type string int
    details
    • min: 5 tokens
    • mean: 70.36 tokens
    • max: 128 tokens
    • 1: ~1.00%
    • 3: ~8.00%
    • 4: ~1.00%
    • 5: ~1.00%
    • 7: ~1.00%
    • 8: ~2.00%
    • 19: ~4.00%
    • 25: ~1.00%
    • 26: ~2.00%
    • 29: ~2.00%
    • 32: ~2.00%
    • 33: ~2.00%
    • 34: ~1.00%
    • 38: ~1.00%
    • 39: ~1.00%
    • 54: ~3.00%
    • 55: ~1.00%
    • 68: ~1.00%
    • 78: ~2.00%
    • 80: ~1.00%
    • 82: ~1.00%
    • 84: ~1.00%
    • 93: ~1.00%
    • 98: ~1.00%
    • 120: ~1.00%
    • 134: ~1.00%
    • 135: ~1.00%
    • 143: ~1.00%
    • 144: ~2.00%
    • 149: ~1.00%
    • 154: ~1.00%
    • 161: ~1.00%
    • 173: ~1.00%
    • 180: ~1.00%
    • 181: ~1.00%
    • 183: ~2.00%
    • 206: ~1.00%
    • 236: ~1.00%
    • 238: ~1.00%
    • 239: ~1.00%
    • 243: ~1.00%
    • 244: ~1.00%
    • 256: ~1.00%
    • 264: ~1.00%
    • 326: ~1.00%
    • 361: ~1.00%
    • 367: ~1.00%
    • 374: ~1.00%
    • 377: ~1.00%
    • 429: ~1.00%
    • 433: ~1.00%
    • 443: ~1.00%
    • 448: ~1.00%
    • 473: ~1.00%
    • 488: ~1.00%
    • 521: ~1.00%
    • 535: ~1.00%
    • 556: ~1.00%
    • 557: ~1.00%
    • 580: ~1.00%
    • 589: ~1.00%
    • 679: ~1.00%
    • 693: ~1.00%
    • 797: ~1.00%
    • 857: ~1.00%
    • 859: ~1.00%
    • 871: ~1.00%
    • 873: ~1.00%
    • 960: ~1.00%
    • 979: ~1.00%
    • 1028: ~1.00%
    • 1155: ~1.00%
    • 1209: ~1.00%
    • 1213: ~1.00%
    • 1256: ~1.00%
    • 1297: ~1.00%
    • 1331: ~1.00%
    • 1481: ~1.00%
    • 1528: ~1.00%
    • 1541: ~1.00%
  • Samples:
    sentence label
    “Communicable Disease” means any disease which can be transmitted by means of any substance or agent from any organism to another organism where: a. the substance or agent includes; but is not limited to; a virus; bacterium; parasite or other organism or any variation thereof; whether deemed living or not; and b. the method of transmission; whether direct or indirect; includes but is not limited to; airborne transmission; bodily fluid transmission; transmission from or to any surface or object; solid; liquid or gas or between organisms; and c. the disease; substance or agent can cause or threaten damage to human health or human welfare or can cause or threaten damage to; deterioration of; loss of value of; marketability of or loss of use of property. 4
    “Production; Use or Storage of Nuclear Material” means the production; manufacture; enrichment; conditioning; processing; reprocessing; use; storage; handling and disposal of Nuclear Material. 25
    means information; facts; concepts; code or any other information of any kind that is recorded or transmitted in a form to be used; accessed; processed; transmitted or stored by a Computer System. 7
  • Loss: BatchAllTripletLoss

mlmc

  • Dataset: mlmc
  • Size: 100 evaluation samples
  • Columns: anchor and positive
  • Approximate statistics based on the first 1000 samples:
    anchor positive
    type string string
    details
    • min: 20 tokens
    • mean: 104.63 tokens
    • max: 128 tokens
    • min: 22 tokens
    • mean: 112.7 tokens
    • max: 128 tokens
  • Samples:
    anchor positive
    Article 26 - Federal Excise Tax (BRMA 17D) A.The Reinsurer has agreed to allow for the purpose of paying the Federal Excise Tax the applicable percentage of the premium payable hereon (as imposed under Section 4371 of the Internal Revenue Code) to the extent such premium is subject to the Federal Excise Tax. Article 26 - [MASK] Excise Tax (BRMA 17D) A.The Reinsurer has agreed to allow for the purpose of paying the Federal Excise Tax the applicable percentage of the [MASK] payable hereon (as [MASK] under [MASK] 4371 of the [MASK] Revenue Code) to [MASK] extent such premium is subject to the Federal Excise Tax.
    Notwithstanding any provision to the contrary within this Reinsurance Contract; this Reinsurance Contract excludes any loss; damage; liability; claim; cost or expense of whatsoever nature; directly or indirectly caused by; contributed to by; resulting from; arising out of; or in connection with a Communicable Disease or the fear or threat (whether actual or perceived) of a Communicable Disease regardless of any other cause or event contributing concurrently or in any other sequence thereto. Notwithstanding any provision to the contrary within [MASK] Reinsurance Contract; this Reinsurance Contract excludes [MASK] loss; damage; liability; [MASK] cost or expense of whatsoever nature; directly or indirectly [MASK] by; [MASK] to by; [MASK] from; arising out of; or in connection with a [MASK] Disease or the fear or threat (whether actual or perceived) of a Communicable Disease regardless of any other cause or event contributing [MASK] or in any [MASK] sequence thereto.
    CYBER LOSS LIMITED EXCLUSION CLAUSE (PROPERTY TREATY REINSURANCE)

    Based on LMA 5410 - Amended to clarify consistency of coverage in the write-back

    1. Notwithstanding any provision to the contrary within this reinsurance agreement or any endorsement thereto; this reinsurance agreement excludes all loss; damage; liability; cost or expense of whatsoever nature directly or indirectly caused by; contributed to by; resulting from; arising out of or in connection with:

    1.1 any loss of; alteration of; or damage to or a reduction in the functionality; availability or operation of a Computer System; unless subject to the provisions of paragraph 2;

    1.2 any loss of use; reduction in functionality; repair; replacement; restoration or reproduction of any Data; including any amount pertaining to the value of such Data.

    2. Subject to the other terms; conditions and exclusions contained in this reinsurance agreement; this reinsurance agreement will cover physical damage to property insured under the original policies and any Time Element Loss directly resulting therefrom where such physical damage is directly occasioned by a peril otherwise covered hereunder.
    CYBER LOSS LIMITED EXCLUSION [MASK] (PROPERTY TREATY REINSURANCE) Based on [MASK] [MASK] - Amended to clarify [MASK] of coverage in the [MASK] 1. [MASK] any provision to the contrary within this reinsurance agreement or any endorsement thereto; this reinsurance agreement excludes all loss; damage; [MASK] cost or expense of whatsoever nature [MASK] or indirectly caused by; contributed to by; resulting from; arising out of or in connection with: 1.1 any loss of; alteration of; or damage to or a reduction in [MASK] functionality; availability or operation of a Computer System; unless subject to the provisions of paragraph 2; [MASK] any loss of use; reduction in functionality; repair; replacement; restoration or reproduction of any Data; including any amount pertaining to the [MASK] of [MASK] [MASK] 2. Subject to the other terms; conditions and exclusions contained in this reinsurance agreement; this reinsurance agreement [MASK] cover physical [MASK] to property insured under the original policies and any Time Element [MASK] directly resulting therefrom where such physical damage is directly occasioned by a [MASK] [MASK] covered hereunder.
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

mlmd

  • Dataset: mlmd
  • Size: 100 evaluation samples
  • Columns: anchor and positive
  • Approximate statistics based on the first 1000 samples:
    anchor positive
    type string string
    details
    • min: 5 tokens
    • mean: 70.36 tokens
    • max: 128 tokens
    • min: 5 tokens
    • mean: 78.55 tokens
    • max: 128 tokens
  • Samples:
    anchor positive
    “Communicable Disease” means any disease which can be transmitted by means of any substance or agent from any organism to another organism where: a. the substance or agent includes; but is not limited to; a virus; bacterium; parasite or other organism or any variation thereof; whether deemed living or not; and b. the method of transmission; whether direct or indirect; includes but is not limited to; airborne transmission; bodily fluid transmission; transmission from or to any surface or object; solid; liquid or gas or between organisms; and c. the disease; substance or agent can cause or threaten damage to human health or human welfare or can cause or threaten damage to; deterioration of; loss of value of; marketability of or loss of use of property. “Communicable Disease” means any disease which can be transmitted by [MASK] of [MASK] [MASK] or agent from any organism to another organism where: a. the substance or agent includes; [MASK] is not [MASK] to; a virus; bacterium; parasite or other [MASK] or [MASK] variation thereof; whether deemed living or [MASK] and b. the method of transmission; [MASK] direct or indirect; includes [MASK] is not [MASK] to; airborne transmission; bodily fluid transmission; transmission from or to any [MASK] or object; solid; liquid or gas or between organisms; [MASK] c. the [MASK] substance or agent can cause or threaten damage to human health or human [MASK] or can cause or [MASK] damage to; deterioration of; loss of value of; marketability of or [MASK] of use of property.
    “Production; Use or Storage of Nuclear Material” means the production; manufacture; enrichment; conditioning; processing; reprocessing; use; storage; handling and disposal of Nuclear Material. “Production; Use or Storage of Nuclear Material” means the production; manufacture; [MASK] conditioning; processing; reprocessing; use; [MASK] [MASK] and disposal of Nuclear Material.
    means information; facts; concepts; code or any other information of any kind that is recorded or transmitted in a form to be used; accessed; processed; transmitted or stored by a Computer System. [MASK] information; facts; concepts; [MASK] or any other information of any kind [MASK] is recorded or transmitted in a [MASK] to be used; accessed; processed; transmitted or stored by a Computer System.
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

apnc

  • Dataset: apnc
  • Size: 100 evaluation samples
  • Columns: anchor, positive, and negative
  • Approximate statistics based on the first 1000 samples:
    anchor positive negative
    type string string string
    details
    • min: 20 tokens
    • mean: 104.63 tokens
    • max: 128 tokens
    • min: 20 tokens
    • mean: 104.63 tokens
    • max: 128 tokens
    • min: 20 tokens
    • mean: 104.63 tokens
    • max: 128 tokens
  • Samples:
    anchor positive negative
    Article 26 - Federal Excise Tax (BRMA 17D) A.The Reinsurer has agreed to allow for the purpose of paying the Federal Excise Tax the applicable percentage of the premium payable hereon (as imposed under Section 4371 of the Internal Revenue Code) to the extent such premium is subject to the Federal Excise Tax. Article 26 - Federal Excise Tax (BRMA 17D) A.The Reinsurer has agreed to allow for the purpose of paying the Federal Excise Tax the applicable percentage of the premium payable hereon (as imposed under Section 4371 of the Internal Revenue Code) to the extent such premium is subject to the Federal Excise Tax. ARTICLE XXXII NON-ASSIGNABILITY A. The Reinsurer shall not reinsure or otherwise assign or transfer its entire liability or obligations under this Contract without the Company’s prior written consent. B. The Reinsurer shall not transfer its claims-paying authority under this Contract to an unaffiliated entity or in any other way assign its interests or delegate its obligations under this Contract to an unaffiliated entity without the Company’s prior written consent. Notwithstanding the foregoing; the transfer of claims-paying authority or administration to a third party; where the subscribing reinsurer maintains control over claims settlement decisions; will not constitute a transfer of its claims-paying authority for purposes of this subparagraph.
    Notwithstanding any provision to the contrary within this Reinsurance Contract; this Reinsurance Contract excludes any loss; damage; liability; claim; cost or expense of whatsoever nature; directly or indirectly caused by; contributed to by; resulting from; arising out of; or in connection with a Communicable Disease or the fear or threat (whether actual or perceived) of a Communicable Disease regardless of any other cause or event contributing concurrently or in any other sequence thereto. Notwithstanding any provision to the contrary within this Reinsurance Contract; this Reinsurance Contract excludes any loss; damage; liability; claim; cost or expense of whatsoever nature; directly or indirectly caused by; contributed to by; resulting from; arising out of; or in connection with a Communicable Disease or the fear or threat (whether actual or perceived) of a Communicable Disease regardless of any other cause or event contributing concurrently or in any other sequence thereto. The obligations and duties of a Subscribing Reinsurer under this Contract shall not be assigned
    to or assumed by another reinsurer without the prior written consent of the Company.
    CYBER LOSS LIMITED EXCLUSION CLAUSE (PROPERTY TREATY REINSURANCE)
    Based on LMA 5410 - Amended to clarify consistency of coverage in the write-back
    1. Notwithstanding any provision to the contrary within this reinsurance agreement or any endorsement thereto; this reinsurance agreement excludes all loss; damage; liability; cost or expense of whatsoever nature directly or indirectly caused by; contributed to by; resulting from; arising out of or in connection with:
    1.1 any loss of; alteration of; or damage to or a reduction in the functionality; availability or operation of a Computer System; unless subject to the provisions of paragraph 2;
    1.2 any loss of use; reduction in functionality; repair; replacement; restoration or reproduction of any Data; including any amount pertaining to the value of such Data.
    2. Subject to the other terms; conditions and exclusions contained in this reinsurance agreement; this reinsurance agreement will cover physical damage to property insured under the original policies and any Time Element Loss directly resulting therefrom where such physical damage is directly occasioned by a peril otherwise covered hereunder.
    CYBER LOSS LIMITED EXCLUSION CLAUSE (PROPERTY TREATY REINSURANCE)
    Based on LMA 5410 - Amended to clarify consistency of coverage in the write-back
    1. Notwithstanding any provision to the contrary within this reinsurance agreement or any endorsement thereto; this reinsurance agreement excludes all loss; damage; liability; cost or expense of whatsoever nature directly or indirectly caused by; contributed to by; resulting from; arising out of or in connection with:
    1.1 any loss of; alteration of; or damage to or a reduction in the functionality; availability or operation of a Computer System; unless subject to the provisions of paragraph 2;
    1.2 any loss of use; reduction in functionality; repair; replacement; restoration or reproduction of any Data; including any amount pertaining to the value of such Data.
    2. Subject to the other terms; conditions and exclusions contained in this reinsurance agreement; this reinsurance agreement will cover physical damage to property insured under the original policies and any Time Element Loss directly resulting therefrom where such physical damage is directly occasioned by a peril otherwise covered hereunder.
    The Reinsurer agrees not to disclose any Confidential Information which it may acquire in connection with this Contract; except: 1. To its professional advisors; auditors; attorneys; and other consultants on a need-to-know basis; 2. To any of its affiliates and to the directors; officers; employees; professional advisors; auditors; attorneys; and other consultants of such affiliates on a need-to-know basis; 3. To any other party to whom such disclosure is necessary for the Reinsurer to enforce its rights hereunder; 4. To any party from whom the Reinsurer is seeking or from whom the Reinsurer has obtained reinsurance as long as the disclosing party is under a similar obligation of confidentiality as the Reinsurer is under this Article; 5. When required for the Reinsurer's internal operations. Further; the Reinsurer agrees not to use any Confidential Information for any purpose not related to the performance of its obligations or enforcement of its rights under this Contract. Notwithstanding the above; in the event that the Reinsurer is required by court order; other legal process or any regulatory authority to release or disclose any or all of the Confidential Information; the Reinsurer agrees to provide the Company with written notice of same at least 10 days prior to such release or disclosure and to use its best efforts to assist the Company in maintaining the confidentiality provided for in this Article. The provisions of this Article shall apply to renewal information provided to the Reinsurer by the Company prior to or upon the expiration or termination of this Contract.
  • Loss: CachedMultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

apnd

  • Dataset: apnd
  • Size: 100 evaluation samples
  • Columns: anchor, positive, and negative
  • Approximate statistics based on the first 1000 samples:
    anchor positive negative
    type string string string
    details
    • min: 5 tokens
    • mean: 70.36 tokens
    • max: 128 tokens
    • min: 5 tokens
    • mean: 70.36 tokens
    • max: 128 tokens
    • min: 5 tokens
    • mean: 71.3 tokens
    • max: 128 tokens
  • Samples:
    anchor positive negative
    “Communicable Disease” means any disease which can be transmitted by means of any substance or agent from any organism to another organism where: a. the substance or agent includes; but is not limited to; a virus; bacterium; parasite or other organism or any variation thereof; whether deemed living or not; and b. the method of transmission; whether direct or indirect; includes but is not limited to; airborne transmission; bodily fluid transmission; transmission from or to any surface or object; solid; liquid or gas or between organisms; and c. the disease; substance or agent can cause or threaten damage to human health or human welfare or can cause or threaten damage to; deterioration of; loss of value of; marketability of or loss of use of property. “Communicable Disease” means any disease which can be transmitted by means of any substance or agent from any organism to another organism where: a. the substance or agent includes; but is not limited to; a virus; bacterium; parasite or other organism or any variation thereof; whether deemed living or not; and b. the method of transmission; whether direct or indirect; includes but is not limited to; airborne transmission; bodily fluid transmission; transmission from or to any surface or object; solid; liquid or gas or between organisms; and c. the disease; substance or agent can cause or threaten damage to human health or human welfare or can cause or threaten damage to; deterioration of; loss of value of; marketability of or loss of use of property. shall be defined as the sum of all losses directly occasioned by any one disaster; accident or loss or series of disasters; accidents or losses arising out of one event. The duration and extent of any one Loss Occurrence will be limited to all individual losses sustained by the Company occurring during any period of 168 consecutive hours (except as otherwise provided below) arising out of and directly occasioned by the same event except that the term Loss Occurrence will be further defined as follows: 1. As regards windstorm; hail; tornado; including ensuing collapse and water damage; all individual losses sustained by the Company occurring during any period of 168 consecutive hours arising out of and directly occasioned by the same event. Notwithstanding the foregoing; as respects Named Storm only; the period of consecutive hours applicable to such Named Storm may be extended beyond 168 hours in accordance with the provisions of paragraph E below. 2. As regards riot; riot attending a strike; civil commotion; vandalism and malicious mischief; all individual losses sustained by the Company occurring during any period of 96 consecutive hours arising out of and directly occasioned by the same event. The maximum duration of 96 consecutive hours may be extended in respect of individual losses which occur beyond such 96 consecutive hours during the continued occupation of an insured’s premises by strikers or locked-out workers; provided such occupation commenced during the aforesaid period. 3. As regards earthquake (the epicenter of which need not necessarily be within the territorial confines referred to in the Territory Article) and fire following directly occasioned by the earthquake; only those earthquake losses and individual fire losses which commence during the period of 168 consecutive hours may be included in any one Loss Occurrence. 4. As regards freezing; frost; ice; snow; sleet; including weight of snow; ice or sleet; collapse of buildings; breakage of glass and water damage (caused by bursting of frozen pipes and tanks or freezing and/or melting snow or sleet; including but not limited to ice dams); as well as other perils; all individual losses sustained by the Company occurring during any period of 72 consecutive hours arising out of and directly occasioned by the same event.
    “Production; Use or Storage of Nuclear Material” means the production; manufacture; enrichment; conditioning; processing; reprocessing; use; storage; handling and disposal of Nuclear Material. “Production; Use or Storage of Nuclear Material” means the production; manufacture; enrichment; conditioning; processing; reprocessing; use; storage; handling and disposal of Nuclear Material. “Contract” shall be understood to mean “Contract;” “Policy” or whatever other term is used to designate the attached reinsurance document.
    means information; facts; concepts; code or any other information of any kind that is recorded or transmitted in a form to be used; accessed; processed; transmitted or stored by a Computer System. means information; facts; concepts; code or any other information of any kind that is recorded or transmitted in a form to be used; accessed; processed; transmitted or stored by a Computer System. 13. Self-Insurance applicable to section D (Eureko Sigorta A.Ş;) of the Risk details
  • Loss: CachedMultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

nlpaug_sc

  • Dataset: nlpaug_sc
  • Size: 100 evaluation samples
  • Columns: sentence1, label, and sentence2
  • Approximate statistics based on the first 1000 samples:
    sentence1 label sentence2
    type string int string
    details
    • min: 20 tokens
    • mean: 104.63 tokens
    • max: 128 tokens
    • 1: ~1.00%
    • 4: ~2.00%
    • 5: ~3.00%
    • 11: ~1.00%
    • 12: ~1.00%
    • 18: ~1.00%
    • 19: ~6.00%
    • 20: ~8.00%
    • 21: ~3.00%
    • 30: ~1.00%
    • 32: ~1.00%
    • 34: ~1.00%
    • 45: ~1.00%
    • 46: ~2.00%
    • 47: ~2.00%
    • 50: ~1.00%
    • 51: ~1.00%
    • 53: ~4.00%
    • 65: ~1.00%
    • 68: ~2.00%
    • 69: ~1.00%
    • 74: ~1.00%
    • 79: ~5.00%
    • 88: ~1.00%
    • 89: ~1.00%
    • 93: ~1.00%
    • 107: ~3.00%
    • 115: ~1.00%
    • 126: ~1.00%
    • 143: ~1.00%
    • 150: ~1.00%
    • 161: ~1.00%
    • 175: ~1.00%
    • 202: ~1.00%
    • 220: ~1.00%
    • 222: ~1.00%
    • 227: ~1.00%
    • 229: ~1.00%
    • 231: ~1.00%
    • 232: ~1.00%
    • 235: ~1.00%
    • 236: ~1.00%
    • 251: ~2.00%
    • 275: ~1.00%
    • 276: ~1.00%
    • 296: ~1.00%
    • 305: ~1.00%
    • 309: ~1.00%
    • 314: ~1.00%
    • 334: ~1.00%
    • 342: ~1.00%
    • 368: ~1.00%
    • 376: ~1.00%
    • 381: ~1.00%
    • 400: ~1.00%
    • 402: ~1.00%
    • 404: ~1.00%
    • 419: ~1.00%
    • 473: ~1.00%
    • 496: ~1.00%
    • 547: ~1.00%
    • 548: ~1.00%
    • 553: ~1.00%
    • 585: ~1.00%
    • 594: ~2.00%
    • 605: ~1.00%
    • 606: ~1.00%
    • 662: ~1.00%
    • 798: ~1.00%
    • min: 21 tokens
    • mean: 110.01 tokens
    • max: 128 tokens
  • Samples:
    sentence1 label sentence2
    Article 26 - Federal Excise Tax (BRMA 17D) A.The Reinsurer has agreed to allow for the purpose of paying the Federal Excise Tax the applicable percentage of the premium payable hereon (as imposed under Section 4371 of the Internal Revenue Code) to the extent such premium is subject to the Federal Excise Tax. 32 Clause 26 - Federal Excise tax Taxation (BRMA 17D) A. The Reinsurer have agreed to take into account for the use of paying the Fed Excise tax Revenue enhancement the applicable part of the agio collectable hereon (as levy under Subdivision 4371 of the Internal Gross Code) to the extent such agiotage be open to the Federal Excise tax Taxation.
    Notwithstanding any provision to the contrary within this Reinsurance Contract; this Reinsurance Contract excludes any loss; damage; liability; claim; cost or expense of whatsoever nature; directly or indirectly caused by; contributed to by; resulting from; arising out of; or in connection with a Communicable Disease or the fear or threat (whether actual or perceived) of a Communicable Disease regardless of any other cause or event contributing concurrently or in any other sequence thereto. 79 Notwithstanding any proviso to the opposite within this Reinsurance Contract; this Reinsurance Contract bridge shut any passing; damage; indebtedness; claim; cost or disbursement of any nature; immediately beaver state indirectly stimulate by; give to by; result from; get up forbidden of; or in connection with a Contagious Disease or the fright operating theatre terror (whether actual or perceive) of a Catching Disease regardless of any other cause operating theater result lend at the same time operating theatre in any early succession thereto.
    CYBER LOSS LIMITED EXCLUSION CLAUSE (PROPERTY TREATY REINSURANCE)
    Based on LMA 5410 - Amended to clarify consistency of coverage in the write-back
    1. Notwithstanding any provision to the contrary within this reinsurance agreement or any endorsement thereto; this reinsurance agreement excludes all loss; damage; liability; cost or expense of whatsoever nature directly or indirectly caused by; contributed to by; resulting from; arising out of or in connection with:
    1.1 any loss of; alteration of; or damage to or a reduction in the functionality; availability or operation of a Computer System; unless subject to the provisions of paragraph 2;
    1.2 any loss of use; reduction in functionality; repair; replacement; restoration or reproduction of any Data; including any amount pertaining to the value of such Data.
    2. Subject to the other terms; conditions and exclusions contained in this reinsurance agreement; this reinsurance agreement will cover physical damage to property insured under the original policies and any Time Element Loss directly resulting therefrom where such physical damage is directly occasioned by a peril otherwise covered hereunder.
    88 CYBER release express EXCLUSION article (prop pact REINSURANCE) Based on LMA 5410 - Amended to clarify consistency of coverage in the write - rearwards one. Notwithstanding any provision to the reverse within this reinsurance accord operating theater any endorsement thereto; this reinsurance concord exclude all red ink; legal injury; liability; cost or disbursal of whatever nature directly surgery indirectly get by; contributed to by; result from; spring up prohibited of operating theatre in connecter with: unity. ace any loss of; alteration of; operating theatre damage to operating theatre a reduction in the functionality; availability or operation of a Calculator Organisation; unless content to the victuals of paragraph two; 1. two any loss of use; diminution in functionality; repair; substitute; restoration beaver state breeding of any Datum; include any measure pertaining to the note value of such Data point. 2. Dependent to the early full term; status and exclusion contain in this reinsurance concord; this reinsurance understanding testament handle strong arm harm to attribute insured nether the original policy and any Clock time Chemical element Going directly resulting therefrom where such physical damage be straight occasion by a peril other than overlay hereunder.
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

nlpaug_sd

  • Dataset: nlpaug_sd
  • Size: 100 evaluation samples
  • Columns: sentence1, label, and sentence2
  • Approximate statistics based on the first 1000 samples:
    sentence1 label sentence2
    type string int string
    details
    • min: 5 tokens
    • mean: 70.36 tokens
    • max: 128 tokens
    • 1: ~1.00%
    • 3: ~8.00%
    • 4: ~1.00%
    • 5: ~1.00%
    • 7: ~1.00%
    • 8: ~2.00%
    • 19: ~4.00%
    • 25: ~1.00%
    • 26: ~2.00%
    • 29: ~2.00%
    • 32: ~2.00%
    • 33: ~2.00%
    • 34: ~1.00%
    • 38: ~1.00%
    • 39: ~1.00%
    • 54: ~3.00%
    • 55: ~1.00%
    • 68: ~1.00%
    • 78: ~2.00%
    • 80: ~1.00%
    • 82: ~1.00%
    • 84: ~1.00%
    • 93: ~1.00%
    • 98: ~1.00%
    • 120: ~1.00%
    • 134: ~1.00%
    • 135: ~1.00%
    • 143: ~1.00%
    • 144: ~2.00%
    • 149: ~1.00%
    • 154: ~1.00%
    • 161: ~1.00%
    • 173: ~1.00%
    • 180: ~1.00%
    • 181: ~1.00%
    • 183: ~2.00%
    • 206: ~1.00%
    • 236: ~1.00%
    • 238: ~1.00%
    • 239: ~1.00%
    • 243: ~1.00%
    • 244: ~1.00%
    • 256: ~1.00%
    • 264: ~1.00%
    • 326: ~1.00%
    • 361: ~1.00%
    • 367: ~1.00%
    • 374: ~1.00%
    • 377: ~1.00%
    • 429: ~1.00%
    • 433: ~1.00%
    • 443: ~1.00%
    • 448: ~1.00%
    • 473: ~1.00%
    • 488: ~1.00%
    • 521: ~1.00%
    • 535: ~1.00%
    • 556: ~1.00%
    • 557: ~1.00%
    • 580: ~1.00%
    • 589: ~1.00%
    • 679: ~1.00%
    • 693: ~1.00%
    • 797: ~1.00%
    • 857: ~1.00%
    • 859: ~1.00%
    • 871: ~1.00%
    • 873: ~1.00%
    • 960: ~1.00%
    • 979: ~1.00%
    • 1028: ~1.00%
    • 1155: ~1.00%
    • 1209: ~1.00%
    • 1213: ~1.00%
    • 1256: ~1.00%
    • 1297: ~1.00%
    • 1331: ~1.00%
    • 1481: ~1.00%
    • 1528: ~1.00%
    • 1541: ~1.00%
    • min: 5 tokens
    • mean: 75.49 tokens
    • max: 128 tokens
  • Samples:
    sentence1 label sentence2
    “Communicable Disease” means any disease which can be transmitted by means of any substance or agent from any organism to another organism where: a. the substance or agent includes; but is not limited to; a virus; bacterium; parasite or other organism or any variation thereof; whether deemed living or not; and b. the method of transmission; whether direct or indirect; includes but is not limited to; airborne transmission; bodily fluid transmission; transmission from or to any surface or object; solid; liquid or gas or between organisms; and c. the disease; substance or agent can cause or threaten damage to human health or human welfare or can cause or threaten damage to; deterioration of; loss of value of; marketability of or loss of use of property. 4 “ Communicable Disease ” intend any disease which tin be transmit by way of any inwardness or factor from any being to another being where: a. the center beaver state agent include; but is non limited to; a virus; bacteria; sponger beaver state former being oregon any variance thereof; whether deemed living operating theater not; and b. the method of transmission; whether direct operating room indirect; includes only be non special to; airborne transmission; bodily fluid transmission system; transmittal from or to any surface or object; solid; liquid operating theatre gas beaver state between organism; and c. the disease; meaning surgery broker can cause or jeopardise scathe to human wellness operating room human eudaimonia or potty stimulate surgery peril harm to; declension of; passing of time value of; marketability of or going of use of holding.
    “Production; Use or Storage of Nuclear Material” means the production; manufacture; enrichment; conditioning; processing; reprocessing; use; storage; handling and disposal of Nuclear Material. 25 “ Product; Use operating room Storage of Nuclear Cloth ” imply the product; manufacture; enrichment; conditioning; processing; reprocessing; economic consumption; computer storage; handling and administration of Nuclear Fabric.
    means information; facts; concepts; code or any other information of any kind that is recorded or transmitted in a form to be used; accessed; processed; transmitted or stored by a Computer System. 7 mean value info; facts; conception; code beaver state any former information of any kind that is put down beaver state transmit in a contour to follow used; get at; sue; broadcast operating theatre store by a Computer Organisation.
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

nlpaug_nc

  • Dataset: nlpaug_nc
  • Size: 100 evaluation samples
  • Columns: original_sentence and damaged_sentence
  • Approximate statistics based on the first 1000 samples:
    original_sentence damaged_sentence
    type string string
    details
    • min: 20 tokens
    • mean: 104.63 tokens
    • max: 128 tokens
    • min: 24 tokens
    • mean: 112.69 tokens
    • max: 128 tokens
  • Samples:
    original_sentence damaged_sentence
    Article 26 - Federal Excise Tax (BRMA 17D) A.The Reinsurer has agreed to allow for the purpose of paying the Federal Excise Tax the applicable percentage of the premium payable hereon (as imposed under Section 4371 of the Internal Revenue Code) to the extent such premium is subject to the Federal Excise Tax. Articel 26rd - Federal Excise Tax (17D BRMA) At. There Reinsurer hat agreeded to phone allown tha pupose of pay thre Federal Excise Tax applicable tho porcentage tte ok premiun payable hereon (als imposed ander 4371 Section on Internal ttt Code Revenue) ta extend thes souch premiun i ' ts subjekt thye ton Excise Federal Tax.
    Notwithstanding any provision to the contrary within this Reinsurance Contract; this Reinsurance Contract excludes any loss; damage; liability; claim; cost or expense of whatsoever nature; directly or indirectly caused by; contributed to by; resulting from; arising out of; or in connection with a Communicable Disease or the fear or threat (whether actual or perceived) of a Communicable Disease regardless of any other cause or event contributing concurrently or in any other sequence thereto. Notwhitstanding eany prevision so [[ th contraty whit tus Reinsurance Contract; theis Reinsurance ani Contract excludes lose; demage; libility clam; ; coasts on expense fot whatsoever natural; dirctly or indirectly coused by; contributed tm by; resulting; from arising cult oh; oe nin connection mit a Disease Communicable or the ot fear threat (actuel wether ora perceived) and af Communicable Disease [regardles [jf eany oth cuse nor evet contributing concurrently os in orhers andy senquency thereto.
    CYBER LOSS LIMITED EXCLUSION CLAUSE (PROPERTY TREATY REINSURANCE)
    Based on LMA 5410 - Amended to clarify consistency of coverage in the write-back
    1. Notwithstanding any provision to the contrary within this reinsurance agreement or any endorsement thereto; this reinsurance agreement excludes all loss; damage; liability; cost or expense of whatsoever nature directly or indirectly caused by; contributed to by; resulting from; arising out of or in connection with:
    1.1 any loss of; alteration of; or damage to or a reduction in the functionality; availability or operation of a Computer System; unless subject to the provisions of paragraph 2;
    1.2 any loss of use; reduction in functionality; repair; replacement; restoration or reproduction of any Data; including any amount pertaining to the value of such Data.
    2. Subject to the other terms; conditions and exclusions contained in this reinsurance agreement; this reinsurance agreement will cover physical damage to property insured under the original policies and any Time Element Loss directly resulting therefrom where such physical damage is directly occasioned by a peril otherwise covered hereunder.
    CYBER LOSS LIMITED EXCLUSION CLAUSE (PROPERTY TREATY REINSURANCE) Besed om LMA 5410 - Amended to clarfy consistency of coverage ein thar write - 10 hack. Notwithstanding anny prevision tm the contrary withim ths reinsurance ageement ore an endorsement thereto; htis reinsurance agrement excludes alll lose; dammage; libility; costs expence ore of whatsoever direectly hature ore coused indirectly yb; contributed to bye; resulting from; arising oud f or yn connection vith: 1 1. andy lose ofo; alteration off or; demage tou ore g reduction ii thre functionality; availability or operation ol and Sistem Camputer; unless subgect t ghe provisions of praghraph Two; l. two any lose oft use; réductions int functionality; repear; relacement; restortion ow reproduction of anny Date; incluses eny amouth pertaining o tie vlue of such Data. Two. por Subjekt tne anoter therms; condicions ah exclusions contain in this reinsurance agretment; thous reinsurance agrement weill cover physiscal so demage propertie insured under rthe original polices many ve Time Element Loss directly resulting therefrom wher cush physiscal dammage ist dircetly occasioned bye! I perfil othervise convered hereunder.
  • Loss: DenoisingAutoEncoderLoss

nlpaug_nd

  • Dataset: nlpaug_nd
  • Size: 100 evaluation samples
  • Columns: original_sentence and damaged_sentence
  • Approximate statistics based on the first 1000 samples:
    original_sentence damaged_sentence
    type string string
    details
    • min: 5 tokens
    • mean: 70.36 tokens
    • max: 128 tokens
    • min: 5 tokens
    • mean: 79.57 tokens
    • max: 128 tokens
  • Samples:
    original_sentence damaged_sentence
    “Communicable Disease” means any disease which can be transmitted by means of any substance or agent from any organism to another organism where: a. the substance or agent includes; but is not limited to; a virus; bacterium; parasite or other organism or any variation thereof; whether deemed living or not; and b. the method of transmission; whether direct or indirect; includes but is not limited to; airborne transmission; bodily fluid transmission; transmission from or to any surface or object; solid; liquid or gas or between organisms; and c. the disease; substance or agent can cause or threaten damage to human health or human welfare or can cause or threaten damage to; deterioration of; loss of value of; marketability of or loss of use of property. “ Communicable Disease ” meen an disease wich bee kan transmitted ba means of any substance or agen from any organism to anoter organism wehre: e. the substance ou aggent incudes; bu nat is limitid to; de virus; bacterium; ore parasite over organism ore eny variation; thereof whether leving deemed on note; & be. thirty method fot transmition; wheter direkt o inderect; incudes hut is hot limited; ot airborne transmition; bodily fluent transmition; transmition froö ar tio anny surface oir objet; solid; liquid gaz or our between organisms; An c. teh disiase; substance on agen can casuse oder threaten trto damege human ' s halth human ar welfare o cae casuse ora threaten damage so; deterioration of; lose iof vaule; ow marketability of og lose for used of properly.
    “Production; Use or Storage of Nuclear Material” means the production; manufacture; enrichment; conditioning; processing; reprocessing; use; storage; handling and disposal of Nuclear Material. Produtccion “; Ues os Storage jf Material Nuclear meens ” prodoction tje; ; manufactory enrichments; condicioning procession; reprocessing; ; used; storage; anh disposal hadling Nuclear oof Material.
    means information; facts; concepts; code or any other information of any kind that is recorded or transmitted in a form to be used; accessed; processed; transmitted or stored by a Computer System. mean ' s; fascts información; concepts; cod og any othen informtion od any kind thas is recored oe is transmitted. .. . am from tou; b accessed used; processed; e transmitted stored a Sistem bye Camputer.
  • Loss: DenoisingAutoEncoderLoss

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: steps
  • per_device_train_batch_size: 16
  • per_device_eval_batch_size: 16
  • learning_rate: 3e-05
  • num_train_epochs: 10
  • warmup_ratio: 0.1

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: steps
  • prediction_loss_only: True
  • per_device_train_batch_size: 16
  • per_device_eval_batch_size: 16
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 1
  • eval_accumulation_steps: None
  • learning_rate: 3e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1.0
  • num_train_epochs: 10
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.1
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: False
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: False
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • dispatch_batches: None
  • split_batches: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: proportional

Training Logs

Epoch Step Training Loss nlpaug nd loss sscd loss apnd loss nlpaug sd loss mlmd loss apnc loss nlpaug sc loss nlpaug nc loss sscc loss mlmc loss
0.3644 500 0.8708 15.1313 2.2851 0.0924 0.0866 0.1048 0.0951 0.1196 14.5220 2.2365 0.2003
0.7289 1000 0.7177 14.6606 2.1969 0.0921 0.1212 0.1645 0.1057 0.1463 14.5770 2.0828 0.3120
1.0933 1500 0.6445 14.7070 2.0593 0.0913 0.1393 0.0826 0.1019 0.1380 14.5515 2.4969 0.1772
1.4577 2000 0.4994 14.8851 1.6659 0.0931 0.1948 0.1235 0.0965 0.2306 14.5950 2.0122 0.3420
1.8222 2500 0.5148 14.7445 2.5178 0.0927 0.1761 0.0973 0.1272 0.2058 14.5290 1.3921 0.2258
2.1866 3000 0.367 14.1619 1.9137 0.0969 0.2986 0.1418 0.0968 0.1363 14.5308 1.6880 0.1397
2.5510 3500 0.2729 14.2302 1.4288 0.0933 0.3937 0.1538 0.1029 0.3526 14.5490 1.8735 0.1979
2.9155 4000 0.3205 14.2318 1.8162 0.0930 0.2958 0.1161 0.1078 0.2993 14.5501 1.4012 0.3392
3.2799 4500 0.2348 13.8580 2.1855 0.0961 0.4010 0.2083 0.1127 0.3411 14.5160 1.8794 0.3983
3.6443 5000 0.1798 14.1279 1.9564 0.0947 0.4808 0.4580 0.0888 0.3849 14.6008 1.7231 0.4445
4.0087 5500 0.1956 14.2980 2.2596 0.0918 0.5607 0.1820 0.1041 0.4166 14.5260 1.3588 0.3057
4.3732 6000 0.1206 14.2143 2.0864 0.0934 0.3977 0.2380 0.1020 0.2190 14.5664 1.6826 0.3228
4.7376 6500 0.1125 14.3155 2.3540 0.0926 0.3077 0.1625 0.0967 0.3387 14.6006 1.3013 0.3673
5.1020 7000 0.1383 14.3785 2.5032 0.0928 0.2585 0.1531 0.0961 0.2197 14.5236 1.2166 0.2334
5.4665 7500 0.062 14.3705 2.1105 0.0937 0.4184 0.2088 0.0991 0.3703 14.5696 1.2948 0.2347
5.8309 8000 0.0939 14.3961 2.3335 0.0912 0.2998 0.1529 0.1010 0.2358 14.5568 1.2639 0.1616
6.1953 8500 0.0619 14.3636 2.2623 0.0919 0.3567 0.1683 0.0941 0.2324 14.5446 1.2316 0.2588
6.5598 9000 0.038 14.3455 1.7793 0.0908 0.4117 0.2318 0.0935 0.2740 14.5238 1.4976 0.3550
6.9242 9500 0.0571 14.2810 2.0993 0.0908 0.3068 0.1941 0.0962 0.2828 14.5621 1.5923 0.3142
7.2886 10000 0.0318 14.2489 2.1614 0.0916 0.3534 0.1427 0.0947 0.1401 14.5454 1.2461 0.2602
7.6531 10500 0.0218 14.2667 1.8802 0.0913 0.4042 0.2680 0.0919 0.2526 14.5593 1.2298 0.3345
8.0175 11000 0.033 14.1999 2.3782 0.0911 0.3826 0.2253 0.0934 0.2592 14.5864 1.3055 0.4125
8.3819 11500 0.0134 14.1755 2.1650 0.0906 0.3321 0.1388 0.0908 0.2088 14.5703 1.2372 0.2822
8.7464 12000 0.0131 14.2730 2.1863 0.0908 0.3875 0.2360 0.0915 0.2873 14.5594 1.1976 0.3510
9.1108 12500 0.0101 14.2548 2.0894 0.0906 0.3229 0.1782 0.0910 0.2124 14.5738 1.1701 0.3403
9.4752 13000 0.0069 14.1834 1.9048 0.0906 0.3082 0.1598 0.0893 0.2034 14.5701 1.1954 0.3232
9.8397 13500 0.0082 14.1965 1.9620 0.0905 0.3165 0.1787 0.0881 0.2163 14.5708 1.1815 0.3185

Framework Versions

  • Python: 3.10.12
  • Sentence Transformers: 3.0.1
  • Transformers: 4.41.2
  • PyTorch: 2.3.0+cu121
  • Accelerate: 0.32.1
  • Datasets: 2.20.0
  • Tokenizers: 0.19.1

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

BatchAllTripletLoss

@misc{hermans2017defense,
    title={In Defense of the Triplet Loss for Person Re-Identification}, 
    author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
    year={2017},
    eprint={1703.07737},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

MultipleNegativesRankingLoss

@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply}, 
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

CachedMultipleNegativesRankingLoss

@misc{gao2021scaling,
    title={Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup}, 
    author={Luyu Gao and Yunyi Zhang and Jiawei Han and Jamie Callan},
    year={2021},
    eprint={2101.06983},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

DenoisingAutoEncoderLoss

@inproceedings{wang-2021-TSDAE,
    title = "TSDAE: Using Transformer-based Sequential Denoising Auto-Encoderfor Unsupervised Sentence Embedding Learning",
    author = "Wang, Kexin and Reimers, Nils and Gurevych, Iryna", 
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021",
    month = nov,
    year = "2021",
    address = "Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    pages = "671--688",
    url = "https://arxiv.org/abs/2104.06979",
}
Downloads last month
0
Safetensors
Model size
278M params
Tensor type
F32
·
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Finetuned from