metadata
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:27191
- loss:TripletLoss
base_model: sentence-transformers/all-mpnet-base-v2
widget:
- source_sentence: >-
MySafetyEye offers an innovative and sustainable home surveillance and
alarm solution that repurposes old smartphones or tablets into monitoring
devices. Users can install the app on their primary phone and the old
device to set up a home alarm system in under two minutes. The service
includes free home surveillance and a subscription-based home alarm
feature with motion detection. When motion is detected, an alarm
notification is sent to the user's primary smartphone, providing a picture
of the event. The system also supports automatic activation and
deactivation based on the family's geolocation, enhancing convenience and
security.
sentences:
- >-
CREDO FISH was a Denmark-based company specializing in the import and
export of seafood products, including tiger-shark and angel-shark.
Established on December 1, 1995, the company was headquartered in
Frederikshavn, Nordjylland. It ceased operations on December 21, 2021.
- >-
Software-Pro ApS provides technology consulting services, specializing
in testing and quality assurance. The company assists clients in testing
their software and optimizing processes to enhance predictability in
software development. Software-Pro serves customers in Denmark.
- >-
Living Places is a comprehensive online resource dedicated to providing
detailed information about homes, neighborhoods, towns, and counties
across the United States. The platform offers insights into various
residential subdivisions, architectural styles, and historical areas,
catering to individuals seeking to learn more about different living
environments. With a focus on U.S. neighborhoods and towns, Living
Places serves as a valuable guide for those exploring housing options
and community characteristics.
- source_sentence: >-
Hindholm Privatskole is a private school located in Fuglebjerg, Denmark,
offering education from kindergarten through 9th grade. The school
emphasizes a respectful and inclusive environment, with a focus on
holistic development through varied learning experiences. Surrounded by
nature, the campus features a lake, green areas, sports fields,
playgrounds, and a forest area utilized for both educational and
recreational purposes. Hindholm Privatskole maintains small class sizes,
with a maximum of 24 students per class, and upholds traditions such as
daily morning singing sessions and annual excursions to enhance the
educational experience.
sentences:
- >-
Flindt-Kristensen is a Danish engineering firm specializing in product
development, concept design, structural analysis, and technical
drawings. They offer services such as transport tools, lifting tools,
and special projects, aiming to add value through innovation and
extensive technical knowledge. Their core values include service,
dedication, ambition, innovation, and fun.
- >-
Automators Holding ApS is a Danish company located at Knud Kristensens
Gade 11, 2300 København S. The company is active and has a Legal Entity
Identifier (LEI) code of 98450061ACA903B44085. As of the latest
available data, Automators Holding ApS has one employee. Further details
about the company's operations, products, or services are not readily
available from the provided sources.
- >-
Blomsterhaven is an integrated institution for children aged 0-6 years,
operating under Sydbyens Børnehus. It comprises three houses: Anemonen,
Bellis, and Crocus, accommodating up to 110 children. Located in a mixed
residential and industrial area near the local school, each house
features large playgrounds tailored to different age groups, promoting
motor and sensory development. The facility offers 30 nursery and 80
kindergarten places, staffed by 24 permanent employees and 3 regular
substitutes. Blomsterhaven emphasizes Danish traditions and utilizes the
surrounding nature for educational activities.
- source_sentence: >-
MENSEL & KASTEN RÅDGIVENDE EL-INGENIØRFIRMA is a Danish engineering
consultancy firm specializing in electrical engineering services for
construction and civil engineering projects. Established in 1996, the
company is based in Løgumkloster, Denmark.
([ownr.dk](https://ownr.dk/companies/public-profile/19115976?utm_source=openai))
The firm focuses on providing expert advice and solutions in the field of
electrical engineering, catering to various construction and
infrastructure projects.
sentences:
- >-
Dottir is a dynamic business law firm specializing in technology,
intellectual property, and data protection. Founded nearly a decade ago
by attorneys from top law firms, Dottir has evolved to meet the growing
demand for legal services in the expanding technology sector. Their
globally ranked team advises companies on a wide range of legal matters
arising from technology transactions, regulatory requirements,
litigation, and regulatory enforcement proceedings. They assist
fast-growing tech companies in protecting strategic IP assets and
translating business models into solid legal documents. Their services
also cover assisting large corporations in outsourcing or transforming
business-critical IT systems and services, dispute resolution concerning
complex technology-related disputes, as well as AI, data protection,
cybersecurity, and other data-related compliance issues.
- >-
Brenstensgård is a Danish partnership established in 2021, specializing
in the production of slaughter pigs. Located at Splitad 2, 8970 Havndal,
the company is owned equally by five partners: Valdemar Bay-Smidt, Tina
Bay-Smidt, Suzy Storm, Lars Bay-Smidt, and Hanne Bay-Smidt Lysgaard,
each holding a 20% stake.
([paqle.dk](https://www.paqle.dk/p/brenstensg%C3%A5rd-i-s/6402918?utm_source=openai))
- >-
GROVE-NIELSEN ApS is a Danish company that has published financial
statements for the years 2010 through 2013. The company reported profits
in 2012 and 2013, with net results of DKK 312,000 and DKK 279,000,
respectively. The company's total assets increased from DKK 2.821
million in 2012 to DKK 2.986 million in 2013, and equity rose from DKK
2.710 million to DKK 2.892 million over the same period. Further details
about the company's operations or industry are not available from the
provided information.
- source_sentence: >-
PM & JØ Holding is a Danish non-financial holding company established in
2017, located at Hejrevej 17, 8400 Ebeltoft. The company is co-owned by
Palle Martin Lund Jensen and Jette Ørnbøl Jeppesen, each holding a 50%
stake. As a holding entity, PM & JØ Holding primarily manages investments
in other companies, including L-tek A/S, where Palle Martin Lund Jensen
serves as director. The company reported a net profit of DKK 1.6 million
in 2023, reflecting its financial performance in managing its portfolio.
sentences:
- >-
FAUNA PASSAGE is a Danish company specializing in the research,
development, and production of products and concepts aimed at protecting
wildlife. Established on September 1, 1996, and headquartered at
Forskerparken 10, 5230 Odense M, the company focuses on creating
solutions to safeguard fauna from traffic-related challenges. Their
offerings include wildlife crossings such as tunnels, bridges, and other
structures designed to facilitate safe animal passage across human-made
barriers. The company is led by Director Lars Arthur Briggs.
- "\x0F8ksenm"
- >-
Fisker Olesen Holding ApS is a Danish holding company established in
2021, located at Gammel Kongevej 112, 3, 1850 Frederiksberg C. The
company is involved in managing investments and overseeing subsidiaries.
As of 2023, it reported a gross profit of -1 DKK and a pre-tax result of
-12 DKK. The company is led by Nina Fisker Olesen, who holds multiple
business roles in Denmark.
([proff.dk](https://www.proff.dk/firma/nina-fisker-olesen-holding-aps/frederiksberg-c/holdingselskaper/0P9Q29I06Y4?utm_source=openai),
[ownr.dk](https://ownr.dk/users/public-profile/4008071852?utm_source=openai))
- source_sentence: >-
Gammellund Ejendomme is a Danish real estate company based in Odense,
Denmark. The company specializes in property development, management, and
sales services. As of 2024, it employs one person and has reported total
assets of approximately 2.9 million DKK. The company is led by Director
Brian Gammellund Rasmussen and was founded on October 9, 2020.
sentences:
- >-
Jørgen Lund Frederiksen is a Danish company specializing in high-quality
carpentry and joinery services. Established in 1976, the company offers
a wide range of services, including small repairs and large construction
projects, serving private clients, businesses, and public institutions.
With a team of 35-40 skilled employees, Jørgen Lund Frederiksen
emphasizes loyalty, flexibility, and responsibility, ensuring
professional handling of projects from start to finish. The company is
also ISPM-15 certified for manufacturing heat-treated wooden packaging,
such as pallets and transport boxes, adhering to international
standards.
- >-
KERT INVEST ApS, established on February 6, 2014, is a Danish private
limited company based in Helsingør. The company specializes in
purchasing, renovating, and selling real estate, as well as trading
securities and related activities.
([find-virksomhed.dk](https://find-virksomhed.dk/firma/kert-invest-aps-35658173?utm_source=openai))
The company's registered address is Grønnehavevej 7, 1, 3000 Helsingør.
([lei.bloomberg.com](https://lei.bloomberg.com/gleifs/view/549300IRNKZTEYCVK378?utm_source=openai))
- >-
GANNI is a Danish contemporary fashion brand founded in 2000 by Frans
Truelsen and revitalized in 2009 by husband-and-wife duo Ditte and
Nicolaj Reffstrup.
([en.wikipedia.org](https://en.wikipedia.org/wiki/Ganni?utm_source=openai))
The brand offers a wide range of women's apparel, footwear, eyewear,
bags, jewelry, and accessories, embodying a playful and effortless
aesthetic that redefines Scandinavian style.
([fashionunited.com](https://fashionunited.com/companies/ganni?utm_source=openai))
GANNI is committed to responsible practices, striving to make
environmentally friendly choices and improve daily.
([kristak.com](https://kristak.com/pages/ganni?utm_source=openai))
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
- cosine_accuracy
model-index:
- name: SentenceTransformer based on sentence-transformers/all-mpnet-base-v2
results:
- task:
type: triplet
name: Triplet
dataset:
name: Unknown
type: unknown
metrics:
- type: cosine_accuracy
value: 0.9343575239181519
name: Cosine Accuracy
SentenceTransformer based on sentence-transformers/all-mpnet-base-v2
This is a sentence-transformers model finetuned from sentence-transformers/all-mpnet-base-v2 on the csv dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: sentence-transformers/all-mpnet-base-v2
- Maximum Sequence Length: 384 tokens
- Output Dimensionality: 768 dimensions
- Similarity Function: Cosine Similarity
- Training Dataset:
- csv
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 384, 'do_lower_case': False}) with Transformer model: MPNetModel
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
'Gammellund Ejendomme is a Danish real estate company based in Odense, Denmark. The company specializes in property development, management, and sales services. As of 2024, it employs one person and has reported total assets of approximately 2.9 million DKK. The company is led by Director Brian Gammellund Rasmussen and was founded on October 9, 2020.',
"KERT INVEST ApS, established on February 6, 2014, is a Danish private limited company based in Helsingør. The company specializes in purchasing, renovating, and selling real estate, as well as trading securities and related activities. ([find-virksomhed.dk](https://find-virksomhed.dk/firma/kert-invest-aps-35658173?utm_source=openai)) The company's registered address is Grønnehavevej 7, 1, 3000 Helsingør. ([lei.bloomberg.com](https://lei.bloomberg.com/gleifs/view/549300IRNKZTEYCVK378?utm_source=openai))",
"GANNI is a Danish contemporary fashion brand founded in 2000 by Frans Truelsen and revitalized in 2009 by husband-and-wife duo Ditte and Nicolaj Reffstrup. ([en.wikipedia.org](https://en.wikipedia.org/wiki/Ganni?utm_source=openai)) The brand offers a wide range of women's apparel, footwear, eyewear, bags, jewelry, and accessories, embodying a playful and effortless aesthetic that redefines Scandinavian style. ([fashionunited.com](https://fashionunited.com/companies/ganni?utm_source=openai)) GANNI is committed to responsible practices, striving to make environmentally friendly choices and improve daily. ([kristak.com](https://kristak.com/pages/ganni?utm_source=openai))",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
Evaluation
Metrics
Triplet
- Evaluated with
TripletEvaluator
| Metric | Value |
|---|---|
| cosine_accuracy | 0.9344 |
Training Details
Training Dataset
csv
- Dataset: csv
- Size: 27,191 training samples
- Columns:
anchor,positive, andnegative - Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 105.8 tokens
- max: 335 tokens
- min: 3 tokens
- mean: 105.7 tokens
- max: 361 tokens
- min: 3 tokens
- mean: 108.92 tokens
- max: 383 tokens
- Samples:
anchor positive negative ES Holding Aalborg II was a non-financial holding company based in Støvring, Denmark. Established on November 20, 2019, the company was dissolved after a demerger on December 20, 2019. Its primary purpose was to hold shares in subsidiaries and associated companies, engage in investment and financing activities, and conduct other related business as deemed appropriate by management. The company was registered with a capital of 200,000 DKK and was managed by director Ejner Sørensen. The registered address was Guldbækvej 116, 9530 Støvring, Denmark.Malver Holding is a Danish non-financial holding company established on October 9, 2018. Based in Copenhagen, it focuses on investment and holding activities. The company is solely owned and directed by Nicklas Malver, who holds 100% ownership and voting rights. As of 2023, Malver Holding reported a net profit of DKK 437,389 and total assets amounting to DKK 3,889,000. The company's registered address is Hjørringgade 1, 3. tv., 2100 København Ø.Research Infrastructure Consultancy Services is a Danish firm specializing in providing expert guidance and support for the development and management of research infrastructures. Their services encompass strategic planning, project management, and operational optimization to enhance the efficiency and effectiveness of research facilities. By collaborating closely with clients, they aim to tailor solutions that meet the unique needs of various research institutions.SIGNCONCEPT is a Danish company specializing in the signage and advertising industry. Established in 2006, the company operates from its headquarters at Industrivej 60, 6740 Bramming, Denmark. SIGNCONCEPT offers a range of products and services related to signs and advertising materials, catering to various business needs. The company is registered under CVR number 30502590 and has a workforce of approximately 4 employees. For more information, visit their official website at http://www.signconcept.dk.Fleet Complete Danmark specializes in fleet management solutions, offering GPS tracking, electronic logbooks, and task management systems to optimize vehicle fleets and mobile workforces. Their services aim to enhance performance, reduce fuel consumption, and integrate seamlessly with existing operational systems.Lidemark Kirke is a historic church located in Bjæverskov, Denmark. Built in the 12th century in Romanesque style, the original structure comprises an apse, chancel, and nave. Around 1500, additions such as a porch, sacristy, and tower were incorporated. The church is primarily constructed from chalk and split fieldstone. Notable features include an altarpiece with two large columns and a painting titled "Christ in the Resurrection" by F. Storck from 1860. The tower houses a beautifully crafted organ built by K. Olsen in 1870, and the church has two bells dating from 1749 and 1842. A Renaissance gravestone commemorates Hartvig Høcken, a local nobleman who passed away in 1595. The church is part of a collaborative network with Bjæverskov, Gørslev, and Vollerslev churches, sharing clergy and a parish hall.Indian Guro ApS was a Danish company established in 2017. The company was dissolved after bankruptcy in 2023. (paqle.dk)BEG BESLAGSMEDIE ApS was a Danish company established in 2014, specializing in services related to livestock breeding. The company was dissolved after bankruptcy in August 2023. (paqle.dk)Gilleleje Lægecenter is a medical clinic located in Gilleleje, Denmark, offering same-day consultations for various health concerns. Patients can schedule appointments electronically via the clinic's website or the 'Minlæge' app, or by phone. The clinic provides both in-person and video consultations, emphasizing prompt and accessible healthcare services. (xn--gillelejelgecenter-xub.dk) - Loss:
TripletLosswith these parameters:{ "distance_metric": "TripletDistanceMetric.COSINE", "triplet_margin": 0.4 }
Evaluation Dataset
csv
- Dataset: csv
- Size: 1,432 evaluation samples
- Columns:
anchor,positive, andnegative - Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 106.0 tokens
- max: 325 tokens
- min: 3 tokens
- mean: 106.5 tokens
- max: 384 tokens
- min: 3 tokens
- mean: 107.84 tokens
- max: 384 tokens
- Samples:
anchor positive negative Casela ApS is a Danish holding company established on June 17, 2004, located at Haugesundvej 1, 2850 Nærum. The company primarily functions as a non-financial holding entity, owning capital interests in other companies. As of 2023, Casela ApS reported a net profit of 389,125 DKK and an equity of 10,192,000 DKK. The company is led by Director Klaus Kastrup-Larsen, who has been in position since November 7, 2023. The sole owner is Lasse Bo Steenholt, holding 100% of the shares and voting rights since June 17, 2004.EILKAER HOLDING is a Danish private limited company (Anpartsselskab) established on May 27, 2014. The company is located at Rejnstrupvej 15, 4250 Fuglebjerg, Denmark. Its primary purpose is to own shares and equity interests in other capital companies, manage assets, and engage in related activities as deemed appropriate by the management. The company is led by Director Thomas Bojesen Eilkær, who holds 100% ownership and voting rights. As of 2023, EILKAER HOLDING reported a gross profit of DKK -16,237 and a net income of DKK -4,768. The company is active and operates in the non-financial holding companies industry.Vesterled Frugtplantage, located on Fejø Island in Denmark, specializes in cultivating high-quality apples, pears, and plums. Benefiting from Fejø's favorable climate, the plantation produces fruit known for its exceptional taste and quality. To ensure freshness, Vesterled Frugtplantage operates its own storage and packing facilities, delivering freshly picked fruit from early August. The plantation adheres to both organic farming practices and the principles of Dansk I.P., minimizing chemical use for the benefit of consumers and the environment.S/I Margrethe Hjemmet is a private nursing home located in the heart of Roskilde, Denmark. The facility focuses on promoting active aging for both body and soul, providing a harmonious environment for its residents. With 44 apartments spread over two floors, each unit includes a private bathroom and wardrobe, and most feature a terrace or balcony. The home offers various amenities such as a cultural center, dining room, garden, workshop, hair salon, wellness room, and exercise equipment. Emphasizing the importance of family involvement, Margrethe Hjemmet views relatives as valuable resources and staff as catalysts for a meaningful, social, and active elderly life.Medarbejderfond for ansatte i ISS Facility Services is a foundation established on December 31, 2005, located at Gyngemose Parkvej 50, 2860 Søborg, Denmark. The foundation operates within the industry of general building cleaning services. As of now, there is no official website registered for this organization.Manbook.dk is a Danish company specializing in providing flexible staffing solutions across Denmark. They offer temporary workers for various tasks, including accounting, legal assignments, transportation, and security services. Their services are available 24/7, with the ability to dispatch personnel within two hours. Manbook.dk emphasizes creating a secure environment for both clients and employees, handling administrative tasks such as payroll, pensions, and holidays. Their office is located at Vallensbækvej 6, 2605 Brøndby, Denmark.Børnehuset Goethesgade is a self-governing, age-integrated daycare institution located in Sønderborg, Denmark. (boernehuset-goethesgade.aula.dk) Established in 1993, it offers a nurturing environment for children aged 0-6 years, comprising a nursery ('bobler') with 18 places and a kindergarten ('stjerner') with 38 places. (boernehuset-goethesgade.aula.dk) The institution emphasizes small group activities to cater to individual child development and foster strong peer relationships. (boernehuset-goethesgade.aula.dk) Situated centrally, it leverages its proximity to nature and the local community to enhance children's daily experiences. (boernehuset-goethesgade.aula.dk)Horsens Gymnasium & HF is an educational institution located in Horsens, Denmark, offering both the general upper secondary education (STX) and the higher preparatory examination (HF). The school provides a range of study programs, including music, biology and chemistry, social sciences, mathematics, physics, chemistry, geoscience, language studies, and biotechnology. It emphasizes a broad educational foundation, preparing students for further education. The institution also boasts an impressive art collection featuring works by artists such as Kasper Bonnén, Michael Kvium, Cathrine Raben Davidsen, and Poul Anker Bech. (horsens-gym.dk)Of Holding is a Danish company based in Aalborg SØ, Nordjylland, specializing in the management of companies and enterprises, particularly as a holding company. The key principal is Ole Frøkjær. Further details about the company's operations and services are not publicly available. - Loss:
TripletLosswith these parameters:{ "distance_metric": "TripletDistanceMetric.COSINE", "triplet_margin": 0.4 }
Training Hyperparameters
Non-Default Hyperparameters
eval_strategy: stepsper_device_train_batch_size: 6per_device_eval_batch_size: 6gradient_accumulation_steps: 3num_train_epochs: 2warmup_ratio: 0.1fp16: Truedataloader_pin_memory: False
All Hyperparameters
Click to expand
overwrite_output_dir: Falsedo_predict: Falseeval_strategy: stepsprediction_loss_only: Trueper_device_train_batch_size: 6per_device_eval_batch_size: 6per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 3eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 5e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1.0num_train_epochs: 2max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.1warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falseuse_ipex: Falsebf16: Falsefp16: Truefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Falseignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torchoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Falsedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Nonehub_always_push: Falsegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseinclude_for_metrics: []eval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters:auto_find_batch_size: Falsefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: Falseneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseeval_use_gather_object: Falseaverage_tokens_across_devices: Falseprompts: Nonebatch_sampler: batch_samplermulti_dataset_batch_sampler: proportional
Training Logs
| Epoch | Step | Training Loss | Validation Loss | cosine_accuracy |
|---|---|---|---|---|
| -1 | -1 | - | - | 0.8275 |
| 0.0662 | 100 | 0.4326 | - | - |
| 0.1324 | 200 | 0.2973 | - | - |
| 0.1655 | 250 | - | 0.0902 | 0.9141 |
| 0.1986 | 300 | 0.2914 | - | - |
| 0.2648 | 400 | 0.305 | - | - |
| 0.3310 | 500 | 0.2878 | 0.0920 | 0.9092 |
| 0.3972 | 600 | 0.308 | - | - |
| 0.4634 | 700 | 0.2722 | - | - |
| 0.4965 | 750 | - | 0.0805 | 0.9218 |
| 0.5296 | 800 | 0.2591 | - | - |
| 0.5958 | 900 | 0.2564 | - | - |
| 0.6620 | 1000 | 0.245 | 0.0815 | 0.9197 |
| 0.7282 | 1100 | 0.2395 | - | - |
| 0.7944 | 1200 | 0.2559 | - | - |
| 0.8274 | 1250 | - | 0.0818 | 0.9232 |
| 0.8605 | 1300 | 0.2581 | - | - |
| 0.9267 | 1400 | 0.2692 | - | - |
| 0.9929 | 1500 | 0.2544 | 0.0738 | 0.9302 |
| 1.0589 | 1600 | 0.2001 | - | - |
| 1.1251 | 1700 | 0.2112 | - | - |
| 1.1582 | 1750 | - | 0.0729 | 0.9302 |
| 1.1913 | 1800 | 0.1926 | - | - |
| 1.2575 | 1900 | 0.1801 | - | - |
| 1.3237 | 2000 | 0.1684 | 0.0706 | 0.9267 |
| 1.3899 | 2100 | 0.1831 | - | - |
| 1.4561 | 2200 | 0.1963 | - | - |
| 1.4892 | 2250 | - | 0.0719 | 0.9281 |
| 1.5223 | 2300 | 0.1878 | - | - |
| 1.5885 | 2400 | 0.2028 | - | - |
| 1.6547 | 2500 | 0.2045 | 0.0685 | 0.9323 |
| 1.7209 | 2600 | 0.1853 | - | - |
| 1.7871 | 2700 | 0.1793 | - | - |
| 1.8202 | 2750 | - | 0.0665 | 0.9344 |
| 1.8533 | 2800 | 0.1772 | - | - |
| 1.9195 | 2900 | 0.1722 | - | - |
| 1.9857 | 3000 | 0.1797 | 0.0658 | 0.9344 |
Framework Versions
- Python: 3.13.2
- Sentence Transformers: 4.1.0
- Transformers: 4.52.1
- PyTorch: 2.7.0+cu126
- Accelerate: 1.7.0
- Datasets: 3.6.0
- Tokenizers: 0.21.1
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
TripletLoss
@misc{hermans2017defense,
title={In Defense of the Triplet Loss for Person Re-Identification},
author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
year={2017},
eprint={1703.07737},
archivePrefix={arXiv},
primaryClass={cs.CV}
}