SentenceTransformer based on tyuan73/ModernBERT-large-with-new-tokenizer

This is a sentence-transformers model finetuned from tyuan73/ModernBERT-large-with-new-tokenizer on the processed_yahoo_finance_stockmarket_news dataset. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False}) with Transformer model: ModernBertModel 
  (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("tyuan73/finetuned-modernbert-finance-large")
# Run inference
sentences = [
    "Stocks will rally 5% in just 5 days after the Fed's rate decision this week, Fundstrat says",
    'Photo by Cindy Ord/Getty Images for YahooThe stock market could see a 5% gain over the next week, according to Fundstrat\'s Tom Lee.The rally would be sparked by a dovish Fed FOMC meeting on Wednesday that all but confirms imminent interest rate cuts."These are significant gains, implying the S&P 500 could gain 200-300 points in the next week," Lee said.The stock market is poised to surge as much as 5% in the next week, according to a Wednesday note from Fundstrat.The research firm said it expects an explosive rally in theS&P 500to materialize in the five days following the Federal Reserve\'s FOMC meeting on Wednesday.While the Fed isnot expected to cut interest rates at its July FOMC meeting,it is expected to signal that a rate cut is all but certain when it meets again in September."The key premise is the Fed is likely to commit to a September rate cut of at least 25bp. A possibility of more than that is not necessary. And while bond markets have priced in 100% probability of this, equity investors likely will not be convinced until the Fed affirms this as such," Fundstrat\'s Tom Lee said.Thenear certainty of a rate cut from the Fedin September should spark a risk-on rally for stocks, especially given that theNasdaq 100has already experienced a near-10% correctionin recent weeks, according to the note."Overall, we believe risk-on moment is coming," Lee said.Lee\'s confidence in a strong rally post-Fed meeting is based on the fact that recent Fed meetings have sparked a big rally in stocks.In the past two years, when stocks were down heading into a Fed FOMC meeting, stocks saw a five-day gain of as much as 5.5% and a median gain of 3.4%."These are significant gains, implying the S&P 500 could gain 200-300 points in the next week. This is very compelling in our view," Lee said.And while a 25 basis point interest rate cut may not seem like much, it has real-world economic impacts that could ultimately influence the US housing market in a big way."Here are some tangible reasons a Fed cut makes sense: 30-year mortgage has excess spread to 10-year due to uncertainty. The spread could shrink from 270 basis points to 170 basis points (50-year average)," Lee explained.Interest rate cuts from the Fed, however small, would also help alleviate an ongoing slowdown in the housing, durables, and auto markets, Lee said.A 5% rally in the S&P 500 would catapult the index to fresh record highs, completely erasing its 5% decline over the past few weeks.Read the original article onBusiness Insider',
    'AMBALA, India (Reuters) - Indian farmers demanding higher prices for their crops will postpone a planned protest march to New Delhi until unions hold another round of talks with government ministers on Sunday.\n\nAgriculture Minister Arjun Munda, who met farmers\' representatives on Thurdsay along with Commerce Minister Piyush Goyal and Minister of State for Home Affairs Nityanand Rai, said the talks were "positive".\n\n"We have decided that the next meeting to take the discussion forward will take place on Sunday at 6 pm...We believe we will all find a solution together peacefully," he told reporters following Thursday\'s meeting.\n\nProtest leader Jagjit Singh Dallewal also told reporters the farmers would hold off their march for now.\n\n"When the meetings have started, if we move forward (towards Delhi) then how will meetings happen?" Dallewal told reporters, adding that the protest "will continue peacefully".\n\nThousands of farmers had embarked on the "Delhi Chalo", or "Let\'s go to Delhi" march earlier this week to press the government to set a minimum price for their produce, but they were stopped by security forces about 200 kms (125 miles) away from the capital, triggering clashes.\n\nThe protests erupted a few months before India is due to hold national elections in which Prime Minister Narendra Modi is seeking a third term. Farmers form an influential voting bloc.\n\nThe farmers remained camped on the border between Punjab and Haryana states on Friday. Security forces have used concrete and metal barricades, as well as drones carrying tear gascanisters, to stop them for advancing.\n\nThe protest comes two years after Modi\'s government, following a similar protest movement, repealed some farm laws and promised to find ways to ensure support prices for all produce.\n\n(Writing by Sakshi Dayal; editing by Miral Fahmy)',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 1024]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]

Evaluation

Metrics

Triplet

Metric Value
cosine_accuracy 1.0

Training Details

Training Dataset

processed_yahoo_finance_stockmarket_news

  • Dataset: processed_yahoo_finance_stockmarket_news at 6949dc8
  • Size: 14,845 training samples
  • Columns: anchor, positive, and negative
  • Approximate statistics based on the first 1000 samples:
    anchor positive negative
    type string string string
    details
    • min: 7 tokens
    • mean: 16.02 tokens
    • max: 42 tokens
    • min: 10 tokens
    • mean: 788.29 tokens
    • max: 8192 tokens
    • min: 104 tokens
    • mean: 270.14 tokens
    • max: 669 tokens
  • Samples:
    anchor positive negative
    If You'd Invested $1,000 in Amazon Stock 10 Years Ago, Here's How Much You'd Have Today It might be hard to imagine, butAmazon(NASDAQ: AMZN)began only 30 years ago as an online bookseller. Now the site sells a wide range of goods. The business has grown quickly and includes the popular Amazon Prime subscription service, electronic devices, and Amazon Web Services (AWS).The shares have also generated a lot of wealth for investors. Investors who bought shares a decade ago, and held them, would have seen an increase of almost 1,000%.Trouncing the marketAmazon had itsinitial public offering (IPO)in 1997, when its sales were about $148 million for the year. The figure grew to nearly $575 billion last year.But you didn't have to buy the shares at the IPO price to make a lot of money. Over the last decade, Amazon's shares have appreciated 945%. That easily bested theS&P 500's 227% total return.Even starting with a relatively small $1,000 just 10 years ago, you would now have about $10,500. Placing the same amount in the S&P 500 would've resulted in about $3,300.Amazon's stock wi... My husband of almost a year went home with another girl while we were freshly split up. Nothing happened, but he still went home with her from a bar and slept over in her bed. I am pissed about that, and so we split then. He since came back to me apologizing..begging for another chance...and I gave him one. Things went well for a couple of weeks, but I got drunk one night and brought it up, so we fought about it. Then at random times I think about it and bring it up to him. He gets so angry every time it comes up and has asked if he has to deal with hearing about this every single day. He asked me to decide if I could stay with him and deal with this or not. He took all of his stuff out of my place (we are separated, but he stays with me since we are working things out) and is staying in his place because we argued. We talked and I thought it was a good talk. I usually blame everything on him,but today I took some blame and truly want to work it out.He just got frustrated,what to do
    Chemicals Company Dow Cuts Outlook Amid Production Challenges Chemicals Company Dow Cuts Outlook Amid Production ChallengesDow Inc.(NYSE:DOW) shares are trading lower after it revised its third-quarter outlook.The company lowered its guidance for revenue to around $10.6 billion (vs. consensus $11.0 billion) from $11.1 billionearlierandexpects operating EBITDA of around $1.3 billion.The revised outlook reflects a major unplanned event at a Texas ethylene cracker in late July, alongside higher input costs and margin compression in Europe.However, these challenges are expected to be partly mitigated by improved pricing and feedstock costs in North America for Packaging & Specialty Plastics.Also Read:Linde Invests Over $2B For Clean Hydrogen, Inks Supply Deal With Dow: DetailsJim Fitterling, chair and chief executive officer, said, “As we look to the fourth quarter, we expect typical seasonality in demand. However, we expect a positive impact from lower turnaround costs, higher operating rates as we ramp up our Texas cracker, and fewer weather-relate... I have a fourth degree polynomial, namely:\n\ny = 0.0003x4 - 0.0272x3 + 0.6806x2 - 0.8345x + 0.5659\n\nWhen plotting this line, from points x=560 to x=790, the way that that curve looks...I want to know at what point am i going to get the greatest increase in y, with the least increase in x...do you get waht i'm saying? At what point, like at say 700 to 701, is that where i am going to see the largest y increase with the least x increase. I want it to be proportaional,like for example:\n\nMaybe from 700-701 is just an increase of 2 in the y direction, but if going from 703-705 i get an incrase of 5, thats more benificial for me, cause i more than doubled my input...get waht i'm saying??\n\nPlease help here, i'm not sure if i should be taking the derivitive or what.\n\nThanks
    Beyond Meat Stock Is Down 20% This Year, Time to Buy? Beyond Meat Inc.(NASDAQ: BYND)continues to struggle with its mission of delivering "the future of protein" amid weak sales and large recurring losses. That trend continued as the plant-based meat leader posted its latest earnings report on May 8.The market remains skeptical that the company can manage a growth rebound. The stock is now down more than 40% over the past year.Is now the time to buy Beyond Meat, or are investors better off leaving this one off their plate?A recap of Beyond Meat's Q1 earnings reportFor the first quarter of 2024, Beyond Meat's revenue fell by 18% year over year to $75.6 million, marking the eighth consecutive quarter in decline.The top-line weakness was driven by a 16.1% decrease in the volume of products sold despite an effort to stir up demand through promotional discounting. The company also cited the impact of discontinuing its "Beyond Meat Jerky" along with lower "Beyond Chicken" sales, against large initial orders from Europe in early 2023.Demand was s... ASTOR FAMILY, a famous American
    family representing one of the three or four
    greatest private properties in the world. A
    family in the Old World sense, — a territorial
    aristocracy, impossible to destroy, and fortified
    with legal immunities and privileges, — can
    hardly be founded in America; but the Astors
    have approached it as nearly as our institutions
    will admit. They form a group of immense
    hereditary real-estate owners, with holdings so
    solidly based and well distributed in the metropolis
    of America that no apparent catastrophe
    save a failure of heirs could extinguish it; and
    though originally springing from mercantile
    business, removed by some three-quarters of a
    century from its actual conduct. For many years
    they were known as “the landlords of New
    York,” and the best of landlords, prompt, just and
    courteous; they still probably form the largest
    set of individual real-estate holders. The
    family is also connected with notable municipal
    charities and public foundations. See
    (1763...
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

Evaluation Dataset

processed_yahoo_finance_stockmarket_news

  • Dataset: processed_yahoo_finance_stockmarket_news at 6949dc8
  • Size: 14,845 evaluation samples
  • Columns: anchor, positive, and negative
  • Approximate statistics based on the first 1000 samples:
    anchor positive negative
    type string string string
    details
    • min: 7 tokens
    • mean: 16.07 tokens
    • max: 32 tokens
    • min: 11 tokens
    • mean: 789.07 tokens
    • max: 7142 tokens
    • min: 101 tokens
    • mean: 269.38 tokens
    • max: 711 tokens
  • Samples:
    anchor positive negative
    What the Tech Bubble in 2000 May Tell Us About the Stock Market Today Spencer Platt / Getty ImagesKey TakeawaysRecent losses for tech stocks have coincided with gains for utilities, healthcare, and consumer staples, an echo of the 2000 dotcom bubble, according to a recent Deutsche Bank note.Uncertainty about the U.S. election and monetary policy could continue to weigh on tech stocks in the coming months, according to one analyst.Even with the bursting of the dotcom bubble, investments in tech have significantly outperformed the broader market over the long-term.There’s reason to believe history is repeating itself, according to a recent Deutsche Bank note that draws parallels between the current rotation out of tech mega-caps and the bursting of the tech bubble in 2000.In the 9 months after the bubble reached its peak in March 2000, tech stocks fell more than 50%. The Consumer Staples, Utilities, and Healthcare sectors, meanwhile, each rose by more than 35%.Deutsche BankResearch analyst Jim Reid sees similarities between that moment and thecurrent secto... My friend has a 1 year old girl and her husband cheated on her with the receptionist at the doctors office they take the little girl. They are now divorced. He has 1 over night visit per week sometimes her show sometimes he doesn't. And when he does he often times drops her off at his mothers or drags the kid from girl friend to girl friends house. How do you convince him that his kid should come first. Her mom is a wonderful mother and person that puts her first. The way he treats her kid has her in turmoil. Please help.
    Europe’s stock leaders are fading in bad sign for future returns (Bloomberg) — The engines behind two years of European stock gains are losing power, leaving the region’s equities facing a void at a time when concerns over slowing growth and China tensions are testing investor confidence.Most Read from BloombergHousing’s Worst Crisis in Decades Reverberates Through 2024 RaceAn Affordable Nomadic Home Design Struggles to Adapt to Urban LifeUS Driving and Congestion Rates Are Higher Than EverA City Finds Success Using 'Trees as Medicine'The Hague Is World’s First City to Ban Oil and Air Travel AdsA luxury sector led by LVMH Moët Hennessy Louis Vuitton SE (MC.PA,LVMHF) has tumbled over the past six months along with automotive firms, while in more recent months healthcare heavyweights such as Novo Nordisk A/S (NVO,NOVO-B.CO) and tech leaders including ASML Holding NV (ASML) have slid from their peaks. And with no obvious candidates to take the baton, the region’s equity performance has been left looking exposed.Already this year, investors have withdra... I'm quite conservative for someone my age. I don't like heavy drinking (I do drink. I just don't get wasted) and I don't like wild parties. I'm very religious and I'm looking for someone who shares my convictions who I will eventually hopefully marry. The problem that I seem to have is most people my age or either non-religious partiers or their fundamentalist Bible beaters who don't know how to do much other than quote scriptures. I've tried online dating and I've tried asking out people at church. Why can't I find a normal person that shares my beliefs? The ones I do find tend to always dating someone else before I ever get a chance.
    2 Billionaire Investors Are Selling Nvidia and Buying This Artificial Intelligence (AI) Stock Instead Great investing minds appear to be thinking alike about two top AI stocks. "A Song" (Once—and only once—you gave), a poem by Edwin Arnold
    "A Song" (Gentle nymphs, be not refusing), a poem by William Browne
    "A Song" (The sparkling eye, the mantling cheek), a poem by William Cowper
    "A Song" (On a summer's day as I sat by a stream), a poem by Paul Laurence Dunbar
    "A Song" (Thou art the soul of a summer's day), a poem by Paul Laurence Dunbar
    "A Song" (My heart has flown on wings to you, away), a poem by Francis Ledwidge
    "A Song" (Yes! "lower to the level"), a poem by Frances Sargent Osgood
    "A Song" (Call me pet names, dearest! Call me a bird), a poem by Frances Sargent Osgood
    "A Song" (O, red is the English rose), a poem by Charles Alexander Richmond
    "A Song" (I thought no more was needed), a poem by William Butler Yeats

    See also
    Song
    The Song
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim"
    }
    

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: steps
  • per_device_train_batch_size: 2
  • per_device_eval_batch_size: 2
  • num_train_epochs: 1
  • warmup_ratio: 0.1
  • fp16: True
  • gradient_checkpointing: True
  • batch_sampler: no_duplicates

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: steps
  • prediction_loss_only: True
  • per_device_train_batch_size: 2
  • per_device_eval_batch_size: 2
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 1
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 5e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1.0
  • num_train_epochs: 1
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.1
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: False
  • fp16: True
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: None
  • hub_always_push: False
  • gradient_checkpointing: True
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • include_for_metrics: []
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • dispatch_batches: None
  • split_batches: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • use_liger_kernel: False
  • eval_use_gather_object: False
  • average_tokens_across_devices: False
  • prompts: None
  • batch_sampler: no_duplicates
  • multi_dataset_batch_sampler: proportional

Training Logs

Epoch Step Training Loss Validation Loss test_cosine_accuracy
0.1042 100 1.8001 2.5929 -
0.2083 200 1.3643 0.9732 -
0.3125 300 0.8984 1.1093 -
0.4167 400 1.0902 0.9527 -
0.5208 500 0.913 0.5731 -
0.625 600 0.8656 1.4353 -
0.7292 700 0.807 0.7052 -
0.8333 800 0.7846 0.7704 -
0.9375 900 0.5951 0.9136 -
-1 -1 - - 1.0

Framework Versions

  • Python: 3.11.11
  • Sentence Transformers: 3.4.1
  • Transformers: 4.49.0
  • PyTorch: 2.5.1+cu124
  • Accelerate: 1.3.0
  • Datasets: 3.3.2
  • Tokenizers: 0.21.0

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

MultipleNegativesRankingLoss

@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply},
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}
Downloads last month
4
Safetensors
Model size
396M params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for tyuan73/finetuned-modernbert-finance-large

Finetuned
(1)
this model

Evaluation results