Edit model card

bertopic-umap15-hbd15-topn15

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("ahessamb/bertopic-umap15-hbd15-topn15")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 105
  • Number of training documents: 14320
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 market - price - nft - said - cryptocurrency 15 -1_market_price_nft_said
0 korea - funds - attack - hackers - fraud 6725 0_korea_funds_attack_hackers
1 usd - 500 - near - bitcoin - consolidating 706 1_usd_500_near_bitcoin
2 sized - digest - news - blockchain - radar 417 2_sized_digest_news_blockchain
3 merge - ethereum - proof - fork - beacon 236 3_merge_ethereum_proof_fork
4 rate - cpi - hikes - fomc - bitcoin 209 4_rate_cpi_hikes_fomc
5 luna - ustc - entropy - proposal - terraform 207 5_luna_ustc_entropy_proposal
6 brands - meta - worlds - immersive - decentraland 206 6_brands_meta_worlds_immersive
7 russia - sanctions - crypto - ruble - settlements 187 7_russia_sanctions_crypto_ruble
8 gensler - securities - coinbase - industry - regulation 178 8_gensler_securities_coinbase_industry
9 blockchain - web3 - gamers - p2e - industry 174 9_blockchain_web3_gamers_p2e
10 miners - carbon - power - bitcoin - report 157 10_miners_carbon_power_bitcoin
11 funding - round - ventures - capital - gamestop 151 11_funding_round_ventures_capital
12 xrp - ripple - price - level - resistance 146 12_xrp_ripple_price_level
13 etf - blackrock - grayscale - bitcoin - futures 145 13_etf_blackrock_grayscale_bitcoin
14 web3 - disco - mcmullen - identity - platforms 144 14_web3_disco_mcmullen_identity
15 protocols - decentralized - newsletter - cefi - lending 141 15_protocols_decentralized_newsletter_cefi
16 inu - lucie - meme - tokens - ecosystem 139 16_inu_lucie_meme_tokens
17 ftx - sam - bankman - bankruptcy - ceo 132 17_ftx_sam_bankman_bankruptcy
18 tether - usdt - documents - coindesk - stablecoins 123 18_tether_usdt_documents_coindesk
19 el - bukele - nayib - bitcoin - x93 120 19_el_bukele_nayib_bitcoin
20 dogecoin - musk - meme - twitter - level 114 20_dogecoin_musk_meme_twitter
21 26 - resistance - near - btc - bulls 106 21_26_resistance_near_btc
22 nft - opensea - doppel - marketplaces - rug 101 22_nft_opensea_doppel_marketplaces
23 cfds - traders - assets - cryptocurrency - adoption 95 23_cfds_traders_assets_cryptocurrency
24 difficulty - hashrate - bitcoin - network - height 90 24_difficulty_hashrate_bitcoin_network
25 ubi - cointelegraph - simonin - bitcoin - income 88 25_ubi_cointelegraph_simonin_bitcoin
26 coinbase - bitkey - india - ceo - fees 85 26_coinbase_bitkey_india_ceo
27 donated - russia - invasion - transformation - donors 83 27_donated_russia_invasion_transformation
28 celsius - cel - withdrawals - company - mashinsky 81 28_celsius_cel_withdrawals_company
29 nfts - collections - million - floor - cryptopunk 81 29_nfts_collections_million_floor
30 blockchain - bvm - mvc - maestro - databases 78 30_blockchain_bvm_mvc_maestro
31 crypto - merchants - mastercard - feature - cashapp 78 31_crypto_merchants_mastercard_feature
32 ada - cardano - bearish - satoshis - market 76 32_ada_cardano_bearish_satoshis
33 nft - sartoshi - artists - snoop - community 75 33_nft_sartoshi_artists_snoop
34 solana - bearish - outages - fibonacci - resistance 72 34_solana_bearish_outages_fibonacci
35 hinman - ripple - speech - emails - xrp 71 35_hinman_ripple_speech_emails
36 oecd - taxation - framework - india - electronic 70 36_oecd_taxation_framework_india
37 terraform - montenegro - korea - x93 - milojko 69 37_terraform_montenegro_korea_x93
38 order - securities - freeze - restraining - cyprus 68 38_order_securities_freeze_restraining
39 manchester - sponsorship - bcci - com - fans 68 39_manchester_sponsorship_bcci_com
40 surveyed - millennials - managers - crypto - report 67 40_surveyed_millennials_managers_crypto
41 whales - eth - market - transactions - usdt 66 41_whales_eth_market_transactions
42 binance - kazakhstan - changpeng - expansion - 500m 61 42_binance_kazakhstan_changpeng_expansion
43 twitter - musk - metatime - jack - yaccarino 59 43_twitter_musk_metatime_jack
44 rsi - price - line - altcoin - bullish 59 44_rsi_price_line_altcoin
45 china - huobi - hkma - regulatory - companies 57 45_china_huobi_hkma_regulatory
46 token - leo - surged - tlos - graph 57 46_token_leo_surged_tlos
47 cbdcs - governor - banks - mit - project 56 47_cbdcs_governor_banks_mit
48 daos - chorus - lieberman - decentralized - organizations 51 48_daos_chorus_lieberman_decentralized
49 fungible - nonfungible - tokens - nft - 2021 51 49_fungible_nonfungible_tokens_nft
50 altcoins - levels - overhead - support - bounce 50 50_altcoins_levels_overhead_support
51 yuan - digital - tax - cbdc - wallets 43 51_yuan_digital_tax_cbdc
52 depot - company - invest - banking - america 42 52_depot_company_invest_banking
53 markets - advice - bull - hodlers - nasdaily 42 53_markets_advice_bull_hodlers
54 eth - level - breakout - tradingview - analysts 38 54_eth_level_breakout_tradingview
55 nethereum - usd - struggling - resistance - performers 37 55_nethereum_usd_struggling_resistance
56 ecoterra - trending - swords - presale - neo 36 56_ecoterra_trending_swords_presale
57 securities - market - binance - coinbase - week 34 57_securities_market_binance_coinbase
58 staking - eigenlayer - sip - ethereum - tokens 33 58_staking_eigenlayer_sip_ethereum
59 founder - ethereum - forgotten - values - twitter 33 59_founder_ethereum_forgotten_values
60 bnb - bauer - upgrade - ecosystem - network 32 60_bnb_bauer_upgrade_ecosystem
61 price - rsi - bullish - chart - resistance 32 61_price_rsi_bullish_chart
62 expiry - week - billion - derivatives - bet 32 62_expiry_week_billion_derivatives
63 vasil - fork - mainnet - newest - scalability 31 63_vasil_fork_mainnet_newest
64 microstrategy - saylor - btc - rumor - billion 31 64_microstrategy_saylor_btc_rumor
65 metamask - browser - wallets - features - allows 31 65_metamask_browser_wallets_features
66 uae - east - chainalysis - singapore - emerging 31 66_uae_east_chainalysis_singapore
67 outflows - etps - products - week - funds 31 67_outflows_etps_products_week
68 polygon - zcash - kakarot - starknet - protocol 29 68_polygon_zcash_kakarot_starknet
69 japanese - jvcea - stablecoin - x93 - fatf 29 69_japanese_jvcea_stablecoin_x93
70 asic - miner - gpu - mi300x - ks3 28 70_asic_miner_gpu_mi300x
71 arrows - voyager - dcg - genesis - bankruptcy 28 71_arrows_voyager_dcg_genesis
72 axie - infinity - program - ronin - upgrades 26 72_axie_infinity_program_ronin
73 withdrawals - platform - freeway - halted - babel 26 73_withdrawals_platform_freeway_halted
74 addresses - eth - glassnode - underwater - cryptos 26 74_addresses_eth_glassnode_underwater
75 bottoming - dip - markets - chain - altcoins 25 75_bottoming_dip_markets_chain
76 mica - eu - conglomerates - jurisdictions - framework 25 76_mica_eu_conglomerates_jurisdictions
77 liquidations - resting - bid - order - 200 25 77_liquidations_resting_bid_order
78 listings - missed - announcements - usdt - exchanges 25 78_listings_missed_announcements_usdt
79 cbdc - ripple - border - imf - currencies 25 79_cbdc_ripple_border_imf
80 announcements - delisting - pair - listing - collection 24 80_announcements_delisting_pair_listing
81 treasury - mixers - sanctioning - github - prank 24 81_treasury_mixers_sanctioning_github
82 polkadot - parachains - auctions - opengov - referenda 24 82_polkadot_parachains_auctions_opengov
83 hedge - investors - crypto - traditional - enriquez 23 83_hedge_investors_crypto_traditional
84 level - resistance - cj - price - cryptocurrency 23 84_level_resistance_cj_price
85 nexo - citibank - vauld - acquisitions - launched 22 85_nexo_citibank_vauld_acquisitions
86 huobi - li - citing - pantronics - rumours 22 86_huobi_li_citing_pantronics
87 nft - textbook - pill - sweeney - x9caccessible 21 87_nft_textbook_pill_sweeney
88 bored - yacht - apecoin - justin - collection 21 88_bored_yacht_apecoin_justin
89 apecoin - pattern - chart - head - roc 21 89_apecoin_pattern_chart_head
90 subscription - investment - binance - dual - 06 20 90_subscription_investment_binance_dual
91 halving - correlation - nasdaq - 2024 - powell 20 91_halving_correlation_nasdaq_2024
92 announcements - delisting - listing - crypto - slice 20 92_announcements_delisting_listing_crypto
93 adoption - nigeria - kucoin - lawful - aza 18 93_adoption_nigeria_kucoin_lawful
94 staff - chatbot - layoffs - hr - terminations 18 94_staff_chatbot_layoffs_hr
95 ethereum - network - batching - costs - tx 18 95_ethereum_network_batching_costs
96 suarez - desantis - salary - city - candidate 18 96_suarez_desantis_salary_city
97 circle - stablecoin - integrating - cybavo - worldpay 17 97_circle_stablecoin_integrating_cybavo
98 stablecoins - paypal - plabasan - mhel - converge22 17 98_stablecoins_paypal_plabasan_mhel
99 week - tokens - tvl - locked - analytical 17 99_week_tokens_tvl_locked
100 impairment - company - holdings - incurred - btc 17 100_impairment_company_holdings_incurred
101 cbdc - familiarity - euro - ecb - respondents 17 101_cbdc_familiarity_euro_ecb
102 marketplace - opensea - popularize - ftx - teaming 16 102_marketplace_opensea_popularize_ftx
103 executive - leaving - bitstamp - genesis - samir 15 103_executive_leaving_bitstamp_genesis

Training hyperparameters

  • calculate_probabilities: False
  • language: None
  • low_memory: False
  • min_topic_size: 15
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 5
  • verbose: False

Framework versions

  • Numpy: 1.22.4
  • HDBSCAN: 0.8.29
  • UMAP: 0.5.3
  • Pandas: 1.5.3
  • Scikit-Learn: 1.2.2
  • Sentence-transformers: 2.2.2
  • Transformers: 4.30.2
  • Numba: 0.56.4
  • Plotly: 5.13.1
  • Python: 3.10.12
Downloads last month
6