Edit model card

BERTopic_gregoryroose

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("sdantonio/BERTopic_gregoryroose")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 34
  • Number of training documents: 5019
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 france - confinement - paris - january - february 11 -1_france_confinement_paris_january
0 france - remplacement - chers - rn - chai 726 0_france_remplacement_chers_rn
1 france - confinement - musulmans - migrants - january 2122 1_france_confinement_musulmans_migrants
2 france - confinement - valeurs - adoxainfos - attestation 383 2_france_confinement_valeurs_adoxainfos
3 france - fabriquer - inondation - pleutrerie - noirs 215 3_france_fabriquer_inondation_pleutrerie
4 france - confinement - arbitre - migrants - victimes 171 4_france_confinement_arbitre_migrants
5 caceuphonie - 1950s - gregoryroose - injusticepouradrien - lundiabstinence 141 5_caceuphonie_1950s_gregoryroose_injusticepouradrien
6 france - confinement - morts - centrales - reconquete 127 6_france_confinement_morts_centrales
7 islamofolie - confinement - lea_antiracisme - valeurs - actuelles 126 7_islamofolie_confinement_lea_antiracisme_valeurs
8 france - musulmans - manifstopislamisme - migrants - gilets 108 8_france_musulmans_manifstopislamisme_migrants
9 france - migrants - moire - saint - liberte 101 9_france_migrants_moire_saint
10 nd675i9efw - xyc9onz4u6 - q6vpvgl3y8 - 7oft6k1w0t - fx1wrgvf62 83 10_nd675i9efw_xyc9onz4u6_q6vpvgl3y8_7oft6k1w0t
11 lf5oyn1fv1 - w6uo2fhmmv - xmsnt2c3i4 - rahvt7fxwq - coor4crsqz 64 11_lf5oyn1fv1_w6uo2fhmmv_xmsnt2c3i4_rahvt7fxwq
12 confinement - tegner - saoudien - continental - dissolutionccif 62 12_confinement_tegner_saoudien_continental
13 wfh0de8qtc - z7p2rmw7a0 - oi5af1xkjs - 4cgk8oudwa - xyrgxsovtb 54 13_wfh0de8qtc_z7p2rmw7a0_oi5af1xkjs_4cgk8oudwa
14 france - onveutlesnoms - tvlofficiel - fabriquer - musulmans 50 14_france_onveutlesnoms_tvlofficiel_fabriquer
15 france - confinement - racismes - migrants - aristocratique 45 15_france_confinement_racismes_migrants
16 germains - gationnisme - subversif - flexitarien - gory 44 16_germains_gationnisme_subversif_flexitarien
17 clairs - foehn - candidatures - liberte - sifflet 39 17_clairs_foehn_candidatures_liberte
18 ugsbuxrvm1 - zfxy482pj7 - slpng_giants_fr - ton6pf8fjf - z9zsctlaw1 38 18_ugsbuxrvm1_zfxy482pj7_slpng_giants_fr_ton6pf8fjf
19 greenconservatism - morts - gardetonvoile - treligionopeace - bandes 35 19_greenconservatism_morts_gardetonvoile_treligionopeace
20 ojim_france - boycottfrance - borisjohnson - claudechollet - confinementsaison2 32 20_ojim_france_boycottfrance_borisjohnson_claudechollet
21 paronym_france - caricatural - jesuistellementblancque - valeurs - pape 31 21_paronym_france_caricatural_jesuistellementblancque_valeurs
22 france - hypocrite - occurrence - lune - militants 29 22_france_hypocrite_occurrence_lune
23 france - jambanja - remplacement - sanctions - paul 26 23_france_jambanja_remplacement_sanctions
24 boycottdecathlon - france - boycott - prochaines - accouchement 25 24_boycottdecathlon_france_boycott_prochaines
25 sifaoui - boycottdecathlon - miyandoab - victimes - gory 20 25_sifaoui_boycottdecathlon_miyandoab_victimes
26 morts - souvenirs - murmure - engueuler - thermale 20 26_morts_souvenirs_murmure_engueuler
27 emmanuelmacron - vincent_vauclin - franceinter - veillez - prises 18 27_emmanuelmacron_vincent_vauclin_franceinter_veillez
28 paronym_france - medinrecords - merciauxsoignants - krgkb7lfw3 - f9m1vggm6g 18 28_paronym_france_medinrecords_merciauxsoignants_krgkb7lfw3
29 azecmnrh5t0 - pxdoezdfwj - ubqgl9qtfs - bqwibkek5p - cvp5sypmkg 15 29_azecmnrh5t0_pxdoezdfwj_ubqgl9qtfs_bqwibkek5p
30 france - pylo - january - cologisme - february 14 30_france_pylo_january_cologisme
31 confinement - valeurs - adoxainfos - conflanssaintehonorine - iran 13 31_confinement_valeurs_adoxainfos_conflanssaintehonorine
32 confinementjour6 - boycottdecathlon - exceptionnelle - marie - gilets 13 32_confinementjour6_boycottdecathlon_exceptionnelle_marie

Training hyperparameters

  • calculate_probabilities: False
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None

Framework versions

  • Numpy: 1.23.5
  • HDBSCAN: 0.8.38.post1
  • UMAP: 0.5.6
  • Pandas: 2.2.2
  • Scikit-Learn: 1.5.1
  • Sentence-transformers: 3.0.1
  • Transformers: 4.44.2
  • Numba: 0.60.0
  • Plotly: 5.24.0
  • Python: 3.10.12
Downloads last month
1
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.