BERTopic
This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
Usage
To use this model, please install BERTopic:
pip install -U bertopic
You can use the model as follows:
from bertopic import BERTopic
topic_model = BERTopic.load("keonju/BERTopic")
topic_model.get_topic_info()
Topic overview
- Number of topics: 158
- Number of training documents: 10158
Click here for an overview of all topics.
Topic ID | Topic Keywords | Topic Frequency | Label |
---|---|---|---|
-1 | and - the - of - in - to | 10 | -1_and_the_of_in |
0 | holocene - china - the - monsoon - bp | 3858 | 0_holocene_china_the_monsoon |
1 | energy - biofuels - production - biodiesel - bioenergy | 291 | 1_energy_biofuels_production_biodiesel |
2 | coal - coals - the - basin - seams | 248 | 2_coal_coals_the_basin |
3 | yr - holocene - the - bp - and | 205 | 3_yr_holocene_the_bp |
4 | hg - mercury - mehg - of hg - in | 202 | 4_hg_mercury_mehg_of hg |
5 | ch4 - methane - emissions - fluxes - flux | 159 | 5_ch4_methane_emissions_fluxes |
6 | data - forest - spectral - for - mapping | 118 | 6_data_forest_spectral_for |
7 | bp - the - holocene - pollen - lake | 116 | 7_bp_the_holocene_pollen |
8 | wetlands - wetland - and - are - of | 104 | 8_wetlands_wetland_and_are |
9 | co2 - ecosystem - nee - exchange - net | 103 | 9_co2_ecosystem_nee_exchange |
10 | species - of - fen - the - restoration | 100 | 10_species_of_fen_the |
11 | peat - tropical - peatlands - palm - peatland | 98 | 11_peat_tropical_peatlands_palm |
12 | pb - lead - atmospheric - metal - deposition | 96 | 12_pb_lead_atmospheric_metal |
13 | the - lake - of the - of - poland | 93 | 13_the_lake_of the_of |
14 | pm2 - haze - burning - air - aerosol | 90 | 14_pm2_haze_burning_air |
15 | doc - catchments - carbon - organic carbon - export | 88 | 15_doc_catchments_carbon_organic carbon |
16 | the - carbon - of - co2 - of the | 73 | 16_the_carbon_of_co2 |
17 | wetland - wetlands - classification - mapping - and | 69 | 17_wetland_wetlands_classification_mapping |
18 | uv - ozone - o3 - isoprene - elevated | 67 | 18_uv_ozone_o3_isoprene |
19 | mediterranean - the - glacial - iberian - during | 66 | 19_mediterranean_the_glacial_iberian |
20 | media - compost - growing media - growing - biochar | 63 | 20_media_compost_growing media_growing |
21 | 137cs - of 137cs - sup - ce sup - radiocaesium | 63 | 21_137cs_of 137cs_sup_ce sup |
22 | testate - amoebae - testate amoebae - of testate - amoeba | 62 | 22_testate_amoebae_testate amoebae_of testate |
23 | peat - pyrolysis - lignin - gc - of | 62 | 23_peat_pyrolysis_lignin_gc |
24 | cu - zn - metals - peat - elements | 62 | 24_cu_zn_metals_peat |
25 | alkanes - alkane - chain - values - plants | 61 | 25_alkanes_alkane_chain_values |
26 | permafrost - active layer - thermal - ground - layer | 60 | 26_permafrost_active layer_thermal_ground |
27 | streams - diatom - species - macroinvertebrate - stream | 60 | 27_streams_diatom_species_macroinvertebrate |
28 | records - the - of - record - ireland | 60 | 28_records_the_of_record |
29 | water - flow - groundwater - recharge - runoff | 59 | 29_water_flow_groundwater_recharge |
30 | habitat - species - breeding - bird - nest | 57 | 30_habitat_species_breeding_bird |
31 | brgdgts - gdgts - glycerol - brgdgt - branched | 56 | 31_brgdgts_gdgts_glycerol_brgdgt |
32 | deposition - nitrogen - nitrogen deposition - sphagnum - of | 55 | 32_deposition_nitrogen_nitrogen deposition_sphagnum |
33 | oil sands - sands - fen - oil - reclamation | 54 | 33_oil sands_sands_fen_oil |
34 | fire - burned - severity - burning - post fire | 54 | 34_fire_burned_severity_burning |
35 | acidification - deposition - acid - ph - catchment | 54 | 35_acidification_deposition_acid_ph |
36 | farm - land - agricultural - farmers - policy | 53 | 36_farm_land_agricultural_farmers |
37 | cdom - doc - dom - dissolved organic - dissolved | 53 | 37_cdom_doc_dom_dissolved organic |
38 | redd - indonesia - deforestation - in indonesia - forest | 50 | 38_redd_indonesia_deforestation_in indonesia |
39 | ash - wood ash - wood - growth - of wood | 49 | 39_ash_wood ash_wood_growth |
40 | fungal - fungi - mycorrhizal - species - root | 49 | 40_fungal_fungi_mycorrhizal_species |
41 | stand - growth - models - tree - stands | 49 | 41_stand_growth_models_tree |
42 | smouldering - smoldering - spread - peat - combustion | 49 | 42_smouldering_smoldering_spread_peat |
43 | pollen - of pollen - vegetation - of - from | 49 | 43_pollen_of pollen_vegetation_of |
44 | arsenic - as - of as - fe - of arsenic | 49 | 44_arsenic_as_of as_fe |
45 | ch4 - methane - production - peat - methanogenesis | 47 | 45_ch4_methane_production_peat |
46 | africa - the - bp - south - late | 46 | 46_africa_the_bp_south |
47 | soc - carbon - soil - stocks - land | 45 | 47_soc_carbon_soil_stocks |
48 | soil - organic - carbon - soil organic - soils | 45 | 48_soil_organic_carbon_soil organic |
49 | wetlands - constructed - wetland - treatment - phosphorus | 43 | 49_wetlands_constructed_wetland_treatment |
50 | microbial - rare - soil - bacterial - diversity | 43 | 50_microbial_rare_soil_bacterial |
51 | litter - decomposition - mass loss - litter decomposition - mass | 39 | 51_litter_decomposition_mass loss_litter decomposition |
52 | co2 - pco2 - emissions - carbon - ch4 | 39 | 52_co2_pco2_emissions_carbon |
53 | soc - carbon - wetland - wetlands - soil | 39 | 53_soc_carbon_wetland_wetlands |
54 | countries - emissions - emission - to - climate | 38 | 54_countries_emissions_emission_to |
55 | services - ecosystem - ecosystem services - es - pes | 37 | 55_services_ecosystem_ecosystem services_es |
56 | catalyst - peat - pyrolysis - char - catalysts | 37 | 56_catalyst_peat_pyrolysis_char |
57 | clearfelling - water - phosphorus - buffer - nutrient | 35 | 57_clearfelling_water_phosphorus_buffer |
58 | forest - forests - trees - tree - stands | 35 | 58_forest_forests_trees_tree |
59 | carbon - climate - atmosphere - earth - carbon cycle | 34 | 59_carbon_climate_atmosphere_earth |
60 | tephra - volcanic - cryptotephra - eruptions - tephras | 34 | 60_tephra_volcanic_cryptotephra_eruptions |
61 | testate - arcellinida - coi - species - amoebae | 34 | 61_testate_arcellinida_coi_species |
62 | methane - methanogenic - community - methanogen - methanogens | 34 | 62_methane_methanogenic_community_methanogen |
63 | consolidation - soil - embankment - road - the | 33 | 63_consolidation_soil_embankment_road |
64 | species - spider - bogs - spiders - habitat | 33 | 64_species_spider_bogs_spiders |
65 | evaporation - energy - model - was - the | 33 | 65_evaporation_energy_model_was |
66 | phosphorus - catchment - in - tp - concentrations | 33 | 66_phosphorus_catchment_in_tp |
67 | co2 - ch4 - marsh - wetland - emissions | 33 | 67_co2_ch4_marsh_wetland |
68 | runoff - peat - channels - flow - catchment | 33 | 68_runoff_peat_channels_flow |
69 | nutrient - nitrogen - fertilizer - litter - of | 32 | 69_nutrient_nitrogen_fertilizer_litter |
70 | brazil - bp - the - of - in the | 31 | 70_brazil_bp_the_of |
71 | tsunami - holocene - the - volcanic - deposits | 30 | 71_tsunami_holocene_the_volcanic |
72 | climate change - change - climate - biodiversity - ecosystem | 30 | 72_climate change_change_climate_biodiversity |
73 | gpr - resistivity - radar - penetrating - penetrating radar | 29 | 73_gpr_resistivity_radar_penetrating |
74 | holocene - the - andes - and - bp | 29 | 74_holocene_the_andes_and |
75 | permafrost - soc - soil - soils - arctic | 28 | 75_permafrost_soc_soil_soils |
76 | policy - forest - owners - arguments - forest owners | 28 | 76_policy_forest_owners_arguments |
77 | bog - poland - peatland - europe - ca | 28 | 77_bog_poland_peatland_europe |
78 | ch4 - oxidation - methane - paddy - aom | 28 | 78_ch4_oxidation_methane_paddy |
79 | enzyme - enzymes - eea - soil - activities | 28 | 79_enzyme_enzymes_eea_soil |
80 | channel - catchment - flow - bends - model | 28 | 80_channel_catchment_flow_bends |
81 | soil - soil science - science - of soil - eu | 27 | 81_soil_soil science_science_of soil |
82 | pahs - pah - polycyclic aromatic - polycyclic - aromatic | 27 | 82_pahs_pah_polycyclic aromatic_polycyclic |
83 | n2o - n2o emissions - emissions - emission - nitrous | 26 | 83_n2o_n2o emissions_emissions_emission |
84 | peat water - adsorption - electrocoagulation - brackish peat - brackish peat water | 26 | 84_peat water_adsorption_electrocoagulation_brackish peat |
85 | mangrove - mangroves - carbon - coastal - b2 | 26 | 85_mangrove_mangroves_carbon_coastal |
86 | species - retention - alien - richness - forests | 25 | 86_species_retention_alien_richness |
87 | colloidal - river - elements - fe - colloids | 25 | 87_colloidal_river_elements_fe |
88 | sulfate - sulfur - 34s - peat - sulphur | 24 | 88_sulfate_sulfur_34s_peat |
89 | caribou - habitat - woodland caribou - populations - wolf | 24 | 89_caribou_habitat_woodland caribou_populations |
90 | food - agriculture - food system - change - covid 19 | 24 | 90_food_agriculture_food system_change |
91 | microbial - community - microbial community - communities - bacterial | 23 | 91_microbial_community_microbial community_communities |
92 | sorption - cu - ions - ii - cu ii | 22 | 92_sorption_cu_ions_ii |
93 | fire - fires - algorithm - frp - hotspot | 22 | 93_fire_fires_algorithm_frp |
94 | choice - wtp - preferences - valuation - choice experiment | 22 | 94_choice_wtp_preferences_valuation |
95 | nematodes - earthworm - soil - food - nematode | 22 | 95_nematodes_earthworm_soil_food |
96 | conservation - orangutan - habitat - forest - species | 21 | 96_conservation_orangutan_habitat_forest |
97 | cushion - accumulation - peat - amazonian - vegetation | 21 | 97_cushion_accumulation_peat_amazonian |
98 | ch4 - oxidation - ch4 oxidation - uptake - ch4 uptake | 20 | 98_ch4_oxidation_ch4 oxidation_uptake |
99 | tidal - sediment - coastal - delta - the | 20 | 99_tidal_sediment_coastal_delta |
100 | emissions - co2 - ghg - n2o - table | 20 | 100_emissions_co2_ghg_n2o |
101 | methane - ph - cytochrome - methanotrophs - acetic acid | 20 | 101_methane_ph_cytochrome_methanotrophs |
102 | patterns - model - self organization - evolutionary - self | 20 | 102_patterns_model_self organization_evolutionary |
103 | nitrogen - denitrification - n2o - soil - n2 | 20 | 103_nitrogen_denitrification_n2o_soil |
104 | birch - rotation - biomass - buds - biomass production | 19 | 104_birch_rotation_biomass_buds |
105 | fire - wildfire - fires - wildfires - health | 19 | 105_fire_wildfire_fires_wildfires |
106 | grazing - heathland - heather - moorland - england | 19 | 106_grazing_heathland_heather_moorland |
107 | emissions - fire - burning - fire emissions - biomass burning | 19 | 107_emissions_fire_burning_fire emissions |
108 | peat - landslides - failure - of peat - peat compaction | 18 | 108_peat_landslides_failure_of peat |
109 | biochar - straw - soil - fe - bc | 18 | 109_biochar_straw_soil_fe |
110 | ecosystem - respiration - carbon - ecosystem respiration - meadow | 17 | 110_ecosystem_respiration_carbon_ecosystem respiration |
111 | wetland - wetlands - risk - of wetland - the wetland | 17 | 111_wetland_wetlands_risk_of wetland |
112 | dom - thm - groundwater - molecular - organic | 17 | 112_dom_thm_groundwater_molecular |
113 | geochemistry - landscape geochemistry - rocks - peat - mafic | 17 | 113_geochemistry_landscape geochemistry_rocks_peat |
114 | tundra - ch4 - n2o - fluxes - antarctic | 16 | 114_tundra_ch4_n2o_fluxes |
115 | cellulose - sphagnum - isotopic - isotope - δ18ocel | 16 | 115_cellulose_sphagnum_isotopic_isotope |
116 | solute - transport - chloride - peat - pore | 16 | 116_solute_transport_chloride_peat |
117 | charcoal - fire - fires - holocene - fire history | 15 | 117_charcoal_fire_fires_holocene |
118 | ghg - agricultural - dairy - abatement - emissions | 15 | 118_ghg_agricultural_dairy_abatement |
119 | palm - oil - palm oil - sustainability - industry | 15 | 119_palm_oil_palm oil_sustainability |
120 | humic - humic substances - substances - acids - fluorescence | 15 | 120_humic_humic substances_substances_acids |
121 | canopy - ndvi - pri - lue - phenological | 15 | 121_canopy_ndvi_pri_lue |
122 | pollen - bog - peat - the - human impact | 15 | 122_pollen_bog_peat_the |
123 | marshes - tidal - marshes are - salt - or | 15 | 123_marshes_tidal_marshes are_salt |
124 | soil - prediction - mapping - covariates - dsm | 15 | 124_soil_prediction_mapping_covariates |
125 | si - of si - silicon - biogenic - protozoic | 14 | 125_si_of si_silicon_biogenic |
126 | et - evapotranspiration - le - wetland - rice | 14 | 126_et_evapotranspiration_le_wetland |
127 | forest - finland - forests - stock - management | 14 | 127_forest_finland_forests_stock |
128 | iodine - 129i - sorption - iodide - the sorption | 14 | 128_iodine_129i_sorption_iodide |
129 | palm - oil - palm oil - smallholders - certification | 14 | 129_palm_oil_palm oil_smallholders |
130 | dndc - model - models - soil - carbon | 14 | 130_dndc_model_models_soil |
131 | snow - thaw - cover - sca - data | 14 | 131_snow_thaw_cover_sca |
132 | stx2 - microbiota - gut - gut microbiota - microbial | 13 | 132_stx2_microbiota_gut_gut microbiota |
133 | dom - doc - organic - dissolved organic - of dom | 13 | 133_dom_doc_organic_dissolved organic |
134 | forest - cbm - ontario - cfs3 - cbm cfs3 | 13 | 134_forest_cbm_ontario_cfs3 |
135 | wind - wind farms - farms - onshore - onshore wind | 13 | 135_wind_wind farms_farms_onshore |
136 | uranium - of uranium - 232th - th - ar | 13 | 136_uranium_of uranium_232th_th |
137 | groundwater - springs - spring - gdes - discharge | 13 | 137_groundwater_springs_spring_gdes |
138 | fire - forest - boreal - burned - fires | 13 | 138_fire_forest_boreal_burned |
139 | metal - metals - cd - sediments - zn | 13 | 139_metal_metals_cd_sediments |
140 | slr - sea level - coastal - sea - sea level rise | 13 | 140_slr_sea level_coastal_sea |
141 | damo - methane - anaerobic - oxidation - aom | 12 | 141_damo_methane_anaerobic_oxidation |
142 | temperature - microbial - soil - co2 - pd | 12 | 142_temperature_microbial_soil_co2 |
143 | soil - respiration - root - soil respiration - enchytraeid | 12 | 143_soil_respiration_root_soil respiration |
144 | kerp - fusiformisporites - permian - genus - flora | 11 | 144_kerp_fusiformisporites_permian_genus |
145 | dust - dust deposition - dust sources - deposition - atmospheric dust | 11 | 145_dust_dust deposition_dust sources_deposition |
146 | methane - sources - ch4 - les - de | 11 | 146_methane_sources_ch4_les |
147 | n2o - n2o emissions - emissions - permafrost - n2o fluxes | 11 | 147_n2o_n2o emissions_emissions_permafrost |
148 | australia - mis - record - ka - crater | 11 | 148_australia_mis_record_ka |
149 | oc - fjords - fjord - lakes - of oc | 10 | 149_oc_fjords_fjord_lakes |
150 | fe - reduction - fe iii - sr10 - iron | 10 | 150_fe_reduction_fe iii_sr10 |
151 | loading - eutrophication - nitrogen - coastal - phytoplankton | 10 | 151_loading_eutrophication_nitrogen_coastal |
152 | model - wetlands - groundwater - water - the wetlands | 10 | 152_model_wetlands_groundwater_water |
153 | co2 - soil - co2 efflux - soil co2 efflux - soil co2 | 10 | 153_co2_soil_co2 efflux_soil co2 efflux |
154 | transfer - transfer functions - transfer function - testate - functions | 10 | 154_transfer_transfer functions_transfer function_testate |
155 | peat - spain - bog - matter - autofluorescent | 10 | 155_peat_spain_bog_matter |
156 | isbas - insar - subsidence - motion - deformation | 10 | 156_isbas_insar_subsidence_motion |
Training hyperparameters
- calculate_probabilities: False
- language: None
- low_memory: False
- min_topic_size: 10
- n_gram_range: (1, 3)
- nr_topics: None
- seed_topic_list: None
- top_n_words: 30
- verbose: False
Framework versions
- Numpy: 1.22.4
- HDBSCAN: 0.8.29
- UMAP: 0.5.3
- Pandas: 1.5.3
- Scikit-Learn: 1.2.2
- Sentence-transformers: 2.2.2
- Transformers: 4.30.2
- Numba: 0.56.4
- Plotly: 5.13.1
- Python: 3.10.12
- Downloads last month
- 7
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.