MARTINI_enrich_BERTopic_jordansather

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_jordansather")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 40
  • Number of training documents: 5580
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 disinformation - trump - influencers - youtube - flynn 20 -1_disinformation_trump_influencers_youtube
0 disinformation - influencers - awakening - believe - grifters 3047 0_disinformation_influencers_awakening_believe
1 bitchute - gabtv - livestream - replays - embed 310 1_bitchute_gabtv_livestream_replays
2 vaxxers - vaccinated - pfizer - injections - mrna 201 2_vaxxers_vaccinated_pfizer_injections
3 ufo - pentagon - disclosure - drones - gatekeepers 176 3_ufo_pentagon_disclosure_drones
4 ghislaine - conspirators - lawsuit - latham - dismissed 150 4_ghislaine_conspirators_lawsuit_latham
5 qanon - shills - repost - sources - normies 139 5_qanon_shills_repost_sources
6 hypochlorite - hydroxychloroquine - dioxido - ivermectin - cl02 115 6_hypochlorite_hydroxychloroquine_dioxido_ivermectin
7 ballots - maricopa - rigged - statewide - republican 91 7_ballots_maricopa_rigged_statewide
8 qanonjohn - flynn - hypocritical - persecution - whistleblower 74 8_qanonjohn_flynn_hypocritical_persecution
9 fbi - antifa - capitol - alleged - jan 71 9_fbi_antifa_capitol_alleged
10 disinfo - qult_headquarters - nesara - scammy - weirdo 69 10_disinfo_qult_headquarters_nesara_scammy
11 fraudlewski - godlewki - liars - gregg - suing 66 11_fraudlewski_godlewki_liars_gregg
12 fednow - coinbase - banks - robinhood - deepfuckingvalue 62 12_fednow_coinbase_banks_robinhood
13 russia - nordstream - zelensky - globalists - invade 59 13_russia_nordstream_zelensky_globalists
14 scampeachment - donald - indicted - newsmax - courthouse 57 14_scampeachment_donald_indicted_newsmax
15 biden - wikileaks - laptop - joe - emails 55 15_biden_wikileaks_laptop_joe
16 truthsocial - newsom - launched - trolled - bots 54 16_truthsocial_newsom_launched_trolled
17 biden - kamala - newsom - putin - helluva 50 17_biden_kamala_newsom_putin
18 sunspots - auroras - geomagnetic - earthquakes - satellites 49 18_sunspots_auroras_geomagnetic_earthquakes
19 vaccines - remdesivir - graphene - poison - sinopeg 49 19_vaccines_remdesivir_graphene_poison
20 comey - mueller - dossier - indictments - sussman 49 20_comey_mueller_dossier_indictments
21 wuhan - coronaviruses - darpa - leaked - redfield 43 21_wuhan_coronaviruses_darpa_leaked
22 energy - geoengineering - tesla - turbines - decentralized 41 22_energy_geoengineering_tesla_turbines
23 twitter - musk - takeover - unban - shareholder 39 23_twitter_musk_takeover_unban
24 livestream - today - foxhole - chillin - 5pm 36 24_livestream_today_foxhole_chillin
25 fake - shillbots - qanonofficial - telegram - ladymelaniatrump 35 25_fake_shillbots_qanonofficial_telegram
26 johnmcafee - juan - larpdar - legit - griftin 34 26_johnmcafee_juan_larpdar_legit
27 charlies - simon - parkes - spoof - cloned 33 27_charlies_simon_parkes_spoof
28 shootings - pistol - texas - lunatics - manifesto 33 28_shootings_pistol_texas_lunatics
29 gmo - aspartame - pasteurized - vitamins - toxins 32 29_gmo_aspartame_pasteurized_vitamins
30 collagen - cordyceps - supplements - theanine - adaptogenic 32 30_collagen_cordyceps_supplements_theanine
31 ballots - illegitimately - smartmatic - senators - sheriff 30 31_ballots_illegitimately_smartmatic_senators
32 qanonjohn - flynn - soldiers - tyranny - disseminating 30 32_qanonjohn_flynn_soldiers_tyranny
33 hamas - israelis - bombed - terrorizers - blaming 28 33_hamas_israelis_bombed_terrorizers
34 livestreams - monday - tonight - hi - starlink 28 34_livestreams_monday_tonight_hi
35 airspace - montana - drones - missiles - fairchild 26 35_airspace_montana_drones_missiles
36 coinscammers - trumpcoinannouncements - scammer - bots - promoting 23 36_coinscammers_trumpcoinannouncements_scammer_bots
37 twitter - suspended - majorpatriot - unbanning - updated 23 37_twitter_suspended_majorpatriot_unbanning
38 whatevergender - transgenders - feminized - boobs - onlyfans 21 38_whatevergender_transgenders_feminized_boobs

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
Downloads last month
5
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.