MARTINI_enrich_BERTopic_orlaredchan

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_orlaredchan")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 30
  • Number of training documents: 4472
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 migrants - gardai - protest - covid - ngos 21 -1_migrants_gardai_protest_covid
0 illegals - refugee - irishtimes - nigeria - ukrainians 2925 0_illegals_refugee_irishtimes_nigeria
1 irishmen - ulster - nationalist - cailleach - surrender 164 1_irishmen_ulster_nationalist_cailleach
2 abduction - kilkenny - daughter - chased - garda 114 2_abduction_kilkenny_daughter_chased
3 raped - accused - convictions - jailed - pakistan 99 3_raped_accused_convictions_jailed
4 blanchardstown - drogheda - gangs - carraigaline - stabbed 98 4_blanchardstown_drogheda_gangs_carraigaline
5 africans - slavery - irishness - shelbourne - majekodunmi 89 5_africans_slavery_irishness_shelbourne
6 fraudsters - scammed - nigeria - laundering - suspect 88 6_fraudsters_scammed_nigeria_laundering
7 kildarenow - finglas - accommodation - protested - blessington 73 7_kildarenow_finglas_accommodation_protested
8 nationalities - census - scotland - percentage - majority 68 8_nationalities_census_scotland_percentage
9 censorship - criminalise - legislation - bunreacht - mcguirk 65 9_censorship_criminalise_legislation_bunreacht
10 quarantine - superspreading - tanaiste - airports - britain 57 10_quarantine_superspreading_tanaiste_airports
11 migrants - european - lukashenko - poland - merkel 51 11_migrants_european_lukashenko_poland
12 oats - silage - farmers - corrib - imported 48 12_oats_silage_farmers_corrib
13 transgenderism - pedophiles - heteronormativity - nhs - teachers 46 13_transgenderism_pedophiles_heteronormativity_nhs
14 vaccine - pfizer - gsk - snopes - waterford 46 14_vaccine_pfizer_gsk_snopes
15 france - zemmour - aoussaoui - islamist - rioting 42 15_france_zemmour_aoussaoui_islamist
16 murder - tullamore - jozef - ashling - jurors 40 16_murder_tullamore_jozef_ashling
17 irelandisfull - rents - migrants - ronan - soros 36 17_irelandisfull_rents_migrants_ronan
18 stabbings - gaelscoil - suspected - algerian - parnell 33 18_stabbings_gaelscoil_suspected_algerian
19 freeyoutubeapp - telegram - bigtaboochannel - uploaded - subscribers 32 19_freeyoutubeapp_telegram_bigtaboochannel_uploaded
20 paedophile - tatchell - lgbt - cunnilingus - advocated 31 20_paedophile_tatchell_lgbt_cunnilingus
21 volunteersforukraine - zelensky - putin - libya - nazification 30 21_volunteersforukraine_zelensky_putin_libya
22 asylum - kinnegad - committee - screened - hotels 30 22_asylum_kinnegad_committee_screened
23 protestors - ballsbridge - roadblocks - constituency - bohemians 28 23_protestors_ballsbridge_roadblocks_constituency
24 antifa - protesters - kilraine - idiots - corcoran 25 24_antifa_protesters_kilraine_idiots
25 birthright - deported - passport - niall - amendment 25 25_birthright_deported_passport_niall
26 shitlibs - retweeters - donegaldaily - hashtags - campaigned 24 26_shitlibs_retweeters_donegaldaily_hashtags
27 gypsies - slovakia - begging - robbing - discrimination 22 27_gypsies_slovakia_begging_robbing
28 mosques - clonskeagh - impic_ie - qadri - extremists 22 28_mosques_clonskeagh_impic_ie_qadri

Training hyperparameters

  • calculate_probabilities: True
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: None
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: False
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None

Framework versions

  • Numpy: 1.26.4
  • HDBSCAN: 0.8.40
  • UMAP: 0.5.7
  • Pandas: 2.2.3
  • Scikit-Learn: 1.5.2
  • Sentence-transformers: 3.3.1
  • Transformers: 4.46.3
  • Numba: 0.60.0
  • Plotly: 5.24.1
  • Python: 3.10.12
Downloads last month
5
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.