# MatchingPubs ## Dataset The `dataset/` directory contains the following main classes: - `DatasetLoader`: Responsible for loading the dataset from various sources. - `DatasetProcessor`: Handles preprocessing and cleaning of the dataset. - `DatasetAnalyzer`: Provides methods for analyzing and summarizing the dataset. ## Getting the Dataset To get the dataset, run the following command: ```bash PYTHONPATH=$(pwd) python src/dataset/get_dataset.py