Model Card for mlpf-cms-2024_05_16_attn_model21M
This model reconstructs particles in a detector, based on the tracks and calorimeter clusters recorded by the detector.
Model Details
Model Description
- Developed by: Joosep Pata, Eric Wulff, Farouk Mokhtar, Mengke Zhang, David Southwick, Maria Girone, David Southwick, Javier Duarte
- Model type: transformer with 2x6 layers, 32 heads, head dim 16
- License: Apache License
Model Sources
Uses
Direct Use
This model may be used to study the physics and computational performance on ML-based reconstruction in simulation in the CMS collaboration.
Out-of-Scope Use
This model is not intended for physics measurements on real data our use outside the CMS collaboration.
Bias, Risks, and Limitations
The model has only been trained on simulation data and has not been validated against real data. It's only meant for internal CMS use.
Training Details
Approximately 2 weeks on 1x A100 80GB.
https://www.comet.com/jpata/particleflow-pt/9ceb52e8f9f54d7eb4ef06c9ff85bef2?compareXAxis=step&experiment-tab=panels&showOutliers=true&smoothing=0&xAxis=epoch
https://www.comet.com/jpata/particleflow-pt/35bb92e72a3846ff98fb563b0769be13?compareXAxis=step&experiment-tab=panels&showOutliers=true&smoothing=0&xAxis=epoch0
Training Data
CMS ttbar, QCD, Ztautau with pileup, v1.7.1, 400k events each.
134G /eos/user/j/jpata/mlpf/tensorflow_datasets/cms/cms_pf_qcd/1.7.1
135G /eos/user/j/jpata/mlpf/tensorflow_datasets/cms/cms_pf_ttbar/1.7.1
130G /eos/user/j/jpata/mlpf/tensorflow_datasets/cms/cms_pf_ztt/1.7.1
Training Procedure
#!/bin/bash
#SBATCH --partition gpu
#SBATCH --gres gpu:a100:1
#SBATCH --mem-per-gpu 80G
IMG=/home/software/singularity/pytorch.simg:2024-04-30
cd ~/particleflow
git checkout 8bd199fb064bb40558466d906d46498218848e5c
singularity exec --nv \
--env PYTHONPATH=hep_tfds \
--env KERAS_BACKEND=torch \
$IMG python3.10 mlpf/pyg_pipeline.py --dataset cms --gpus 1 \
--data-dir /path/to/tensorflow_datasets --config parameters/pytorch/pyg-cms.yaml \
--train --conv-type attention --num-epochs 100 --gpu-batch-multiplier 40 --num-workers 4 --prefetch-factor 50 --checkpoint-freq 1 --comet
Evaluation
WEIGHTS=pyg-cms_20240430_094836_751206/checkpoints/checkpoint-25-17.631161.pth
singularity exec -B /scratch/persistent --nv \
--env PYTHONPATH=hep_tfds \
--env KERAS_BACKEND=torch \
$IMG python3.10 mlpf/pyg_pipeline.py --dataset cms --gpus 1 \
--data-dir /path/to/joosep/tensorflow_datasets --config parameters/pytorch/pyg-cms.yaml \
--test --make-plots --conv-type attention --gpu-batch-multiplier 10 --num-workers 8 --prefetch-factor 10 --load $WEIGHTS --test-datasets cms_pf_ttbar --ntest 50000
Model Card Contact
Joosep Pata, [email protected]